Isn't this Apache licensed? Regardless, you can run multiple models concurrently on the same input using well-known ensemble techniques. (Not to be confused with mixture-of-experts, which trains a single model in which a router activates only a few expert blocks for any given input - a form of sparsity.)
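For concreteness, here's a minimal sketch of one such technique, probability averaging, in PyTorch; the toy nn.Linear models are hypothetical stand-ins for whatever checkpoints you'd actually load:

```python
import torch
import torch.nn as nn
from concurrent.futures import ThreadPoolExecutor

# Placeholder models - in practice these would be separately
# trained checkpoints loaded from disk.
models = [nn.Linear(16, 4) for _ in range(3)]
for m in models:
    m.eval()

x = torch.randn(1, 16)  # one shared input

@torch.no_grad()
def run(model):
    return model(x)

# Run all models concurrently on the same input...
with ThreadPoolExecutor() as pool:
    logits = list(pool.map(run, models))

# ...then combine by averaging the predicted distributions.
probs = torch.stack([torch.softmax(l, dim=-1) for l in logits]).mean(dim=0)
print(probs)  # the ensemble's prediction
```

You could just as well average raw logits or take a majority vote over argmaxes; the point is that the models are independent and all of them run on every input, unlike MoE's sparse routing.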