Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

If you described Speculative Decoding, 3bit Quantization, and Adapter based weight selection all two years ago then your AI Lab must be so sota /s


I am curious, what do you mean by "Adapter based weight selection"?


That's apparently Apple's term for LoRA[1].

[1] https://arxiv.org/abs/2106.09685


Sort of but Adapters allow for multiple weight adjustments (think loras) for specific skills so it is more like extra optimized mixture of experts or multi agent approach. They have a slide with adapters listed like summarization, prioritization, tone (happy, business, etc), editor, etc) -- this is not to be mixed up with Intents which is how on device apps publish their capabilities to the Intelligence system for real npu os level multi agent tool use.




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: