This seems like the equivalent of a university designing an ICE car...
What does anyone get out of this when we have open weight models already ?
Are they going to do very innovative AI research that companies wouldn't dare try/fund? Seems unlikely ..
Is it a moonshot huge project that no single company could fund..? Not that either
If it's just a little fun to train the next generation of LLM researchers.. Then you might as well just make a small scale toy instead of using up a super computer center
Including how it was trained, what data was used, how training data was synthesized, how other models were used etc. All the stuff that is kept secret in case of llama, deepseek etc.
Super computers are being used daily for much toy-ier codes in research, be glad this at least interests the public and constitutes a foray of academia into new areas.
What does anyone get out of this when we have open weight models already ?
Are they going to do very innovative AI research that companies wouldn't dare try/fund? Seems unlikely ..
Is it a moonshot huge project that no single company could fund..? Not that either
If it's just a little fun to train the next generation of LLM researchers.. Then you might as well just make a small scale toy instead of using up a super computer center