Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

Two reasons. First, someone else will release something similar. Second, I didn’t see a related push from them to work with other in the industry to do something productive towards safety with the time they got by delaying availability of these kinds of models. So it felt disingenuous.


Several groups already have. Facebook's OPT-175B is available to basically anyone with a .edu address (models up to 66B are freely available) and Bloom-176B is 100% open:

https://github.com/facebookresearch/metaseq

https://huggingface.co/bigscience/bloom


Yup. I meant when it had just come out.




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: