Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

> DeepSeek-R1 has been making waves recently by rivaling OpenAI's O1 reasoning model while being fully open-source.

Do we finally have a model with access to the training architecture and training data set, or are we still calling non-reproducible binary blobs without source form open-source?




It sounds like if they owe you the training architecture and training data set.


It absolutely doesn't. It sounds like further diluting the term "open-source" isn't great.


I assume when people say "open source model" they mean "open weights model". The "open source" term doesn't really make sense here, since machine learning models are not compilations of source code. (Though DeepSeek has published several papers with details on their training process. It's more than just open weights.)


ML models do have a "source" though


If ML models have a source, brains have a source.

Brains don't have a source.

Therefore, ML models don't have a source.




Consider applying for YC's Fall 2025 batch! Applications are open till Aug 4

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: