Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

Is there any good quick summary of what's special about DeepSeek? I know it's OSS and incredibly efficient, but news laymen are saying it's trained purely on AI info instead of using a corpus of tagged data... which, I assume, means it's somehow extracting weights or metadata or something from other AIs. Is that it?


  Is there any good quick summary of what's special about DeepSeek?
Yes, section 2.3 of the Deepseek R1 paper summarizes the training part you're asking about, in less than a page.

https://github.com/deepseek-ai/DeepSeek-R1/blob/main/DeepSee...




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: