Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

That's got nothing to do with it. It's all about copyright. Can it reproduce its training data verbatim? If so, Meta is in hot water.


But if it's corpora do NOT include the Harry Potter books then Meta is NOT in hot water,! So take the Harry Potter books out of the corpora. What is lost? Nothing IMO useful other than the ability to discuss Harry Potter books. BFD.


I read harry potter, and you ask me about a page, and I can recite it verbatim, did I just commit copyright infringement?


Are you selling your ability to recite stuff? Then certainly.


there are plenty of open source LLMs trained on harry potter, is that fine?


No


I pay for a service. The service recites a novel to me. The service would need permission to do this or it is copyright infringement.


This is an extremely common strawman argument. We're not discussing human memory.




Consider applying for YC's Fall 2025 batch! Applications are open till Aug 4

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: