Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

I think you may think too highly of academic papers or more so that they oft still only have 1% in there.


I think you're missing the point. This is my own paper and these are my own new concepts. It doesn't matter if the definition of the new concepts are only 1% of the paper, the point is they are the concepts I'm asking the LLM to use, and are not in its training data.


How would one prove the premise that a concept is not present in the training data?

With how much data is being shoveled in there, our default assumption should be that significant components are present.


That would be a weird default assumption. It's not hard to come up with new ideas. In fact, it's trivial.

And if you want to know if a specific concept is known by the LLM, you can literally ask it. It generally does a great job of telling you what it is and is not familiar with.




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: