Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

It's a language model. It models language not knowledge.


You might find this interesting:

"Do Large Language Models learn world models or just surface statistics?"

https://thegradient.pub/othello/


I have zero doubt that transformers can construct sensible models of what they are taught.

My concern about LLM is that there's so little of actual knowledge in human language that it's easily drowned in the rest of the human language and neural network trained on not strictly restricted set of human language has very little chance of modelling knowledge.

In case of Othello game if you teach the neural network to predict all moves you get NN to learn how to play legal moves, not necessarily winning moves.

You'd have to train NN on only the moves of the winning side. Or even create some negative training data and method from the moves of loosing side to have any hopes of creating NN that plays Othello well.

Same should be true for LLMs. To have any hope of getting them to model knowledge you'd have to curate input to strictly represent knowledge and perhaps develop a negative reinforcement training method and feed it with all the language that doesn't represent truth.


great article. intruiging, exciting and a little frightening


Brilliant work.


So it cannot be a reliable search engine due to it hallucinating factual errors nor is it trustworthy to be one.

Just evidently another overhyped solution attempting to solving the search problem in a worse fashion, creating more problems once again.


And what is knowledge? It could very well be that our minds are themselves fancy autocompletes.


Knowledge is doing, language is communicating about it. Think about it this way:

Ask the bot for a cooking recipe. Knowledge would be a cook who has cooked the recipe, evaluated/tasted the result. Then communicated it to you. The bot gives you at best a recording of the cook's communication, at worst a generative modification of a combination of such communications, but skipping the cooking and evaluating part.


So why would anyone want a search engine to model language instead of knowledge?


To gain market share, why else do anything?


That's a great way to put it!




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: