Reinforcement Learning in less than 400 lines of C

taeric · 2025-03-10T17:48:24 1741628904

I'm assuming point two about "no knowledge of the game is put into the program" is more that none is put into the neural network? Or am I misunderstanding what is meant there? Seems the scoring function is clearly how you score the game?

antirez · 2025-03-10T17:50:26 1741629026

This statement refers to the fact that, there are no rules like "you should put a O to block X.X" and things like that. Inside the program there is just the rule to understand if the game ended and who won. In the NN there is nothing at all of course, it's just random weights at start.

taeric · 2025-03-10T18:16:51 1741630611

Ah, that makes sense. Still having fun reading over the code. Thanks for sharing!

antirez · 2025-03-10T18:18:35 1741630715

Thanks for reading!