Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

Reflexive Reinforcement Learning would be better use of words.

As a side note there are tons of different sources of data, ie. for programming languages you can simply run code through interpreter/compiler and possibly run the code itself to see its output as data for reinforcement learning.

The main point is removing humans from the equation.



Consider applying for YC's Winter 2026 batch! Applications are open till Nov 10

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: