Great paper! There are some similar ideas to this in game theory and reinforceme... | Hacker News

Hacker Newsnew | past | comments | ask | show | jobs | submit

		programjames on June 22, 2024 \| parent \| context \| favorite \| on: Software design gets worse before it gets better Great paper! There are some similar ideas to this in game theory and reinforcement learning (RL): [1]: Thermodynamic Game Theory: https://adamilab.msu.edu/wp-content/uploads/AdamiHintze2018.... [2]: piKL - KL-regularized RL: https://arxiv.org/abs/2112.07544 [3]: Soft-Actor Critic - Entropy-regularized RL: https://arxiv.org/abs/1801.01290 [4]: "Soft" (Boltzmann) Q-learning = Entropy-regularized policy gradients: https://arxiv.org/abs/1704.06440

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact