
This is my experience, too. As a concrete example, I'll often need to write a mapper function to convert between a protobuf type and a Go type. The types are mirror images of each other, and I feed the complete APIs of both into my prompt.

I've yet to find an LLM that can reliably generate mapping code between proto.Foo{ID string} and gomodel.Foo{ID string}.
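
Roughly, the kind of output I mean, with the placeholder field names from above (the real structs have many more fields):

    // Assumes mirrored types in packages proto and gomodel, as above.
    func FooFromProto(p *proto.Foo) *gomodel.Foo {
        if p == nil {
            return nil
        }
        return &gomodel.Foo{ID: p.ID}
    }

    func FooToProto(m *gomodel.Foo) *proto.Foo {
        if m == nil {
            return nil
        }
        return &proto.Foo{ID: m.ID}
    }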

It still saves me time, because even 50% accuracy is still half the code I don't have to write myself.

But it makes me feel like I'm taking crazy pills whenever I read the AI hype. I'm open to the idea that I'm prompting wrong, that I need a better workflow, etc. But I'm not a Luddite; I've "reached up and put in the work" and am always trying to learn new tools.




An LLM's ability to do a task is roughly correlated with the number of times that task has been done on the internet before. If you want to see the hype version, you need to write a todo web app in TypeScript or similar. So it's probably not something you can fix with prompts, but a model trained with more focus on the relevant data might help.


These days, they'll sometimes also do RL on a task if the outputs are easy to validate and it seems worth the effort.


This honestly seems like something that could be better handled with pre-LLM technology, like a 15-line Perl script that reads one on stdin, applies some crufty regexes, and writes the other to stdout. Are there complexities I'm not seeing?
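
Something like this, say (in Go rather than Perl, since that's what the output is; a toy sketch that assumes one exported field per line and the proto/gomodel package names from upthread):

    // Toy generator: reads a Go struct definition on stdin and prints the
    // proto -> gomodel mapper for it. Assumes one exported "Name Type"
    // field per line; the reverse direction would be symmetric.
    package main

    import (
        "bufio"
        "fmt"
        "os"
        "regexp"
    )

    func main() {
        typeRe := regexp.MustCompile(`^type\s+(\w+)\s+struct`)
        fieldRe := regexp.MustCompile(`^\s*([A-Z]\w*)\s+\S+`)

        name := "Foo"
        var fields []string
        sc := bufio.NewScanner(os.Stdin)
        for sc.Scan() {
            line := sc.Text()
            if m := typeRe.FindStringSubmatch(line); m != nil {
                name = m[1]
                continue
            }
            if m := fieldRe.FindStringSubmatch(line); m != nil {
                fields = append(fields, m[1])
            }
        }

        fmt.Printf("func %sFromProto(p *proto.%s) *gomodel.%s {\n", name, name, name)
        fmt.Println("\tif p == nil {\n\t\treturn nil\n\t}")
        fmt.Printf("\treturn &gomodel.%s{\n", name)
        for _, f := range fields {
            fmt.Printf("\t\t%s: p.%s,\n", f, f)
        }
        fmt.Println("\t}\n}")
    }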



