Oh man. That reminds me of something I've been testing recently with DeepSeek and Gemini via Roo/Cline.
I pretty much only use them for mundane tasks, like 'here's 12 JSON files, add this field to each.' Boring, right?
They are both so slow. They 'think' way too much before every single edit. Gemini is a little faster to start but keeps returning 429s (rate limiting), so it ends up being slower. It would also reorder some keys in the JSON for no apparent reason, but who cares.
In the end, I realized I could probably have done it myself in 1/3 the time it took them.
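
For comparison, the 'add this field to each JSON file' chore is small enough to script by hand. Here is a rough sketch, not a definitive tool. It assumes every file holds a top-level JSON object, and the names `add_field` and `add_field_to_all`, the `*.json` glob, and the 2-space indent are all made up for illustration. Existing keys stay in their original order, since Python dicts preserve insertion order.

```python
import json
from pathlib import Path


def add_field(path: Path, key: str, value) -> None:
    """Add key=value to the top-level object of one JSON file, in place."""
    # Assumption: the file contains a JSON object (not a list or scalar).
    data = json.loads(path.read_text(encoding="utf-8"))
    # setdefault appends the key at the end only if it is missing,
    # so existing keys keep both their order and their values.
    data.setdefault(key, value)
    # Assumption: 2-space indentation is acceptable; the original
    # whitespace layout of the file is not preserved.
    text = json.dumps(data, indent=2, ensure_ascii=False) + "\n"
    path.write_text(text, encoding="utf-8")


def add_field_to_all(directory: Path, key: str, value) -> int:
    """Apply add_field to every *.json file in directory; return the count."""
    files = sorted(directory.glob("*.json"))
    for path in files:
        add_field(path, key, value)
    return len(files)
```

Something like `add_field_to_all(Path("configs"), "version", 3)` would cover the whole batch, where `configs` is a hypothetical directory name. Worth running on a copy first, since it rewrites the files in place.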
- me: how can I do X?
- llm: do this
- me: doesn't fully work
- llm: refactoring to make it more robust ...
- me: still doesn't fully work
- llm: refactoring ...
- me: now it's worse than before
- llm: refactoring ...
- me: better but now there's this other regression
- llm: refactoring ...
- me: we're back to the first issue again
- (eventually ... me: forget it, I could have done it myself by now)