I believe we were discussing the former, not the latter? I agree that for lots of problem-solving tasks it can be hit or miss - in my experience, all the models are quite bad at writing decent frontend code when it comes to getting the rendered page to look the way you want.
What you're describing is more about reasoning abilities - that's not really what the article was about, or the kind of problem its techniques are for. The techniques in the article are more for stuff like Q&A, classification, summarization, etc.