If I'm going to augment my education with AI, I'd at least want to know it could get basic numerical facts right. If a computer program struggles with the concept of a number being greater than another number, how do I have any confidence that it can teach physics?
your concern would be valid if an llm were the ONLY tool being used. applications use multiple tools so you can use the appropriate tool for the job. if you're doing math, you don't want a standalone llm
As a student, how would I know what things the LLM tutor can provide correct answers for, and what things I will need to "use appropriate tools" for? Should I rely on the LLM to help teach me spelling, or US history, or are there more appropriate tools for these, too?
if the product is great, hopefully you as the user would not have to worry about which tool is doing which task. the developers would worry about that. it's the same in any app. the user doesn't know or care what tool is used to render the frontend or store the data in the backend
Since we're all just speculating to the wind here, I can see multiple ways LLMs can be used. Maybe it'll help simplify TA triage, maybe it'll just be a Discord bot. Maybe a classifier will sample from multiple models.
I think if anyone can give this idea a fair shot it's Andrew Karpathy, an ML expert and a person known to be passionate about education.