I like this prompt for testing LLMs as the problem is easy to reason about but l...

mvdtnz · on May 10, 2023

I like that you're solving precisely a problem I face on a daily basis! (I still don't understand how ACC doesn't have a built-in calculator for this.)

washadjeffmad · on May 12, 2023

Interesting. Regenerating replies does seem to help, for whatever reason. I've noticed that the first reply after the model loads seems more likely to be inaccurate or a hallucination.

Regenerating with a GPT4xAlpaca 30B model after its initial answer of 243 L, I got a variation of the following on each of the next 5 rerolls:

"Based on your qualifying time of 2:04.317 and the length of the race being 20 minutes, we can calculate the number of laps needed for the entire race. Assuming each lap takes approximately 2 minutes (based on the average speed), there will be 20 / 2 = 10 laps during the race.

Using the information provided about the car using 2.73 liters per lap, we can determine the total amount of fuel required for the race. Therefore, it would be advisable to carry at least 10 * 2.73 = 27.3 liters of fuel for this race."

thorncorona · on May 10, 2023

Can you explain how the problem is solved for someone who doesn't understand racing?

underyx · on May 10, 2023

Sure thing! If the race is 20 minutes and each lap takes 2m04s, that means there will be 9.67 laps till the race is over, and you round that up to 10 since partial laps must be finished. You need 2.73 liters per lap, so the 10 laps will use 27.3 liters total. GPT-4 is correct in suggesting a tiny safety buffer above that in case fuel usage differs from expected.
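The arithmetic above can be sketched in a few lines of Python. This is only a minimal illustration: the function name `fuel_for_timed_race` and the `buffer_litres` parameter are made up for this example, and it assumes every lap takes the same time and that the lap in progress when the clock runs out must be completed. With the full 2:04.317 qualifying time quoted in the thread, the raw lap count comes out to roughly 9.65 before rounding up.

```python
import math


def fuel_for_timed_race(race_minutes, lap_seconds, litres_per_lap, buffer_litres=0.0):
    """Estimate laps and fuel for a timed race (hypothetical helper).

    Assumes a constant lap time and that a partial lap must be finished,
    so the lap count is rounded up to the next whole lap.
    """
    raw_laps = race_minutes * 60 / lap_seconds
    laps = math.ceil(raw_laps)
    fuel = laps * litres_per_lap + buffer_litres
    return laps, fuel


# Numbers from the thread: 20-minute race, 2:04.317 lap, 2.73 L per lap.
laps, fuel = fuel_for_timed_race(20, 2 * 60 + 4.317, 2.73)
print(laps, round(fuel, 1))  # 10 27.3
```

Passing a small `buffer_litres` value models the safety margin mentioned above; how large it should be is a judgment call and is not something the thread specifies.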

jason-phillips · on May 10, 2023

It's a math word problem, which is the kind of thing LLMs don't perform well on. I have no idea why people try stuff like this.

mvdtnz · on May 10, 2023

People try stuff like this because it's precisely the kind of problem that AI would be useful for. If one of these models turned out to be really good at it, it would signify that they're now useful for a whole class of problems.

chaxor · on May 10, 2023

If you want to solve math problems, LLMs are very useful for this, in exactly the way trained professionals are.

You make the model write code for you to solve it.

Would you ask your dad to compute the correlation matrix between 40 thousand vectors? No? Then don't ask an LLM to do it.

mvdtnz · on May 11, 2023

I ask ChatGPT / Bard to do all kinds of things I wouldn't ask my Dad for. This is a weird perspective.

underyx · on May 10, 2023

Besides, GPT-4 did solve this question perfectly. I like that rather than just involving math, there’s also some real-life knowledge needed to give a practical answer.

porkbeer · on May 10, 2023

Because it exposes accuracy problems, as queries often involve implied or implicit math skills.