Do you have a suggestion for a way to measure whether model capabilities are getting destroyed? How does one measure it objectively?


These are now questions at the cutting edge of academic research. The answer may not be predictable in advance; you may simply have to check empirically.


Ask it the same series of questions after training that you posed before training started. Is the quality lower?
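A minimal sketch of that idea in Python. Everything here is hypothetical: query_model stands in for whatever model or API you're testing, the probe prompts are placeholders, and exact-string comparison is the crudest possible check (a real eval would score answers with a rubric or a grader model rather than diff strings).

    PROBE_PROMPTS = [
        "Translate 'good morning' into French.",
        "What is 17 * 24?",
        "Summarize the plot of Hamlet in one sentence.",
    ]

    def query_model(model, prompt: str) -> str:
        # Hypothetical helper: call your model or API of choice here.
        raise NotImplementedError

    def snapshot(model, prompts):
        # Record the model's answer to every probe prompt.
        return {p: query_model(model, p) for p in prompts}

    def drifted(before: dict, after: dict) -> list:
        # Return the prompts whose answers changed after fine-tuning.
        return [p for p in before if before[p] != after[p]]

    # Usage:
    #   baseline = snapshot(base_model, PROBE_PROMPTS)
    #   ... fine-tune ...
    #   tuned = snapshot(tuned_model, PROBE_PROMPTS)
    #   for p in drifted(baseline, tuned):
    #       print("answer changed:", p)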


That series of questions will only measure a particular area. I am concerned about destroying model capabilities in some other area that I do not pay attention to, and have no way of knowing about.


Isn’t that a general problem with LLMs? The only way to know how good it is at something is to test it.



