Hey! :) Coincidentally the seeds I always use are 3407, 3408 and 3409 :) 3407 be...

iamnotagenius · 2025-01-28T14:13:01 1738073581

would be great to have dynamic quants of V3-non-R1 version, as for some tasks it is good enough. Also would be very interesting to see degradation with dynamic quants on small/medium size MoEs, such as older Deepseek models, Mixtrals, IBM tiny Granite MoE. Would be fun if Granite 1b MoE will still be functioning at 1.58bit.

danielhanchen · 2025-01-28T21:32:56 1738099976

Oh yes multiple people have asked me about this - I'll see what I can do :)