Did it work? :) The architecture is very similar offset lstms which have been st...

cs702 · 2025-07-28T15:38:36 1753717116

I haven't had a chance to read the preprint carefully or play with the code yet. Best place to follow what's happening is by looking at the github repo, specifically open and closed issues and pull requests.

lumost · 2025-07-28T17:27:08 1753723628

I'll wait until some more benchmarks are run in this case. Unlike traditional software, vetting a model architecture works better than alternatives is a time and compute intensive process. You really can't just download it and "try it out" outside of general purpose models (which this is not).