For (English only) speech-to-text, NVIDIA's Parakeet-V2 is significantly faster ...

driscoll42 · 2025-07-22T19:57:52 1753214272

Compared to all Whisper models? Or just the faster ones? And which version of Whisper? I'm all for a faster, more accurate model, but I need a bit more detail.

ipsum2 · 2025-07-22T20:04:31 1753214671

All of them, in my experience.

driscoll42 · 2025-07-22T20:06:03 1753214763

Fair. Looking at the ASR leaderboard, it truly is better: https://huggingface.co/spaces/hf-audio/open_asr_leaderboard. NVIDIA's Canary might be even better? Will try these out. Appreciate you bringing these to my attention!