Hacker News
Jargonic Sets New SOTA for Japanese ASR (aiola.ai)
19 points by four_fifths 6 months ago | 4 comments


SOTA: not used in the article but probably State Of The Art

ASR: Automatic Speech Recognition, speech-to-text


And here I was, as a ham radio operator, excited to read something about Summits On The Air.

shuffles dejectedly back to shack


Why no comparison to gpt-4o-transcribe?

If you don't compare against the latest model on the market, how can you claim it's SOTA?

According to OpenAI, gpt-4o-transcribe has much better performance than whisper-large-v2.

https://openai.com/index/introducing-our-next-generation-aud...


Are there any details on what they changed to improve over other existing models?



