Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

WhipserX's diarization is great imo:

    whisperx input.mp3 --language en --diarize --output_format vtt --model large-v2
Works a treat for Zoom interviews. Diarization is sometimes a bit off, but generally its correct.


> input.mp3

Thanks but I'm looking for live diarization.




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: