Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

I'm after something that can transcribe medical notes and unfortunately it does not work well for that case. (almost nothing does though) There's quite a few people interested in something that doesn't turn "laparoscopic" into "leper as cop it".

Maybe the current progress will help though. Models adjusted by your own dictionary or from postprocessing fixes would be amazing.



OpenAI Whisper is really good.

Here’s an iOS app to play with it: https://whispermemos.com

It even formats recording as paragraphs by running through GPT.


This site is using Whisper:

> Built using transformers.js and the whisper-tiny.en model.


I've been using Whisper Memos for some time now. Simple but useful app to quickly save an idea or memory when you don't have time to type. Speech recognition is much better than native one, especially with languages which are not widely supported.


This app has been my go-to solution for efficiently recording thoughts or memories without the need to spend time typing. Its proficiency in speech recognition is notably outstanding.


I really like the accuracy of Whisper, but I feel like it operates at roughly real-time on my machine.


You can speed that up 16x with "faster whisper" https://github.com/guillaumekln/faster-whisper


How is the latency? This is whisper running on the iPhone?


I had an application for radiology and whisper large-v2 with beam size five was essentially 100% across multiple different types of dictated radiology reports.


We are using Nuance Dragon Medical at work which is intuitive to use and surprisingly accurate even with very fast dictation. I have yet to come across a solution, that is as accurate, although I'm not sure if they offer solutions for end users directly.


Have you tried Siri Dictation? The transcription process doesn’t leave iOS 14 and on - and I’ve been pleasantly surprised.

Laparoscopic - that worked fine ;)

Edit: if you have one available, mind sending over a sample deidentified note to my email in profile? I’m working on something.


> Have you tried Siri Dictation?

No, there's no Apple hardware available in my scenario.

Also in medical context, if I can't tell where the data goes, the solution is not usable.


Have you tried https://speechmatics.com/ ? I think they have a specially tuned medical version, and quite a generous free allowance.


I have not, not will do tomorrow. Thanks for the link.


I work at Rev.AI, we have a model tuned for medical and we are HIPAA compliant across the board. We do human and AI transcription and our ASR is #1 accuracy in the world right now


I took at a look at the Rev.AI website and didn't see any mention of a medical-specific model nor HIPAA compliance. It would be nice if this information was presented in your marketing!


picovoice processes on the device and you can fine-tune the models https://picovoice.ai/platform/cat/


Have you tried Whisper and simply saying to GPT the context of conversation and to fix it. I think it should work


You'll probably need a custom model tailored to medical content.




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: