Hacker Newsnew | past | comments | ask | show | jobs | submit | erogol's commentslogin

Just needed a markdown editor with simple AI features. Couldn’t find any and I created one in ~45 mins thanks to chatgpt and copilot

It is a single html file, no server and does everything I need.

Maybe it’d be useful for you as well… Kudos…


understaffed -> it was only me and still only me but with a great list of contributors :)


Thanks for doing this! I got it running immediately and I'm impressed and will try it in a project I have in mind plus spread the good word.


I would really love to see you get funding for developing an open platform for TTS that can offer commercial options that fuel growth. I really want to see your work scale. Easy TTS from custom data needs to be a thing.

Good job with the stuff you've built, and good luck!


Check out Coqui TTS where we continue the work.

https://github.com/coqui-ai/TTS

Mozilla TTS is not maintained anymore (at least ATM).

Disclaimer: I've created both of the projects.


How does this compare to ESPNET2? (https://colab.research.google.com/github/espnet/notebook/blo...)

Do you support multiple speakers?

Also, do you mind if I email you and ask you a few questions about Coqui TTS?


We support multi speaker models and working on even multi lingual models.

Come and join our gitter room.

https://gitter.im/coqui-ai/TTS


I remember giving Mozilla TTS a try but the docker image would crash on punctuations and symbols. Seemed to require clean up of the text and submitting it in small chunks.

Any idea if these issues still exist? Thank you.


The examples are really impressive! Are multiple voice tones/genders supported?


we are working on it.

Check our latest work https://edresson.github.io/SC-GlowTTS/

You can also check other released models here

https://github.com/coqui-ai/TTS/releases


That's mind blowingly amazing. It's far and away the best TTS I've ever heard in my life!

We've come a long from from SAM and my Amiga, I tell ya.


It mainly reflects the quality of the trained dataset, the earlier stages of the project and some experiments.

I suggest you the check the latest uploads on soundcloud.


Yep. The aim is to solve TTS for all languages one at a time.

You can check out the released models page for the other models and languages.

https://github.com/coqui-ai/TTS/releases


When are you going to do Chuvash ? ;)


If you're willing to record a public domain dataset, I'll help train a voice :)


Hope I am not repeating any comments here. My suggestion is that you start recording as soon as possible and as much as possible without worrying about technicalities. You can also use if you have any old voice records or videos with a relatively good voice quality. For now maybe she can read a book aloud in a silent room. After you have the data I can also help if you like to create a TTS model.



Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: