Hacker Newsnew | past | comments | ask | show | jobs | submit | hephaes7us's commentslogin

Thanks for sharing! I was literally getting ready to build, essentially, this. Now it looks like I don't have to!

Have you ever considered using a foot-pedal for PTT?

Apple incidentally already has native STT, but for some reason they just don't use a decent model yet.


They do, and they even have that nice microphone F5 key for it, and an ideal OS level API making the input experience >perfect<.

Apparently they do have a better model, they just haven't exposed it in their own OS yet!

https://developer.apple.com/documentation/speech/bringing-ad...

Wonder what's the hold up...

For footpedal:

Yes, conceptually it’s just another evdev-trigger source, assuming the pedal exposes usable key/button events.

Otherwise we’d bridge it into the existing external control interface. Either way, hooks are there. :)


The only issue with Apple models is that they do not detect languages automatically, nor switch if you do between sentences.

Parakeet does both just fine.


sorry, PTT?

push-to-talk.


You could reduce your labor costs and reduce the aggravation you are causing teammates if you changed your attitude.

It's possible to drive results and create a culture of accountability without dragging people into the room with you just so you can interrupt their work in-person.


I'm making a "Podcast Search Engine", partially as an excuse to play with Elixir and Phoenix/LiveView.

It's basically just a frontend to a semantic search system, and is a tangent while I explore "knowledgebase" concepts.

I'm extremely interested in knowledgebases at the moment.


Domain privacy isn't for degenerates.


They didn't say it was?


It's getting less silly every month! So many people in that boat only use the web browser anyway.

With a well-supported hardware configuration and a working web browser, even a non-techie may have a more stable experience than they would with Windows.

That has as much to do with the decline of Windows as with the ascent of desktop Linux, but still.


This is just one way information goes from being private to being public. It is sensible that people who provide intelligence to the market be compensated, whether they're better at inferring/predicting or whether they just know something we don't.

Obviously, in a case like this, an individual would be violating the terms of their employment/non-disclosure agreement. I agree that is bad!

I don't think that damns the concept of "predicting known information".


A currency peg is a double-edged sword in and of itself, to say nothing of the risk brought into the equation by having to trust a stablecoin issuer.


FUTO keyboard is trying to do this. I think they have some kind of distillation of Whisper running on-device.


They are just shipping the same whisper-small that everyone else is using, and did not much to improve their models since release. Other models have been "coming soon" forever. https://keyboard.futo.org/voice-input-models


Sounds like we might have a new way to fund reporting!


You can have it write code that you review (with whatever level of caution you wish) and then run that on real data/infrastructure.

You get a lot of leverage that way, but it's still better than letting AI use your keys and act with full autonomy on stuff of consequence.


Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: