
I would say Gemini Live is getting there. It's lacking integration with NotebookLM and Keep. It would be amazing if, after starting a project conceptually, I could ask to move to code and have it fire up VS Code so I can get to work.

Gemini's home automation works nicely: it understands comments like "it's too dark in here" or "it's cold inside" and acts appropriately. That's with the Android app as an assistant, not live mode.

OpenAI's implementation is apparently similar but I haven't tried the voice mode as a free user.

I haven't tried Apple Intelligence yet on my M1 and don't have an iPhone, so I can't compare.

I've been looking at offline capabilities with open-weight models, but they aren't there either. A full speech-to-speech model [1] working on an M1 Mac would be incredible.

[1] https://arxiv.org/abs/2410.00037




Whisper is pretty good if you use the large model with GPU acceleration. But it's not instant like advanced voice mode.



