
I just set up Groq with Kimi K2 the other day and was blown away by the speed.

Now I'm deciding whether I should switch to Qwen 3 and Cerebras.

(Also, off-topic, but the name reminds me of cerebrates from Starcraft. The Zerg command hierarchy lore was fascinating when I was a young child.)



Have you used Claude Code, and how does the quality compare to the Claude models? I am heavily invested in tools around Claude and am still struggling to make the switch and start experimenting with other models.


I still exclusively use Claude Code. I have not yet experimented with these other models for practical software development work.

A workflow I've been hearing about is: use Claude Code until quota exhaustion, then use Gemini CLI with Gemini 2.5 Pro free credits until quota exhaustion, then use something like a cheap-ish K2 or Qwen 3 provider, with OpenCode or the new Qwen Code, until your Claude Code credits reset and you begin the cycle anew.
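
To make the rotation above concrete, here is a minimal sketch of a quota-based fallback chain. Everything in it is hypothetical: `Backend`, `QuotaExhausted`, and `run_with_fallback` are illustrative names, and the stub functions stand in for whatever actually invokes each tool. It is not how any of these CLIs work internally, just the shape of the workflow.

```python
from dataclasses import dataclass
from typing import Callable, Sequence, Tuple


class QuotaExhausted(Exception):
    """Hypothetical signal that a backend has run out of quota or credits."""


@dataclass
class Backend:
    name: str
    run: Callable[[str], str]  # takes a prompt, returns a reply


def run_with_fallback(backends: Sequence[Backend], prompt: str) -> Tuple[str, str]:
    """Try each backend in order and return (name, reply) from the first one with quota left."""
    for backend in backends:
        try:
            return backend.name, backend.run(prompt)
        except QuotaExhausted:
            continue  # this backend is out of quota, move on to the next one
    raise QuotaExhausted("all backends are out of quota")


# Stub backends for illustration only; real ones would call the actual tools.
def exhausted(prompt: str) -> str:
    raise QuotaExhausted("out of credits")


def echo(prompt: str) -> str:
    return f"stub reply to: {prompt}"


chain = [
    Backend("claude-code", exhausted),  # preferred tool, quota used up
    Backend("gemini-cli", exhausted),   # free credits used up
    Backend("qwen-code", echo),         # cheap-ish provider still available
]
print(run_with_fallback(chain, "refactor utils.py"))
```

Ordering the list by preference is the whole design: when the first backend's quota resets, it is picked again automatically on the next call.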


Are you using Claude Code or the web interface? I would like to try this with CC myself; apparently, with some proxy, an OpenAI-compatible LLM can be swapped in.


I am using Claude Code, and my experience with it so far has been great. I use it primarily from the terminal; this way I stay focused on reading code while CC does its job in the background.


I’ve heard it repeated that, by setting the env vars, you can use GPT models, for example.

But I’ve also heard that running a proxy tool locally is needed.

I haven’t tried this setup, and can’t say offhand whether Cerebras’ hosted Qwen described here is OpenAI-compatible.

I also don’t know whether all of the tools CC uses out of the box are supported by even the most compatible non-Anthropic models.

Can anyone provide clarity / additional testimony on swapping out the engine on Claude Code?


I've used Kimi K2; it works well. Personally, I'm using Claude Code Router.

https://github.com/musistudio/claude-code-router


The issue is that most Groq models are limited in context length, since long context costs a lot of memory.
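
For a rough sense of why context is expensive, here is a back-of-the-envelope sketch of KV cache size under standard multi-head or grouped-query attention. The model shape below is made up for illustration and is not the configuration of any model mentioned in this thread; architectures that compress the cache will come out smaller.

```python
def kv_cache_bytes(num_layers: int, num_kv_heads: int, head_dim: int,
                   context_tokens: int, bytes_per_value: int = 2) -> int:
    """Estimate KV cache size for one sequence.

    The leading factor of 2 accounts for storing both a key and a value
    vector per token, per layer, per KV head.
    """
    return 2 * num_layers * num_kv_heads * head_dim * context_tokens * bytes_per_value


# Hypothetical model shape, 16-bit cache entries, 128K-token context.
size = kv_cache_bytes(num_layers=32, num_kv_heads=8, head_dim=128,
                      context_tokens=131072)
print(f"{size / 2**30:.1f} GiB")  # prints "16.0 GiB"
```

The cost grows linearly with context length and is paid per concurrent sequence, on top of the model weights.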


Obligatory reminder that 'Groq' and 'Grok' are entirely different and unrelated. No risk of a runaway Mecha-Hitler here!


Instead, there's the risk of requiring racks of hardware to run just one model!



