More

Yoric · 2026-06-13T17:58:47 1781373527

Directly at DeepSeek? It was my understanding (but I didn't check) that some other AI operators were providing (some of?) DeepSeek's model for cheaper prices.

Still, that's interesting. What do you get for that price? Only coding, or also e.g. image generation?

alecco · 2026-06-13T21:30:06 1781386206

DeepSeek API gave 6x to 8x better caching rate for inputs over OpenRouter (even chosing DeepSeek as provider). And some of the cheaper providers are using FP4 quantizations.

https://openrouter.ai/deepseek/deepseek-v4-flash-20260423#pr...

After complaints the cached read is not listed anymore in that page, you have to click one by one. All providers for DeepSeek V4 Flash charge ~$0.02 while DeepSeek provider is $0.0028. For coding this is huge as caching often gets in the range of 90 to 99%. But OpenRouter messes your caching so don't use it. And it seems to be a VC-backed closed middle-man company, not open source or open anything.

atreids · 2026-06-13T20:37:50 1781383070

Footprint's comment is correct. I go directly to Deepseek's platform API which they linked. There's no image generation but you get access to Deepseek V4 Flash and Deepseek V4 Pro, both of which are very capable for general text based tasks and programming. Flash is insanely cheap for how good it is ($0.14 per 1M input tokens vs $15 with Claude 4.7). V4 Pro I would put somewhere in the range of 80 to 90% as good as Opus 4.6 (based just on anecdotal usage - I use Opus 4.6 heavily at work as my company pays for it) while again being significantly cheaper. According to a benchmark[1] I read, processing 1million tokens would cost you $250 for Opus 4.7, $300 on GPT5.5... and just $35 on V4 Pro.

I just use it for my side-project coding and brainstorming tasks. At work I use AWS's Kiro CLI + Opus 4.6. At home I use Opencode + V4 Flash for the majority of "general" usage. I swap to V4 Pro for complex tasks if I feel like V4 Flash is struggling.

One other thing I highly like about the platform.deepseek API usage is it's a metered setup - not subscription based. Which means you only pay for what you use (the money that you put in doesn't expire) and can't spend more than you've deposited. This works well for me for my non-work coding because it generally happens in bursts. I may not code for a whole month (and therefore if I had a subscription it would have been wasted) and then spend a whole weekend coding nonstop.

It's entirely possible that there are middle-man providers that give a discount on Deepseek's own pricing, but I'm quite happy with the amount I'm paying so I haven't really looked into it.

[1]: https://lushbinary.com/blog/deepseek-v4-vs-claude-opus-4-7-v...

Footprint0521 · 2026-06-13T20:00:29 1781380829

I’ve been doing this too, it’s a cheat code! 1/100th of the price of Claude/openai prices for 95% of the quality. Site is platform.deepseek.com for that. No image generation, just text, but if you use it right it works great

Yoric · 2026-06-13T17:57:32 1781373452

https://www.canirun.ai/?status=tight might answer that question

zkmon · 2026-06-13T19:38:48 1781379528

That site doesn't seem to consider the quants. So useless.

Yoric · 2026-06-13T17:57:05 1781373425

I'm assuming that you need to feed the human being (i.e. you) regardless of whether you use that human being for writing code or not. So, by this metric, there is simply no breaking even point. The cost of human + AI is always going to be higher than the cost of human.

Yoric · 2026-05-28T07:03:58 1779951838

Anecdotally, a few weeks into a Rust agent-first project, we're still trying to get the agent to maintain a minimum of coding discipline (e.g. don't use sync Mutex in tokio code). So far, the agent seems more interested in deactivating the linters than in complying.

Security? At this stage, I'm a bit afraid that it's a joke more than anything else.

cadamsdotcom · 2026-05-28T07:50:23 1779954623

That should be solvable by denying permission to edit the lint files with a message saying lint files cannot be edited and not to use workarounds (sed, scripting etc)

You could also use hooks to block running of scripts for some number of turns after an attempt to cheat.

Yoric · 2026-05-28T14:54:01 1779980041

The agent can disable the lints inline, so that's not sufficient.

Also, I haven't found a cross-platform + cross-agent mechanism to set permissions. Much less one that works.

Right now, I'm working on a hook that checks for changes in source files, but the plug-in system (at least of opencode) seems quite buggy.

Yoric · 2026-05-28T06:55:12 1779951312

> Why on earth would AI labs be bragging about how little the product they sell actually costs them to make? You don't want to do anything that reduces it's perceived value to the user, that might make them less willing to pay for it.

Wouldn't they be bragging about it to investors? It feels like something that would matter a lot to them, and at least OpenAI kinda feels desperate to find them.

There's also the small question about whether a drop in inference cost would actually change anything about profitability, when training seems to get exponentially more expensive.

Yoric · 2026-05-28T06:50:28 1779951028

What do you mean about syntax mistakes and memory problems?

Something like incorrect SELinux configurations?

Yoric · 2026-05-22T22:12:02 1779487922

It seems that this era is a marketing experiment for Mythos.

We're running forward without any idea of how we can get agents to write code that is even remotely safe or secure. It _will_ blow up with increasingly large blast radiuses.

Yoric · 2026-05-22T07:30:07 1779435007

Go is pretty good at performance, but pretty bad at expressing domain-specific logics. Python is the opposite, but once you have isolated the parts that need to be optimized, it's quite easy to rewrite them in a native language (in particular, the Rust-Python bindings are really good, although in this project, it's C++).

Yoric · 2026-05-20T16:49:16 1779295756

Wasn't pepper the P in PNaCl?

Randomno · 2026-05-20T17:00:18 1779296418

The P in PNaCl is Portable.

Pepper was a pun on Native Client (since NaCl = salt). Pepper Plugin API (PPAPI) was Google's more secure version of NPAPI (Netscape Plugin API). Flash Player was essentially the only thing using NPAPI/PPAPI by the end of its life.

jcranmer · 2026-05-20T17:22:49 1779297769

The most common plugins were Flash, Silverlight, Adobe Reader, and the Java applet plugin, and I think all of those were in mildly common use when plugins were on their last legs.

pjmlp · 2026-05-20T20:18:53 1779308333

Now you can have all of them running on top of WebAssembly, companies even pay for support.

Yoric · 2026-05-15T09:38:04 1778837884

When I was a manager in a start-up, ages ago, I argued the CEO against handing a (small) one-off bonus to one of my team members, and rather went shopping for a nice gift with the same sum. One of them was purely a transaction, the other one was a gift.

I believe that I was right.

jtokoph · 2026-05-15T15:20:39 1778858439

I was once gifted a $100 bottle of alcohol after a project finished. I don’t drink much booze. I would have rather had the $100.

It very much depends on the gift.

zippyman55 · 2026-05-17T21:21:52 1779052912

We gave out frozen turkeys. Terrible gift if you do not cook.

vrighter · 2026-05-16T06:45:27 1778913927

at a startup i worked on they gave everyone a ps4, but also the option to exchange it for cash