This plugin uses sqlx underneath, which handles prepared statement caching. Regarding migration: we just used a coding agent to migrate our database infrastructure to it, and it took under 20 minutes. Keep in mind this really only helps with static queries. We do support sqlc's dynamic queries, though.
Tip: you can use CASE statements and the like to keep queries static even when you have conditionals.
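To make the tip concrete, here's a minimal sketch (using Python's sqlite3 and a made-up `users` table, just for illustration): instead of string-building two query variants for an optional filter, one static query uses CASE so a NULL parameter turns the filter into a no-op.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE users (id INTEGER, status TEXT)")
conn.executemany("INSERT INTO users VALUES (?, ?)",
                 [(1, "active"), (2, "banned"), (3, "active")])

# One static query: when :status is NULL the CASE arm evaluates to 1
# (true) for every row, otherwise it applies the status filter. The SQL
# text never changes, so the prepared statement can be cached and reused.
QUERY = """
SELECT id FROM users
WHERE CASE WHEN :status IS NULL THEN 1
           ELSE status = :status END
ORDER BY id
"""

all_ids = [r[0] for r in conn.execute(QUERY, {"status": None})]     # [1, 2, 3]
active  = [r[0] for r in conn.execute(QUERY, {"status": "active"})] # [1, 3]
```

Same idea works with COALESCE, e.g. `WHERE status = COALESCE(:status, status)`.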
First, thanks for signing up early. It means a lot.
The $10/mo price needed 465 people to fill a cohort before we could turn on a single GPU. People signed up and churned while waiting, so we looked at the reservation pattern and determined 80 slots was optimal. That's reflected in the new price and throughput.
We're considering a 1-week option so people can test it out before committing to a full month. Would that help?
I was monitoring all the cohorts because I was looking forward to this. The site showed 4 out of 6 cohorts at 464 out of 465 sign-ups, so the numbers were deceiving. Now I've ended up with no way to unsubscribe or close my account.
We are aware of this. There was a bug that overcounted and now it's been fixed. If you'd like for us to delete your account, please contact support@sllm.cloud.
The audience here is developers buying API access. They want to see the model, the price, and the throughput, not a hero image and three paragraphs about our mission. Marketing copy between a developer and that information is friction.
You're right that we're less flexible than OpenRouter or Chutes. We don't let you hop between models per-request. If you want that, use those. If you want predictable cost and guaranteed throughput on one model, that's us.
On TEE: yeah, it's stronger, but it also adds cost and latency. We run dedicated hardware with no prompt logging and an isolated proxy. For most people who just don't want their data in someone's training set, that's enough. If your threat model is more serious than that, we're not the right choice.
On models: we are focusing on Qwen for now. We add based on demand. Would you actually use MiMo-V2-Pro or Trinity if we had them?
Thanks to everyone who shared feedback. We’re implementing it now.
Here’s what’s changed:
- We’ve removed the other LLMs for now and are focusing entirely on Qwen 3.5. We’ll bring back additional smaller models later, but most usage was already concentrated on Qwen 3.5.
- Pricing is now around $50. You get roughly 2× the throughput (61 tok/s vs. 31 tok/s, verified in testing), and it’s still unlimited. For context, that’s about 158M tokens per month. Comparable providers like Novita charge around $3.20 per million tokens, so this comes out to roughly 10% of typical token costs.
- Context size is now capped at 32K tokens. For the vast majority of use cases, this is more than sufficient.
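For anyone who wants to check the pricing claim above, here's the back-of-envelope math (assumptions: sustained 61 tok/s around the clock, a 30-day month, and the ~$3.20/M-token comparison rate):

```python
# Tokens you could generate in a month at a sustained 61 tok/s.
SECONDS_PER_MONTH = 30 * 24 * 3600            # 2,592,000
tokens_per_month = 61 * SECONDS_PER_MONTH     # 158,112,000 ≈ 158M

# What those tokens would cost pay-per-token at ~$3.20 per million.
pay_per_token_cost = tokens_per_month / 1e6 * 3.20  # ≈ $506

# Flat $50/mo as a fraction of that.
ratio = 50 / pay_per_token_cost               # ≈ 0.099, i.e. ~10%
```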
Also, read https://news.ycombinator.com/newsguidelines.html#generated