Hacker News: jensneuse's comments

This is what GraphQL was designed for. Only select fields you really need. We've built an OSS Gateway that turns a collection of GraphQL queries into an MCP server to make this simple: https://wundergraph.com/mcp-gateway


The way we've solved this in our MCP gateway (OSS) is that the user first needs to authenticate against our gateway, e.g. by creating a valid JWT with their identity provider, which is then validated using JWKS. When they use a tool, they must send their JWT, so the LLM always acts on their behalf. This supports multiple tenants out of the box. (https://wundergraph.com/mcp-gateway)


I think they removed arenas again.


You can often fool yourself by using sync.Pool. pprof looks great because there are no allocs in benchmarks, but memory usage goes through the roof. It's important to measure real-world benefits, if any, and not just synthetic benchmarks.


Why would Pool increase memory usage?


Let's say you constantly have 1k requests per second, and each request needs one 1 MiB buffer. That means you have 1 GiB in the pool. Without a pool, there's a high likelihood that you're using less. Why? Because in reality, most requests need a 1 MiB buffer but SOME require a 5 MiB buffer. As such, your pool grows over time because you don't have control over the size distribution of the pool items.

So, if you have predictable object sizes, the pool will stay flat. If the workloads are random, you have a new problem because, as in this scenario, your pool grows up to 5x larger.

You can solve this problem. E.g. you can only give back items into the pool that are small enough. Alternatively, you could have a small pool and a big pool, but now you're playing cat and mouse.

In such a scenario, it could also work to simply allocate and use GC to clean up. Then you don't have to worry about memory and the lifetime of objects, which makes your code much simpler to read and reason about.


Long before sync.Pool was a thing, I wrote a pool for []bytes: https://github.com/thejerf/gomempool I haven't taken it down, because it isn't obsoleted by sync.Pool: my pool is aware of the size of the []bytes. Though it may be somewhat obsoleted by the fact that the GC has gotten a lot better since I wrote it, somewhere in the 1.3 time frame. But it solved exactly the problem I had: relatively infrequent messages from the computer's point of view (e.g., a system that is probably getting messages every 50ms or so), but messages that had to be pulled completely into buffers to be processed, and that had highly irregular sizes. The GC was doing a ton of work when I was allocating them all the time, but it was easy to reuse buffers in my situation.


>That means you have 1 GiB in the pool.

This only happens if every request lasts 1 second.


I guess if you allocate more than you need upfront that it could increase memory usage.


I don't get it. The pool uses weak pointers under the hood right? If you allocate too much up front, the stuff you don't need will get garbage collected. It's no worse than doing the same without a pool, right?


What the top commenter probably failed to mention, and jensneuse tried to explain, is that sync.Pool assumes the size cost of pooled items is similar. If you are pooling buffers (e.g. []byte) or any other type whose backing memory can grow beyond its initial capacity during use, you can end up in a scenario where backing arrays that have grown to MB capacities are handed out by the pool for jobs that only need a few KB, while KB-sized buffers go to high-memory jobs, which in turn grow their backing arrays to MB and return them to the pool.

If that's the case, it's usually better to have non-global pools, pool ranges, drop things after a certain capacity, etc.:

https://github.com/golang/go/issues/23199 https://github.com/golang/go/blob/7e394a2/src/net/http/h2_bu...


Also, no one GCs sync.Pool. After a spike in utilization, you live with increased memory usage until the program restarts.


That's just not true. Pool contents are GCed after two cycles if unused.


What do you mean? Pool content can't be GCed, because there is a reference to it: the pool itself.

What people do is what this article suggested, pool.Get/pool.Put, which makes the pool only grow in size even if the load profile changes. The app literally accumulates now-unwanted garbage in the pool, and no app I have seen made an attempt to GC it.


From the sync.Pool documentation:

> If the Pool holds the only reference when this happens, the item might be deallocated.

Conceptually, the pool is holding a weak pointer to the items inside it. The GC is free to clean them up if it wants to, when it gets triggered.


sync.Pool uses weak references for this purpose. The pool does delay GC, and if your pooled objects contain pointers, those are real references and can be a problem. If your app never decreases the pool size, you've probably reached a stable equilibrium with usage, or your usage fits a pattern that the GC has trouble with. If Go truly cannot GC your pooled objects, you probably have a memory leak, e.g. Nodes in a graph with pointers to each other in the pool, plus some root pointer to anything in the pool.



Is this a modern version of CGI with process isolation?


It's rather something that sits between WebAssembly and containers, combining the sandboxing guarantees of the former with the performance of the latter. From a security perspective, the composition is also really good (WebAssembly enforces memory limits, but doesn't have memory protection, NULL pointers are writable, etc. and this is solved here). But unlike WebAssembly, it is Linux-only. So, not something that can run in Web browsers.


We wrote a breadth first algorithm to handle the second problem you're describing. I'm curious to hear your thought on it: https://wundergraph.com/blog/dataloader_3_0_breadth_first_da...


That's really clever! Kudos. I'm gonna set aside some time this week to dive into the implementation


Short answer:

gRPC is widely understood as an API tool for microservices. Microservices solve an organizational problem, not a technical one. Ironically, gRPC doesn't really help solve that organizational problem.

However, GraphQL in combination with Federation, also known as GraphQL Federation, actually DOES help organizations scale APIs across teams and services.

So, even though popular opinion suggests that gRPC is better for microservices than GraphQL, the reality looks different.


There were times when it was popular to generate a GraphQL API from your database but that's not how the query language is used today.

GraphQL is a query language to implement query style APIs. These days, it's most widely used as a "Federation" layer to expose a single query-able graph on top of a (micro-) service architecture.


I absolutely love Postgres, but please allow me to say that you absolutely don't want to expose an API generated from a database to people outside of your team. This limits you a lot in changing the way you store your data.

I wrote about this topic before and haven't changed my opinion much. You don't want to have that tight coupling: https://wundergraph.com/blog/six-year-graphql-recap#generate...


What exactly is the problem with tight coupling? You're going to insert an entire layer that basically translates format A to format B, just so you can later change a column name in the database and not have to change it in the API or something?


1. You don’t want or need to expose lots of implementation details. Many of your data structures should be private, and many should be partly private.

2. Your data structures should not dictate the shape of your api, usage patterns should (e.g. the user always needs records a,b,c together or they have a and want c but don’t care about b)

3. It stops you from changing any implementation details later and/or means any change is definitely a breaking change for someone.


There are a few issues; one is that if you have the DB do everything, all of your business logic lives there too, instead of just the data. This is still fine if you have a single use case, but what if, in addition to your main application, you also need to use it for things like BI, customer service, analytics / predictions, etc.? It then quickly becomes better to use it as a datastore and have another layer decide what to do with it.

And in 30 odd years, everything will be different again, but your company's main data store will not have moved as fast.


Yes. A lot of the work I've done through my career is essentially this, once you boil away the rest of the details.


Normalization is one of those typical issues where you might be fine with having everything normalized when you start off, but then once performance gets bad you end up denormalizing tables that are typically joined.


The extremely obvious problem is that how you store data is an implementation detail and those change when requirements (or the market) evolve. I'll give you an API and will make triple sure it's as fast as a machine can even serve it and you let me worry about how it's all stored.

To additionally answer you with an analogy: when you have a problem with a company, you call the call center, not Jenny from accounting in particular. Jenny might have helped you twice or thrice but she might leave the company next year and now you have no idea how to solve your problem. Have call centers to dispatch your requests wherever it's applicable in the given day and leave Jenny alone.


> What exactly is the problem with tight coupling?

As Joel Spolsky put it: ”the cost of software is the cost of its coupling”.

More specifically, the cost of making changes when "if I change this thing I have to change that thing". But if there's no attention paid to coupling, then it's not just the two things you have to change, but "if I change this thing I have to change those 40 things".


you could use views as a layer in between?


You should use views as the layer in between. They'll let you version your API and make changes without breaking things.


Looks cool and I understand what it does. Can you explain the production use case for this solution? It's not clear to me how this would be used in real life.


Not OP and not at all related to the project, but I imagine in production you would just replace the container runtime of your Kubernetes/Nomad/Swarm/homemade orchestrator, and run Docker containers as usual (but each one is a separate VM). The advantage would be that you have actual isolation between the containers, so it's great for higher-security contexts (e.g. banks) or multi-tenant setups.


That's about right. RunCVM should be compatible with orchestrators today that use the Docker command line or API. As an experimental project, Kubernetes compatibility is beyond our current scope but it would be an interesting exercise.

