More

anonymousDan · 2025-03-26T23:23:48 1743031428

This sounds like a security nightmare.

soulofmischief · 2025-03-27T00:10:00 1743034200

As it currently stands, MCP is absolutely a security nightmare. Combine this with a general lack of appreciation for security culture amongst developers, and the emerging vibe coding paradigm where non-security-minded people automatically generate and fail to properly audit production-facing code, and it's a disaster waiting to happen.

Feels like we've slid back into the 90s in this regard. Great time to be a security researcher!

TeMPOraL · 2025-03-27T08:10:40 1743063040

> Feels like we've slid back into the 90s in this regard.

Thank $deity. 90s and early 2000s were the times software was designed to do useful work and empower users, as opposed to lock them into services and collect telemetry, both of which protected by the best of advancement in security :).

I'm only half-joking here. Security is always working against usefulness; MCP is designed to be useful first (like honest to $deity useful, not "exploit your customers" useful), so it looks like security nightmare. Some of that utility will need to go away, because complete lack of security is also bad for the users - but there's a tradeoff to be made, hopefully one that doesn't just go by modern security zeitgeist, because that is already deep into protecting profits by securing services against users.

> a general lack of appreciation for security culture amongst developers, and the emerging vibe coding paradigm where non-security-minded people automatically generate and fail to properly audit production-facing code

There is also a general lack of consideration of who is being protected from whom, and why in the security culture. MCP, vibe coding, and LLMs in general are briefly giving end-users back some agency, bringing back the whole idea of "bicycle for the mind" that was completely and intentionally destroyed when computing went mainstream. Let's not kill it so eagerly this time.

soulofmischief · 2025-03-27T10:03:01 1743069781

A non-exhaustive list of concerns:

- How does a consumer of a remote MCP server trust that it is not saving/modifying their data, or that it is doing something other than what it said it would?

- How does a consumer of a local MCP server trust that it won't wreck their machine or delete data?

- How do servers authorize and authenticate end users? How do we create servers which give different permissions to different users?

These are examples of things which must be done right, and sacrificing user security in order to achieve market dominance is ethically bankrupt. Pedestrians don't know exactly which regulations serve them when a bridge is built, so we don't expect pedestrians to be able to stop corruption and laziness in civil engineering. The same should be true for mass infrastructure; we have a duty as engineers to make the right call.

> MCP, vibe coding, and LLMs in general are briefly giving end-users back some agency, bringing back the whole idea of "bicycle for the mind"

I love what software might look like in 15 years. I don't plan to kill that. I want to protect it, and also protect everyone involved.

sarthak_chauhan · 2025-04-01T21:44:34 1743543874

would love your thoughts on this: https://blog.cloudflare.com/remote-model-context-protocol-se...

jcheng · 2025-03-27T00:52:02 1743036722

It’s pretty astounding to me that this aspect of MCP is not mentioned more. You’re putting a LOT of trust in both the model and the system prompt when you start attaching MCPs that provide unfettered access to your file system, or connect up to your REST API’s POST endpoints.

(That being said, I have to admit I’ve been writing my own powerful but extremely dangerous tools as an experiment (e.g. run arbitrary Python code on my machine, unsandboxed) and I have to admit the results have been incredibly compelling.)

vykthur · 2025-03-27T00:12:42 1743034362

I tend to agree with this.

No, MCP's have NOT Won (Yet) https://newsletter.victordibia.com/p/no-mcps-have-not-won-ye...

lurking_swe · 2025-03-27T02:07:05 1743041225

agreed. this sounds useless at the moment unless you’re sand boxing it in a throw-away VM lol. Scary!

anonymousDan · 2025-03-25T12:31:08 1742905868

Spoiler alert - they won't.

anonymousDan · 2025-03-23T06:59:40 1742713180

It's interesting that none of these papers seem to be coming out of academic labs....

pizza · 2025-03-23T08:48:30 1742719710

Checking if a recommendation system is actually good in practice is kind of tough to do without owning a whole internet media platform as well. At best, you'll get the table scraps from these corporations (in the form of toy datasets/models made available), and you still will struggle to make your dev loop productive enough without throwing similar amounts of compute that the ~FAANGs do so as to validate whether that 0.2% improvement you got really meant anything or not. Oh, and also, the nature of recommendations is that they get very stale very quickly, so be prepared to check that your method still works when you do yet another huge training run on a weekly/daily cadence.

bradly · 2025-03-23T20:57:09 1742763429

> you still will struggle to make your dev loop productive enough without throwing similar amounts of compute that the ~FAANGs do so as to validate whether that 0.2% improvement you got really meant anything or not

And do not forget the incredible of number of actual humans FAANG pays every day to evaluate any changes in result sets for top x,000 queries.

lmeyerov · 2025-03-23T14:15:04 1742739304

As someone whose customers do this stuff, I'm 100% for most academics chasing harder and more important problems

Most of these papers are specialized increments on high baselines for a primarily commercial problem. Likewise, they focus on optimizing phenomena that occur in their product, which may not occur in others. Eg, Netflix sliding window is neato to see the result of, but I rather students user their freedom to explore bigger ideas like mamba, and leave sliding windows to a masters student who is experimenting with intentionally narrowly scoped tweaks.

lmeyerov · 2025-03-23T14:21:12 1742739672

As someone whose customers do this stuff, I'm 100% for most academics chasing harder and more important problems.

Most of these papers are specialized increments on high baselines for a primarily commercial problem. Likewise, they focus on optimizing phenomena that occur in their product, which may not occur in others. Eg, Netflix sliding window is neato to see the result of, but I rather students user their freedom to explore bigger ideas like mamba, and leave sliding windows to a masters student who is experimenting with intentionally narrowly scoped tweaks. At that point, to top PhD grads at industrial labs will probably win.

That said, recsys is a general formulation with applications beyond shopping carts and social feeds, and bigger ideas do come out, where I'd expect competitive labs to do projects on. GNN for recsys was a big bet a couple years ago, and LLMs now, and it is curious to me those bigger shifts are industrial labs papers as you say. Maybe the statement there is recsys is one of the areas that industry hires a lot of PhDs on, as it is so core to revenue lift: academia has regular representation, while industry is overrepresented.

anonymousDan · 2025-03-20T18:20:41 1742494841

Just to be clear, you're perfectly fine with this?

anonymousDan · 2025-03-19T14:42:12 1742395332

Presumably if you disable this on a per-transaction basis you are still guaranteed that transactions are replicated in FIFO order? Otherwise you could end up in weird situations where a synced txn reads from a lost txn and your state is effectively corrupted.

anarazel · 2025-03-19T18:58:34 1742410714

Correct. Async commit transactions add their changes to the WAL the same way normal transactions do. The only relevant change is that the WAL is not synchronously flushed to disk before COMMIT completes.

ants_a · 2025-03-20T08:49:25 1742460565

Yes and no. On the primary durability order and visibility order are different. So an async transaction that starts committing later can become visible to readers before a sync transaction that comes before it.

A read-write sync transaction that reads the result of a non-durable transaction cannot commit ahead of the async transaction. But there is no such guarantee for read-only transactions. So a transaction could read a non-durable state, decide that no further action is needed, and have that decision be invalidated by a crash/failover.

To make things even worse, similar things can happen with synchronously replicated transactions. If a committing transaction waiting on replication is cancelled it will become visible immediately. A reasonably written application would run a retry of the transaction, see that the result is there and conclude that no action is necessary, even though the data is not replicated and would be lost if a failover then happens.

anonymousDan · 2025-03-20T18:15:39 1742494539

Yeah these are the kind of subtle issues I expected the article might be either glossing over or unaware of.

anonymousDan · 2025-03-14T00:04:23 1741910663

Is that ack sent once the request is received or once it is stored on the remote disk?

anonymousDan · 2025-03-14T00:03:07 1741910587

Nice article, but the replicated approach isn't exactly comparing like with like. To achieve the same semantics you'd need to block for a response from the remote backup servers which would end up with the same latency as the other cloud providers...

anonymousDan · 2025-03-09T00:13:54 1741479234

What were the problems you had when using it as a persistent data store if you don't mind me asking? Also, what kind of workloads did you have?

lukaslalinsky · 2025-03-09T07:03:07 1741503787

The problematic use case was using Redis as a counter for various app related metrics, which I want to make visible to the customer later on. App was writing to Redis, background job was reading these partially aggregated metrics from Redis and writing them to another database. So you could say it was a remote buffer. On way too many occasions, after short failure of the host where the current master instance was running, the replica was not propagated to master. That was using redis sentinel. I guess my main issue was always redis sentinel.

anonymousDan · 2025-03-02T00:23:20 1740875000

I've actually used it to generate business plans and strategies for some technical work I've done. I've not followed up but the options it created were all credible and would definitely lower the bar for someone less commercially oriented to get up and running with a business idea.

anonymousDan · 2025-02-28T21:14:09 1740777249

Some things are worth fighting for you quisling.

just-ok · 2025-02-28T21:41:46 1740778906

As long as someone else does the fighting, right? Last I checked, the majority of Ukrainians themselves want a quick end to the war.

[1]: https://news.gallup.com/poll/653495/half-ukrainians-quick-ne...

ceejayoz · 2025-02-28T21:49:43 1740779383

I want a million dollars. Very much!

That doesn't mean I'll accept your proposal to rob a bank.

sterlind · 2025-02-28T22:42:28 1740782548

What a dogshit poll. There were three options given:

1. Ukraine should continue fighting until it wins the war.

2. Ukraine should seek to negotiate an ending to the war as soon as possible.

3. Don't know/Refused.

"Ukraine should surrender unconditionally" and "Ukraine should negotiate permanent security guarantees" and "Ukraine should fight its way into a better negotiating position" are all in the same bucket. This is maliciously bad poll design.