This is not only average. This is actual magic.
So let's be real: the SQL is average. The joins are average. The chart is average. And it took us less than five minutes; that is amazing, and that is the entire point.
You did not need a data engineer to model your HubSpot data, or a meeting to agree on whether it should be last-click or first-click or linear or time-decay or whatever.
You needed a query, written fast, on data you already own. Your LLM wrote it. You confirmed it made sense. Your manager got a link.
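For what it's worth, the kind of query being described fits in a few lines. A minimal last-click attribution sketch, assuming a made-up schema (`touches`, `deals`) as a stand-in for exported CRM data; this is not HubSpot's actual export format:

```python
import sqlite3

# Hypothetical stand-in tables for exported CRM data.
conn = sqlite3.connect(":memory:")
conn.executescript("""
CREATE TABLE touches (contact_id TEXT, channel TEXT, touched_at TEXT);
CREATE TABLE deals   (contact_id TEXT, amount REAL, closed_at TEXT);

INSERT INTO touches VALUES
  ('c1', 'organic', '2024-01-01'),
  ('c1', 'paid',    '2024-01-10'),
  ('c2', 'email',   '2024-02-01');
INSERT INTO deals VALUES
  ('c1', 5000, '2024-01-15'),
  ('c2', 1200, '2024-02-05');
""")

# Last-click: credit each deal's revenue to the most recent
# touch on that contact before the deal closed.
rows = conn.execute("""
SELECT t.channel, SUM(d.amount) AS revenue
FROM deals d
JOIN touches t ON t.contact_id = d.contact_id
WHERE t.touched_at = (
  SELECT MAX(t2.touched_at)
  FROM touches t2
  WHERE t2.contact_id = d.contact_id
    AND t2.touched_at <= d.closed_at
)
GROUP BY t.channel
ORDER BY revenue DESC
""").fetchall()
print(rows)  # [('paid', 5000.0), ('email', 1200.0)]
```

Swapping the correlated `MAX` for a `MIN` gives first-click; linear and time-decay need a weight per touch instead of a single row, but the shape is the same. That is the whole "meeting we didn't need."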
Honestly, average is clearly magic; prove me wrong.
I'll give it a go. This is generated slop, and the poor, factory-made quality of the writing undercuts every aspect of the argument.
Author here; I suppose the... side-eye awkward monkey meme was a bit lost on you; it was written that way on purpose. Funnily enough, everything is slop if you want it to be slop. This, however, was written by my own little hands. Now, I might be a bad writer, but that is indeed another subject.
At that point, asking the model to e.g. note any ambiguities about the task at hand is exactly equivalent to asking it to evaluate any input.
This point is load-bearing for your position, and it is completely wrong.
Prompt P at state S leads to a new state SP'. The "common jumping off point" you describe is effectively useless, because we instantly diverge from it by using different prompts.
And even if it weren't useless for that reason, LLMs don't "query" their "state" in the way that humans reflect on their state of mind.
The idea that hallucinations are somehow less likely because you're asking meta-questions about LLM output is completely without basis.
"Not in the slightest" is an overreach; the paper two levels down from that link doesn't really support the conclusion in the blog post. The paper is much more nuanced.
Are they going to fib to you sometimes? Yes of course, but that doesn't mean there's no value in behavioural metaqueries.
Like most new tech, the discussion tends to polarise into "Best thing evah!" and "Utter shite!" The truth is somewhere in between.
It's nothing like "most new tech".
Most new tech tends to be adopted early by young people and experienced techies. In this case it is mostly the opposite: the teens absolutely hate it, probably because shitty AI content does not inspire the young mind, and the experienced techies see it for what it is. I've never seen "new tech" like this: cheered on by the proverbial average "boomers" (i.e. old people doing "office jobs", not the literal age bracket) and despised by the young folks and by experienced experts of all ages.
Judging from Claude Code and the sheer number of “Make Your Favorite Anime Crush Into An AI” SaaSes on the market, I’d posit that both the young and experienced are quite enthusiastic about the new tech.
No mate, this tech is marketed as superintelligence. A nation of PhDs in a datacenter. Yadda, yadda, yadda. No in-betweens, please. Why is it not delivering after so many years and hundreds of billions in investment?
Name me a new bit of tech that hasn't been hyped beyond reasonable bounds. And yes, this is one of the worst examples. But saying it doesn't have its uses isn't reasonable either.
None was ever hyped like this before. What are you talking about? The Mac was about "it just works" (and it f*ing did). The iPhone was "a phone, an iPod and an Internet access device". Need more? Microsoft Excel: actually more powerful, if you know the tool, than the bullshit machine. C#, the programming language: "Java done right". And it bloody was! What they have in common: none of these techs was hyped beyond reasonable bounds. They were hyped a bit, but not to the level of the bullshit around LLMs. And none of them claimed to do incredible stuff only to underdeliver. After so much money burnt, yes, I want to see that nation of PhDs. I want to see AI "writing all the code" in six months (Anthropic claimed this in January this year). Enough of the bullshit, and of people being told they are stupid for not knowing how to win the lottery, or for comparing lottery systems. Show me the superintelligence or shut the f. up.
Reveal = show something that was hidden previously.
Seems like the appropriate word to use about a source code leak.
The words you proposed are suitable for describing the consequences of a revelation, while no longer containing any hint about their original cause, so using them would have led to a more verbose sentence delivering the same information.
It wasn't hidden previously. It was fairly well-understood.
The CC source doesn't "reveal" a single thing about anything other than Anthropic internals. It says nothing about the industry at large, certainly nothing new.
And this:
The words proposed by you are suitable for describing the consequences of a revelation, while no longer containing any hint about their original cause
doesn't make any sense. There is no "revelation" here. And the word "reveal" contains no connotations whatsoever about the "cause" of a "revelation".
This does what the best speculative fiction does, attempts to stretch and expand your understanding of the real world by presenting a provocative fictional reality.
The author is trying to get you to speculate on the kind of intelligence that would say this about humans.
GPT-2, o1, Opus... we've been here so many times. The reason they do this is that they know it works (and they seem to specifically employ credulous people who are prone to believe AGI is right around the corner). There haven't been significant innovations, and the generated code is still not good, but the hype cycle has to retrigger.
I remember when OpenAI created the first thinking model with o1 and there were all these breathless posts on here hyperventilating about how the model had to be kept secret, how dangerous it was, etc.
Fell-for-it-again award. All thinking does is burn output tokens for accuracy; it's the AI getting high on its own supply. This isn't innovation, but it was supposed to be super AGI. Not serious.
> All thinking does is burn output tokens for accuracy
“All that phenomenon X does is make a tradeoff of Y for Z”
It sounds like you’re indignant about it being called thinking, that’s fine, but surely you can realize that the mechanism you’re criticizing actually works really well?
>I remember when OpenAI created the first thinking model with o1 and there were all these breathless posts on here hyperventilating about how the model had to be kept secret, how dangerous it was, etc.
I've read the same about Llama and Stable Diffusion. AI doomers are, and always have been, dead wrong.
Genuine question - if you don't think the models are improved or that the code is any good, why do you still have a subscription?
You must see some value, or are you in a situation where you're required to test / use it, eg to report on it or required by employer?
(I would disagree about the code, the benefits seem obvious to me. But I'm still curious why others would disagree, especially after actively using them for years.)
The assumption the other person made was that I would only use it for coding. If you look through my other comments today, I suggest they are useful for performing repetitive tasks, e.g. checking lint on PRs. They can also be used for throwaway code, which is very useful.
I don't think the issue is with the model; it is with the implication that AGI is just around the corner and that that is what is required for AI to be useful, which is not accurate. The greyer area is agentic coding, but my opinion (one I didn't always hold) is that these workflows are a complete waste of time. The problem is: if all this is true, then how does the CTO justify spending $1m/month on Anthropic? (I work somewhere this has happened: OpenAI got the earlier contract, then Cursor Teams was added, and now they are adding Anthropic... within 72 hours of the rollout, it was pulled back from non-engineering teams.) I think companies will ask why they need to pay Anthropic to do a job they were doing without Anthropic six months ago.
Also, the code is bad. This is non-obvious to 95% of people who talk about AI online, because they don't work in a team environment or manage legacy applications. If I interview somewhere and they are using an agentic workflow, the codebase will be shit and the company will be unable to deliver. At most companies, the average developer is an idiot, and giving them AI is like giving a monkey an AK-47 (I say this as someone of middling competence; I have been the monkey with the AK many times). You increase the ability to produce output without improving the ability to produce good output. That is the reality of coding in most jobs.
AI isn't good enough to replace a competent human, it is fast enough to make an incompetent human dangerous.
Uhh, the model found actual vulnerabilities in software that people use. Either you believe the vulnerabilities were not found, or that they were not serious enough to warrant a more thoughtful release.
Like think carefully about this. Did they discover AGI? Or did a bunch of investors make a leveraged bet on them "discovering AGI" so they're doing absolutely anything they can to make it seem like this time it's brand new and different.
If we're to believe Anthropic on these claims, we also have to just take it on faith, with absolutely no evidence, that they've made something so incredibly capable and so incredibly powerful that it cannot possibly be given to mere mortals. Conveniently, that's exactly the story that they are selling to investors.
Like do you see the unreliable narrator dynamic here?
I don't see the problem here. How would you have handled it differently? If you released this model as such without any safety concern, the vulnerabilities might be found by bad actors and used for wrong things.
Vulnerabilities were found, probably a few by bad actors, when GPT-4 was released. Every vulnerability found now is probably found with AI assistance at the very least. Should they never have released GPT-4? Should we have believed claims that GPT-4 was too dangerous for mere mortals to access? I believe OpenAI was making similar claims about how GPT-4 was a step function and going to change white-collar work forever when that model was released.
The point is that this whole "the model is too powerful" schtick is a bunch of smoke and mirrors. It serves the valuation.
It's far simpler to believe that they are releasing it step by step: release to trusted third parties first, get the easy vulnerabilities fixed, work on the alignment, and then release to the public.
Or do you not believe that the vulnerabilities found by these agents are serious enough to warrant a staggered release?
On the other hand I've gotten to use opus-4.6 and claude code and the quality is off the charts compared to 2023 when coding agents first hit the scene. And what you're saying is essentially "If they haven't created God, I'm not impressed". You don't think there's some middleground between those two?
Also they just hit a $30B run-rate, I don't think they're that needy for new hype cycles.
Didn't OpenAI say something similar about GPT-3? Too dangerous to open source, and then a few years later they were open-sourcing gpt-oss because a bunch of OSS labs were competing with their top models.
If there's limited hardware but ample cash, it doesn't make sense to sell compute-intensive services to the public while you're still trying to push the frontier of capability.
that's more or less what I'm saying. "Claude Mythos Preview’s large increase in capabilities has led us to decide not to make it generally available", translated from bullshit, means "It would've cost four digits per 1M tokens to run this model without severe quantization, and we think we'll make more money off our hardware with lighter models. Cool benchmarks though, right?"
Think of all the things that took hundreds/thousands/millions of years to develop and mature, which humans have managed to destroy in relatively short order.
Every 50 years we cycle out an entirely new batch of thinking humans. What cognitive legacy is it exactly that you think is going to be self-preserving?
You're talking about the system altering the environment. GP was talking about the system altering itself. The system is a massive self-stabilizing collection of feedback loops. Unlike the static environment[0], it's incredibly hard to intentionally move such a system to a different equilibrium. If it weren't, we'd have solved all the thorny world problems long ago.
--
[0] - Any self-stabilizing system that operates much slower than us - such as ecosystems or climate - is, from our perspective, static.
> The system is a massive self-stabilizing collection of feedback loops.
Source? lol
Actual, measurable literacy is in the toilet. The average person reads at the 6th grade level. What sort of equilibrium are you trying to claim we are in right now?
> Unlike the static environment, it's incredibly hard to intentionally move such system to a different equilibrium.
It's the strongest possible memetic weapon humans would have - I think it's entirely consistent with the meta-nature of the book, especially the self-conscious part.
If the take is that religion is itself the weapon and the depiction given is mere evidence of that, OK; that at least avoids the ending being totally awful. HOWEVER:
The book spends much of its time saying the transcendent cannot even be represented, to people, to us the reader, and then just represents it, and in a tawdry Christian way.
I think the violation of that norm, as well as the ending being played straight, with literally a long paragraph explaining what ideaspace is... that's a fourth-wall break into Christianity imv.
Which makes the whole book read as, "the issue with humans is our physical bodies in a fallen world which are limited. just die, go to heaven, then you can know/represent/understand everything. Yay! Death!"
OK. Just kinda naff.
It reads as if a religious person accidentally wrote a good sci-fi book and then hurriedly, at the end, reminds us all that it's really a parable with a Noble Message: that in Death all things are transcended.
I read the book and at no time did I think "Christianity". It seems like motivated reasoning on your part. At no time did the book ever preach, or was even moralistic.
I'm referring to the ending of the published version, which is quite different from v1 (which ends abruptly), in particular the sections before and after:
> “She steps back from him. She flexes what could be wings.”
> “In ideatic space everything is possible and everything is real and every metaphor is apt. She sees a galaxy of shining points: people, all the people who have ever existed, packed almost densely enough to form a continuum, living and dead, real and fictional and borderline. Similar people, who think in similar ways and who stand for similar things, are closer together. Significant people, the famous and iconic, are brighter. There are stars for inanimate entities, too, and events and abstracts: countries, homes, works of art, births and first steps and words, shocks and dramas, archetypes, numbers and equations, long arcs of stories, grand mythologies, philosophies, politics, tropes. Every truth and lie is here. Ideatic space itself—the human conception of it, at least—is here too, a fixed point embedded inside itself. The idea of the Unknown Organization is here. The idea of Adam Quinn is here. Marie, rising, waking, is here. And occupying the same space as the first brilliant spiral is a second, its counterpart, a galaxy whose points are relationships between the points of the first: what each person means to each other person. Loves, mutual and unrequited; admirations, aspirations, intimidations, fears, and revulsions. Conceptions and misconceptions. There is Adam’s shining link with Marie, and Marie’s link back to Adam. And Marie’s link to the Organization. And at the core of the whole dazzling ecosystem is an ultimate singular point, to which every other point is connected: humanity.
> And the whole thing, the entirety of human ideatic space, is being torn apart. U-3125 hangs above it, a monumental, blinding new presence, a singular entity more massive and luminous than both spirals combined. Its malevolent gravity drags humanity and all human ideas into its orbit, warping them beyond recognition. Beneath it, within its context, everything becomes corrupted into the worst version of itself. It takes joy and turns it into vindictive glee; it takes self-reliance and turns it into solipsistic psychosis; it turns love into smothering assault, pride into humiliation, families into traps, safety into paranoia, peace into discontent. It turns people into people who do not see people as people. And civilizations, ultimately, into abominations.
> U-3125 is titanic in its structure, brain-breaking in its topology. It comes from another part of ideatic space, a place where ideas exist on a scale entirely beyond those of humans. Its wrongness and[…]”
> “She sets a course. Outbound, to the deepest limit of ideatic space.”
Etc. The references to U-3125 incarnating, and it being The Adversary. And the explicit ascension narrative, with Marie getting wings, flying through clouds of Ideas, which are actually animate and incarnated in this world, i.e., they are souls. I mean, it's a terribly misjudged ending.
It is like nails on a chalkboard.