o1 has a METR time horizon of around 40 minutes; opus 4.7 has an implied horizon of 18 hours based on its ECI score. this study is on a model that's several generations behind wrt the kinds of tasks it can complete. it would be shocking if this number were anywhere near as low with GPT 5.5, to the point that it seems almost totally irrelevant to talk about these results
I watched a talk from Bjarne Stroustrup at CppCon about safety, and it was pretty embarrassing secondhand, watching him try to pretend that C++ has always been safe and that safety mattered to them all along, before Rust came along.
Well, there has been a long campaign against manual memory management - well before Rust was a thing. And along with that, a push for less use of raw pointers, fewer index loops, etc. - all measures which, when adopted, significantly reduce memory safety hazards. Following the Core Guidelines also helps, as does using spans. Compiler warnings have improved, as has static analysis, also in a long process preceding Rust.
Of course, this is not completely guaranteed safety - but safety has certainly mattered.
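To make that concrete, here's a toy sketch of my own (not from the talk; assumes C++20 for std::span) showing what spans and range-for buy you over the raw-pointer-plus-index style:

    #include <cstdio>
    #include <span>
    #include <vector>

    // Old style: a raw pointer plus a separately tracked length.
    // Nothing ties n to the actual buffer, so a wrong count walks
    // straight off the end of the allocation.
    int sum_raw(const int* p, int n) {
        int total = 0;
        for (int i = 0; i < n; ++i)
            total += p[i];
        return total;
    }

    // Core Guidelines style: std::span carries its own bounds, and
    // the range-for removes the index arithmetic entirely.
    int sum_span(std::span<const int> xs) {
        int total = 0;
        for (int x : xs)
            total += x;
        return total;
    }

    int main() {
        std::vector<int> v{1, 2, 3, 4};
        std::printf("%d\n", sum_raw(v.data(), 4)); // caller must get 4 right
        std::printf("%d\n", sum_span(v));          // bounds travel with the view
    }

It's still not guaranteed safety - a caller can still construct a bad span - but the easy mistakes get much harder to make.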
Yes, this is what Stroustrup said, and it makes me laugh. IIRC he phrased it with more of a 'we had safety before Rust' attitude. It also misses the point: safety shouldn't be opt-in or require memorising a rulebook. If safety is that easy in C++, why is everyone still sticking their hand in the shredder?
You're "moving the goal posts" of this thread. Safety has mattered - in C++ and in other languages as well, e.g. with MISRA C.
As for the Core Guidelines - most of them are not about safety, and they are not meant to be memorized; they are a resource to consult when relevant, and something to base static analysis on.
some ai detectors work now. pangram detects this as 57% AI written, and the parts it thinks are human are... the ascii diagrams / screenshots. all the actual text it detects as generated.
ai detectors are never totally accurate but this one is quite good and it suggests something like 80% of this article is llm generated. honestly idk how you didn't get that just by reading it tho, maybe you haven't been exposed to much modern llm-generated content?
A lot of people forget how whimsical and strange and beautiful the old smaller GPT models and the original GPT-3 base model, pre-RLHF, could be. Nowadays hundreds of millions of people have talked to the heavily assistant-tuned 4 or 5, but comparatively very few people have ever even seen GPT-1 outputs. It's cheap to run, so I threw up a simple interface + a single server hosting it.
It looks like the person who added the backdoor is in fact the current co-maintainer of the project (and the more active of the two): https://tukaani.org/about.html
Why do you think that the human population is more intelligent, more knowledgeable, and achieves greater technological feats as time goes on? It's because of recursive self-improvement: we are raised and educated into being better in a quite general sense, which includes being better at raising and educating; this cycle repeats nearly every generation and has for all of human history, at least since we acquired language. We also build machines that help us make better machines, and then we use those better machines to make even better machines - another example of recursive self-improvement.
You're pointing out that groups/institutions/cultures/civilizations are examples of recursively self-improving entities, but the original point was about a recursively self-improving individual intelligent entity.
Well, to the extent that a human-level intelligence is an individual, anyway. We ourselves are probably a mixture-of-experts in some sense.
An individual human starts out as a mewling baby and can end up a maxillofacial surgeon through what is at least partly recursive self-improvement. Learn to walk, talk, read, write, structure, argue, write essays, study, cite, etc., all the way through to the end, with what you previously learned allowing you to learn even more. There's a huge amount of outside help, but at least some of it is also self-improvement.
Also, for the purposes of talking about the phenomenon of recursive self-improvement, individual vs society isn't the end of the analysis. Part of the reason AI recursive self-improvement is concerning is that people are worried about it happening on much faster than societal timescales, in ways that are not socially tractable the way human societies are (e.g. if our society is "improving" in a way we don't like, we or other humans can intervene to prevent, alter, or mitigate it). It's also important to note that when we're talking about "recursive self-improvement" in AI, the "self" is not a single software artifact like Llama-70B. The "self" is AI in general, and the most commonly proposed mechanism is that an AI becomes better than us at designing and building AIs, and the resulting AI it builds is even better at designing and building AIs.
New generations build on the scientific knowledge of previous generations. It may not be fast, but that sounds like recursive improvement to me. It seems reasonable for AI to accelerate this process.
A very small percentage maybe. I think I agree with the notion that most people bias toward thinking they are improving while actually self-sabotaging.