More

throwaway71271 · 2024-12-29T15:18:01 1735485481

then you will be even more impressed by Rule 110 being turing complete with a specific background pattern.

https://en.wikipedia.org/wiki/Rule_110

Joel_Mckay · 2024-12-29T15:32:36 1735486356

It gets even funnier, some folks implemented a version in geometry nodes within Blender.

https://www.youtube.com/watch?v=iSoqddxoCl8

One could do all sorts of fun abstractions with the simulation data. =3

throwaway71271 · 2024-12-20T18:30:56 1734719456

    | Name                                 | Semi-private eval | Public eval |
    |--------------------------------------|-------------------|-------------|
    | Jeremy Berman                        | 53.6%             | 58.5%       |
    | Akyürek et al.                       | 47.5%             | 62.8%       |
    | Ryan Greenblatt                      | 43%               | 42%         |
    | OpenAI o1-preview (pass@1)           | 18%               | 21%         |
    | Anthropic Claude 3.5 Sonnet (pass@1) | 14%               | 21%         |
    | OpenAI GPT-4o (pass@1)               | 5%                | 9%          |
    | Google Gemini 1.5 (pass@1)           | 4.5%              | 8%          |

https://arxiv.org/pdf/2412.04604

kandesbunzler · 2024-12-20T19:43:52 1734723832

why is this missing the o1 release / o1 pro models? Would love to know how much better they are

Freebytes · 2024-12-21T02:15:08 1734747308

This might be because they are referencing single step, and I do not think o1 is single step.

aimanbenbaha · 2024-12-20T23:41:41 1734738101

Akyürek et al uses test-time compute.

throwaway71271 · 2024-12-17T09:35:03 1734428103

I disagree with the premise of the article. We absolutely do not know how to teach programming both pedagogically and andragogically, you can see that 80% of the students after getting their masters in CS absolutely can not program, do not understand computation, or the computer, not to mention higher order abstractions and their interactions.

There are university students now in their 3rd year that paste chatgpt's javascript program into their c# code (also written by chatgpt) and ask me why is everything red and its not "working".

I think it is a problem of designing for the 'average' student, which of course does not exist.

The academia focuses too much on the abstract languages and constructs, but the reality is, the code will run on a man-made digital computer, which has certain properties, it uses memory in a certain way, has a clock, and is discrete, certain things are possible, and others are not. No matter how much we pretend, the equal sign (=) leaks through, and that in itself creates infinite confusion in people who pretend to get it.

I think hyper individualistic approach is the solution. My daughter for example really dislikes scratch and various graphical languages, but she does really well with machine code and assembly and c. When we were working on a chess game, and she said: "WAIT! the queen is just rook + bishop", we went into how can we actually structure our code to express that. The journey is completely non linear. Sometimes we go back, sometimes forward.

Hopefully AI + teacher can help with that. But the truth is, some kids get various concepts, like recursion or references and values, naturally, some never get it. And we are trying to teach them the same thing at the same time for the same duration, and just move on.

SICP is best for some, PDP11 machine code for others, godot or roblox lua for third, and so on. The problem is we don't know which is which :)

PS: Some of the things I made to teach my daughter: https://punkx.org/

aleph_minus_one · 2024-12-17T13:55:53 1734443753

> No matter how much we pretend, the equal sign (=) leaks through, and that in itself creates infinite confusion in people who pretend to get it.

I have a feeling that this issue might be amplified by the facts that

1. many US-American students are monolingual.

2. I have observed that many native English speakers are less interested in their own language than native speakers of other languages for their mother tongue.

For 1: If you are used to learn multiple (natural) languages, you are very used to the fact that (nearly) identical words can have very different meanings in different languages (false friends).

Examples:

sensible [en] vs sensibel [de] ("sensitive" in English)

bite [en] vs bite [fr] (a vulgar word)

So for me there is nothing surprising about the fact that "=" means different things in mathematics vs programming languages that use = as an assignment operator.

For 2: There exist quite a lot of English words that can have very different meanings depending on the context (arbitrary example: "left" can both mean a direction, and be the simple past and past participle of "to leave"). So what is so difficult about the fact that = means something different in a C/C++/Java/C#/Python program than in a math equation?

crabbone · 2024-12-17T11:36:47 1734435407

This isn't what the article says...

The articles says roughly that "while we know what to strive for, we are falling short of achieving our goals." It doesn't make the claim that we know how to teach programming.

Now, unrelated to what the article says, I wanted to bring up this point: there are several ways we know how to teach something, and they apply better to different disciplines. For instance, the approach where we do lecture-lecture-lecture-seminar-test is what's used to teach subjects that require very little hands-on work (eg. law, history etc.) Then, there's lecture-lab-lecture-lab-lecture-lab-test approach, which is better suited to subjects like biology or electrical engineering etc.

Some subjects have their own tradition of teaching that's mostly unique to them. Eg. painting is taught by assigning students to the professor's workshop, where they, basically, live for a few years. No formal "teaching" takes place, but the workshop's master interacts with students whenever he/she feels fit. Or, conservatory students who, usually, spend hours in one-on-one training with their teacher practicing their instrument of choice.

What I think is happening in teaching to program is that because the field originated in math + engineering faculties, the tradition of how to teach it was borrowed from that space too: lectures + labs + tests. But, I think, that programming is more similar to painting in the way how it's done and how it could potentially be taught. So, instead of short-term labs and tests, it would be beneficial for the students to have long-term projects, possibly worked on in groups, where interaction with the teacher happens when the teacher inspects the work done so far and decides to provide feedback.

The drawback of this approach is the grading that will not be defined formally and will be grounds for disputes between the students and the teachers. But, I still think it would've been better than the sequence of short-term assignments that must be worked on alone, because the later doesn't welcome and sometimes precludes the use of industry tools (s.a. version control systems, bug trackers). Essentially, today, those who are taught to program aren't taught how to use programming tools at all. Which is very weird to anyone coming from a school with a practical slant, where studying the tools would've been an essential part of the training.

throwaway71271 · 2024-12-17T12:05:32 1734437132

> This isn't what the article says...

thats not the only thing it says, but it does say it:

> The conclusions to be drawn from the points made throughout this paper are reasonably obvious: We need a new language for teaching novices at secondary school and introductory university level.

It also does states that somehow we know how to teach programming, and the discussion is around finding an incremental improvement given the learnings so far.

crabbone · 2024-12-17T13:44:39 1734443079

That's taken out of context. The article says that in order to become good at programming students will need to learn multiple languages, including those popular in the industry. But, since students are going to learn multiple languages anyways, it's also necessary to have a language that is useful to teach programming concepts rather than the ropes of the "real world". And this is the language it's talking about in your quote.

toolslive · 2024-12-17T10:27:22 1734431242

I concur, but it's even worse: I have a son who's doing a master in engineering (not CS). He had several programming courses in python and R. When looking at the example solutions, I noticed that whoever wrote them can't code, doesn't understand programming and certainly should not be allowed to teach it.

throwaway71271 · 2024-12-17T10:47:07 1734432427

I never went to university, but in high school I had two different teachers, and one was hundreds of times better than the other.

I am not really sure how teachers get hired, how they are interviewed and assessed, but the system is clearly not working; particularly in subjects that the learning curve is not a curve (like math, or physics or cs)

throwaway71271 · 2024-12-13T10:35:32 1734086132

> If I want to understand an individual human being, I must lay aside all scientific knowledge of the average man and discard all theories in order to adopt a completely new and unprejudiced attitude.

-- Carl Jung, The Undiscovered Self

Everything in the average is infinitely reduced, even if you make a chair, a perfect chair for one is torture for another. So we compromise for all.

It is the same with user interface, or education.

Now however I think we can break out of this, with new unique interfaces for each individual, or teaching every child what they are struggling with, be ahead or behind the other children, there is no need to teach quadratic equations to 1 million kids in the very same time in the very same way, some get them in the first lesson, some in the last, and some never get them.

throwaway71271 · 2024-12-07T09:49:51 1733564991

looking at the despair at https://www.reddit.com/r/cscareerquestions i am completely certain people are not above cheating

even if its not an app, everything is so cheap and small now, 1mm camera and microphone, and very tiny ear piece (invisible with longer hair), or even earless headphones, realtime whisper turbo and groq llama3.3 can absolutely pass most interviews and answer most questions

even without the tech, i am sure people are now cheating on face to face interviews as well

i worked at fairly big company, and did maybe 500 interviews, i never knew how the candidate would look in advance, so i dont see why cant someone else shows up to the interview, pass it and then the real person shows up for the job

throwaway71271 · 2024-11-28T16:47:03 1732812423

> But where are they hearing about these effects that get them so concerned? Is it the Australian news?

Their friends who have kids?

People still talk to other people.

throwaway71271 · 2024-11-17T11:41:35 1731843695

https://edition.cnn.com/2024/10/30/tech/teen-suicide-charact...

there are more extreme cases

techjamie · 2024-11-17T14:54:07 1731855247

https://nypost.com/2024/10/23/us-news/florida-boy-14-killed-...

This article has screenshots from the conversation. While the outcome is definitely more extreme, the actual conversation between him and the bot when it came to the message that encouraged him to go through with it is a little more questionable. He didn't ask it if he should kill himself directly, he told it he was going to "come home" to it, and it told him that he should.

But the bot being a roleplay bot could easily respond as if that's just a part of the roleplay, and his character was literally going home. It isn't responding with a prompt indicating that it's responses might effect a real person that way.

I would say what could really make it damning depends on the rest of the conversation, which will likely come out in court, and seeing if suicidal tendencies are present within its context window.

Ferret7446 · 2024-11-18T00:03:03 1731888183

The other problem is that this is for roleplaying. What if you wanted to roleplay a suicidal couple situation? To what extent should websites be legally obligated to verify the mental condition of its visitors?

throwaway71271 · 2024-11-17T09:39:39 1731836379

or how much money they lost on missed business

reissbaker · 2024-11-17T22:33:07 1731882787

You don't have opportunity cost of missed business, if the alternative is that you don't exist. If it allows you to launch faster, you aren't losing business by not having every feature; you're gaining business that you would've lost by not launching sooner. You can always build the extra features later.

Midjourney avoided having to make a frontend at first, meaning they launched faster. Once they got product/market fit, they built a web frontend, which is now better than the Discord interface. Seems like a pretty smart strategy, that worked well, for a completely bootstrapped company that never raised VC.

throwaway71271 · 2024-10-23T11:45:29 1729683929

I have a friend that has it.

On his first day of work he missed the bus, so he called our boss and just said "I missed the bus, the next one is tomorrow morning, see you tomorrow".

Working on any kind of problem with him is absolute joy, from outages to designing new features, including once when the server burned down and everything was lost.

thunderbong · 2024-10-23T12:44:48 1729687488

Would love to hear some more anecdotes

throwaway71271 · 2024-10-12T08:11:33 1728720693

many are cpu bound for stupid things like https://nee.lv/2021/02/28/How-I-cut-GTA-Online-loading-times...

or various accidentally quadratic functions

of course they shouldnt, but such is life

mahmoudhossam · 2024-10-12T08:20:12 1728721212

That's a perfect storm of bad coding though, not something I'd buy a new CPU for.

homebrewer · 2024-10-12T11:08:00 1728731280

If you spend as much time in online games as some people do (it's not rare to find a steam account with thousands of hours in GTA 5), it would absolutely make sense to get a new CPU which will save you 10-15 minutes of waiting time per day.

Also, "perfect storm of bad coding" is the norm for games, and not the exception, no?

throwaway71271 · 2024-10-12T09:03:27 1728723807

well ultimately you buy a new CPU so that your software runs faster

sadly after few years faster CPUs are normalized and we write sloppier code that makes the programs slow again

then amd has to work on next generation of speculative execution innovations and almost AGI branch predictors :) and we go again

hu3 · 2024-10-12T13:52:03 1728741123

Much easier to buy faster CPU than to decompile, debug and fix bottlenecks in games.

dundarious · 2024-10-12T15:30:08 1728747008

When you socialize the benefit with 0 cost to players, that's "easier" still. It's all about perspective.