More

anon22981 · 2025-03-14T12:42:57 1741956177

The point is still kind of valid, it is a decently affordable EV compared to many other ”luxury” EVs. They also had (have?) very cheap payment options available. I’d suspect the expenses are a big factor in the purchasing decision.

edit: obviously it’s not a car for poor people as it is still very expensive. I understand your point too.

anon22981 · 2025-03-09T19:37:54 1741549074

Seems more like trying to find out the most proficient LMM users than anything else. I’ve never done interviews but I imagine I’d be hard pressed to skip candidates solely because they aren’t using LLMs.

Each to their own and maybe their method works out, but it does seem whack.

ZeroTalent · 2025-03-10T00:42:33 1741567353

The thing is, when you're doing frontend, a human programmer can't write 4,000 lines of React code in 1 hour. A properly configured LLM system can.

This is why I wouldn't hire a person who doesn't know how to do this.

christophilus · 2025-03-10T01:34:01 1741570441

What are you doing where 4000 lines of LLM-generated code per hour is a net positive? Sounds like a techdebt machine to me.

ZeroTalent · 2025-03-12T22:50:21 1741819821

UIs in React are very verbose. I'm not saying this is running 24/7.

anon22981 · 2025-03-07T16:53:13 1741366393

I find it laughable this whole thread is people throwing around opinions instead citations

anon22981 · 2025-02-19T13:27:17 1739971637

Bugfix =! rewriting code, though. Reworking an existing solution to be better is fun, but a bug fix often isn’t that.

(When talking about ”rewriting code” I think of a larger scale than for example three lines of code)

anon22981 · 2025-02-11T15:30:23 1739287823

I don’t think there’s anything surprising here and nothing to do with metacognition.

LLM should be able to answer ”I don’t know (for certain)” to questions where the training material also says ”this is not known and there are only speculations”. It’s the answer it’s training data gave.

anon22981 · 2025-02-11T11:22:33 1739272953

The people dismissing criticism are the worst.

A fine case example is Cyberpunk 2077 at release - the game was ridden with bugs and several of them gamebreaking. That on top of failing to reach what it’s marketing had promised.

I enjoyed the game ok then and a lot more after a lot of patches - it’s a good game atm imo.

But the criticisms were more than valid yet there was no shortage of very vocal defenders. According to them the critics were haters, complaining about nothing, they themselves didn’t encounter any bugs (or maybe two in twenty hours) etc. It’s baffling how some people want to defend shortcomings in products - wouldn’t it serve them too if the products were better? (Some goes with any brand fanboys, not just games)

isaacremuant · 2025-02-11T14:36:34 1739284594

I don't know if people have a natural inclination towards authority or they've been primed against it but there's a notable trend online to demonize consumer advocacy of this kind.

anon22981 · 2025-02-05T15:15:03 1738768503

The punchline is that after all is set and done these people will still believe the lies Trump and Musk give to them. Just shift blame to something else, have that common enemy, bring up some of the ”positives” you’ve done and they are just as hooked, if not more so.

anon22981 · 2025-02-02T14:35:43 1738506943

The reason you overestimate their capabilities is because you use them for things you don’t know anything about. It’s like when your nephew made a simple HTML website for himself twenty years ago that was <h1>Hi I am Mark</h1> — it seemed impressive, but you just didn’t know that it wasn’t. Using LLMs in real world complex cases (in programming or art) instantly reveal their significant shortcomings. They are a very good nephew for making stuff that seem impressive, but a bad expert or consultant.

anon22981 · 2025-02-02T14:23:16 1738506196

I use the 4o very often in my work and it mostly sucks. Sometimes it’s very good, sometimes it has nice knowledge that was faster to find from it than a search engine. Mostly it spouts out unhelpful noise (for my problems).

I’m sure if you need to make a to-do list in react it’s like magic (until the app gets complicated). In real world use, not so much.

(Also I have often code reviewed PRs from people who are heavy users and surprise surprise - their output is trash and very prone to bugs or being out of spec.)

TuxSH · 2025-02-03T00:05:22 1738541122

I also think 4o sucks, but have you tried DeepSeek R1 (free on their website)? I thought it night and day between 4o and o3-mini on the following topics:

- reverse engineering: when fed assembly (or decomp or mock impl), it's been consistently been able to figure out what the function actually does/why it's there from a high-level perspective. Whereas ChatGPT merely states the obvious

- very technical C++ questions: DSR1 gives much more detailed answers, with bullet points and examples. Much better writing style. Slightly prone to hallucinations, but not that much

- any controversial topic: ChatGPT models are trained to avoid these because of its "safety" training

ChatGPT is a bit better (and faster) at writing simple code and doing some math faster, but that's it.

(obviously, common sense about what to share and not to share with these chatbots still apply, etc.)

TeMPOraL · 2025-02-03T09:12:57 1738573977

You can access DeepSeek R1? For me, both chat and API have been down for over a week now (it shut down minutes after I topped up my account and generated an API key - I never got to use it :/).

There's lots of fiddling with these models. I found Claude 3.5 Sonnet to be superior to both GPT-4o and o1-preview in around 99% of the things I do; I only started comparing it against o3-mini, and right now it's a mixed bag. Then again, I tend to develop and refine specific prompts for Sonnet, which I haven't for o1-preview and o3-mini, so that could be a factor. Etc.

TuxSH · 2025-02-03T13:11:05 1738588265

> You can access DeepSeek R1?

Yes, well, I live in the EU and thus can avoid US work hours and Chinese peak hours. I think availability has been a bit better since they disabled websearch (also I noticed DSR1 half a week before it made the mainstream news).

> There's lots of fiddling with these models.

Agreed

TeMPOraL · 2025-02-03T18:08:50 1738606130

I live in the EU too. For me, the status page[0] shows a continuous API outage for the past 8 days, that is still ongoing. Since it started, my API requests bounce back with an error, which changes seemingly at random between "unauthorized" and "insufficient balance". Neither of those reasons are valid, since I'm using a valid API key I made after creating an account, which I topped up with $20 (and have an invoice from them to prove it). I must have had a mightily bad luck that the service went down soon after I generated the API key - I'm guessing my user/key is currently stuck in the middle of some migration, or possibly wasn't captured in a backup and got subsequently wiped. For now, I'll just patiently wait for them to fix their service.

--

[0] - https://status.deepseek.com/

TuxSH · 2025-02-03T18:26:01 1738607161

AFAIK it's hosted on Chutes for free too (though limited to between 2k and 10k output tokens). Azure as well, though it might be ratelimited there (or at least it is through openrouter)

anon22981 · 2025-01-28T13:21:01 1738070461

Something being a concern and doing something about it are two different things.

I’m concerned about plenty of things, for example regarding my health or relationships, yet mostly fail to take action to address these concerns. The same holds true for the state of this planet; climate change and microplastics all over etc.

I’m sure ”planet health” is in the top1000 concerns for most people not denying climate change.

paulcole · 2025-01-28T13:30:12 1738071012

I guess that’s where I see a difference. If you look at a problem and say, “I definitely care about this, I’m just failing to take action,” I’ll say, “Can’t be that big a problem then…”