More

fancyfredbot · 2025-05-18T22:40:19 1747608019

How often is this happening? If criminals are aware that 401k transfers are done using cheques in the post, and some of the addresses these are typically sent to, then I'm expecting this to be a very common type of fraud. Yet the practice of 401k cheques in the mail continues? Weird.

fancyfredbot · 2025-05-16T07:47:25 1747381645

Stylish and practical. Reminds me of these beauties by ArchiZoom.

https://www.italiandesignclub.com/2021/11/22/the-superonda-s...

fancyfredbot · 2025-05-07T22:38:59 1746657539

This is simultaneously why most people desperately want to invest in OpenAI and at the same time why all the best gen AI researchers want to work for anthropic. The less you understand the more impressive this seems. Conversley the more you understand the more embarrassing this seems.

gizmodo59 · 2025-05-08T01:15:33 1746666933

Can you give more detail on this? Or this is a vibe comment? Who do you consider as “best”?

fancyfredbot · 2025-04-24T07:43:42 1745480622

It's strange because there's no need to make this assumption about GPT-4o in order to demonstrate their point.

fancyfredbot · 2025-04-24T07:41:00 1745480460

If you game the benchmark then you always get found out by your users. Yet the practice remains common in hardware. Outright lies are uncommon but misleading and cherry picked numbers are pretty much standard practice.

The fact that misleading benchmarks don't even drive profit at Meta didn't seem to stop them doing the same thing, but perhaps this isn't very surprising. I imagine internal incentives are very similar.

Unlike the hardware companies though, gaming the benchmark in LLMs seems to involve making the actual performance worse, so perhaps there is more hope that the practice will fade away in this market.

fancyfredbot · 2025-04-23T10:31:17 1745404277

I had the same thought. I'm guessing rigorous and expensive safety certification, a custom designed steam driven turbo and alternator and stripping back and rebuilding the engine carriage? The fact it's got batteries and an alternator and a turbo suggests some stringent requirements.

fancyfredbot · 2025-04-23T09:13:36 1745399616

> like a distracted Karen playing Candy Crush as her SUV rolls through a busy intersection.

FYI this comes across a bit misogynistic! I'd word this differently

elzbardico · 2025-04-23T11:51:29 1745409089

Each sex can have their own stereotypes if you wish: The male drunk driver rushing through the same intersection is probably even more common than the unfortunatelly common screen distracted Karen.

fancyfredbot · 2025-04-23T17:33:45 1745429625

I'm really not trying to say that OP is misogynistic here, just mentioning that it can come across that way because the "Karen" example is oddly specific and not relevant to the overall point. Yes it's a stereotype and you could pick a male one but that'd also be weird and potentially also come across as sexist.

tialaramex · 2025-04-23T09:20:40 1745400040

Yeah, I rephrased.

fancyfredbot · 2025-04-22T19:53:04 1745351584

Wanted to add my voice to the chorus of appreciation for this article (actually a series of 8). Very informative and engaging.

fancyfredbot · 2025-04-19T13:50:58 1745070658

As opposed to before the AI age when biological and chemical warfare were a friendly affair?

fancyfredbot · 2025-04-17T07:29:45 1744874985

They used deep neural networks, reinforcement learning, and Monte Carlo tree search. All except the MCTS are critical components of modern LLMs. MCTS is a form of planning which you can argue has parallels to "reasoning" models, although that's pretty tenuous I admit.