Hacker Newsnew | past | comments | ask | show | jobs | submit | wild_pointer's commentslogin

Looking at the title, I was confused why a recommendation of some random PC gamer is interesting. Capitalization is important.

Hilarious, some of them are easy with the keyboard


It's not so cheap, in terms of maintenance and mental load


* The tool is truly amazing. Both for simple usage, and the advanced queries that it accepts. Very powerful, like a command line tool.

* As another comment says, v1.5 alpha has many advantages. Despite the alpha label, I find it to be very stable.

* Several software integrations exist: https://www.voidtools.com/forum/viewtopic.php?t=6326, I mostly like being able to see folder sizes instantly in explorer. I used xplorer2 in the past, which has a plugin, but I went back to native explorer, which has a Windhawk mod, feels like what Microsoft should have done: https://windhawk.net/mods/explorer-details-better-file-sizes


Everything about this feels like what Microsoft should have done. It’s absolutely amazing to me that search is so broken in Windows and yet a free third-party tool can instantly find any file anywhere.


One hypothetical I wonder about is what the windows ecosystem would be like if third parties could make distributions of windows, if somehow that could be licensed and enough windows building/packaging was opened up. It'd be interesting to see whether collaborations of projects would form where they pull out MS parts and substitute their own, presumably with the constraint that they maintain compatibility. I imagine it'd take a while for any commercial products thinking of getting involved to figure out sharing, trust, and how to offer it in a way companies or individuals might want to donate/pay for.


Windows file search has been useless as far back as I can remember. Especially file indexing and the load it puts on the CPU. I usually just disable file indexing on a new windows install.


I genuinely just don't use the Start Menu anymore. It cannot find anything, and every search will include two Internet results (Bing only of course) and a Microsoft Store reference.


This is why it’s slow, everything you enter is being exfiltrated for ads. Windows is corporate malware.


Hey, I remember this Black Mirror episode!


If mine ever start chirping, I will pull the plug.


Listen team lead and the whole team, make this button red.


Principal engineers! We need architecture! Marketing team, we need ads with celebrities! Product team, we need a roadmap to build on this for the next year! ML experts, get this into the training and RL sets! Finance folks, get me annual forecasts and ROI against WACCC! Ops, we’ll need 24/7 coverage and a guarantee of five nines. Procurement, lock down contracts. Alright everyone… make this button red!


We have to reject claude can do it simply by a prompt, then everyone can do it. As SWE's we are not going to pragmatically accept we are done. https://www.youtube.com/watch?v=g_Bvo0tsD9s


ha! The default system prompt appears to give the main agent appropriate guidance about only using swarm mode when appropriate (same as entering itself into plan mode). You can further prompt it in your own CLAUDE.md to be even more resistant to using the mode if the task at hand isn't significant enough to warrant it.


I like opencode for the fact I can switch between build and plan mode just by pressing tab.


Isn't it the same in base claude-code?


Yes.


Its shift-tab in Claude Code, fyi


Don't make mistakes.


ubiquitous? "Vienam" (with quotes) shows this page as the first result.


I wonder how much of it is due to the model being familiar with the game or parts of it, be it due to training of the game itself, or reading/watching walkthroughs online.


There was a well-publicised "Claude plays Pokémon" stream where Claude failed to complete Pokemon Blue in spectacular fashion, despite weeks of trying. I think only a very gullible person would assume that future LLMs didn't specifically bake this into their training, as they do for popular benchmarks or for penguins riding a bike.


If they game the pelican benchmark, it’d be pretty obvious.

Just try other random, non-realistic things like “a giraffe walking a tightrope”, “a car sitting at a cafe eating a pizza”, etc.

If the results are dramatically different, then they gamed it. If they are similar in quality, then they probably didn’t.


While it is true that model makers are increasingly trying to game benchmarks, it's also true that benchmark-chasing is lowering model quality. GPT 5, 5.1 and 5.2 have been nearly universally panned by almost every class of user, despite being a benchmark monster. In fact, the more OpenAI tries to benchmark-max, the worse their models seem to get.


Hm? 5.1 Thinking is much better than 4o or o3. Just don't use the instant model.


5.2 is a solid model and I'm actually impressed with M365 copilot when using it.


> as they do for popular benchmarks or for penguins riding a bike.

Citation?


9/10 for originality. 2/10 for usefulness.

Not bashing, that's how good ideas are found. But not this time IMO :)


I disagree.


OK.


In the era of LLM-generated content, such a high-quality writeup is a breath of fresh air. Well done!


Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: