Hacker Newsnew | past | comments | ask | show | jobs | submit | csmantle's commentslogin

> GNU grep also generates output ”with indifference to the truth”.

GNU grep respects user arguments and input files to the dot. It is not probabilistic.


Also GNU grep doesn't claim to be intelligent.

Now you tell me!

"AI" comes in various flavors. It could be a expert system, a decision forest, a CNN, a Transformer, etc. In most inference scenarios the model is fixed, the input/output shapes are pre-defined and actions are prescribed. So it's not that dynamic after all.

This is also true of LLMs. I’m really not sure of OP’s point - AI (really all ML) generally is like the canonical “trivial to preallocate” problem.

Where dynamic allocation starts to be really helpful is if you want to minimize your peak RAM usage for coexistence purposes (eg you have other processes running) or want to undersize your physical RAM requirements by leveraging temporal differences between different parts of code (ie components A and B never use memory simultaneously so either A or B can reuse the same RAM). It also does simplify some algorithms and also if you’re ever dealing with variable length inputs then it can help you not have to reason about maximums at design time (provided you just correctly handle an allocations failure).


CVEs are better viewed as "a uniform numbering system that ensures we are talking about the same bug" today. But updating software is good anyway.

> Browsers are designed to be secure from default settings.

Not quite. They are usually designed to be both fast and safe, but neither goal is considered "done" yet in modern ones. If you want max security, you'll likely have to disable all performance boosts like JS JIT.


This seems to pass a transitive requirement to users.

Suppose your libpopular forbids ill-faith actors from using it. Also suppose that I wrote a my-utility, a neutral tool, that depends on libpopular. If some bad actor uses my-utility for wrongdoing, will I be responsible for their behavior? Will my-utility be in breach of your license?


> programmers should be some of the most worry-free individuals on this planet, the job is easy, well-paid, not a lot of health drawbacks if you have a proper setup and relatively easy to find a new job when you need it

Not in where I live though. Competition is fierce, both in industry and academia, for most posts being saturated and most employees face "HR optimization" in their late 30s. Not to mention working over time, and its physical consequences.


"Not in where I live though"

I mean, not anywhere, and the data absolutely annihilates their ridiculous claims. In subsequent posts they've retreated back to "yeah, but someone somewhere has it worse", invalidating this whole absurd thread.

Their comment has little correlation with reality, and seems to be a contrived, self-comforting fiction. Most firms have implemented hiring freezes if not actively downsizing their dev staff. Many extremely experienced devs are finding the market absolutely atrocious, getting zero bites.

And for all of the "well us senior devs are safe" sentiment often seen on here, many shops seem to be more comfortable hiring cheap and eager junior devs and foregoing seniors because LLMs fill in a lot of the "grizzled wisdom". The junior to senior ratio is rapidly increasing, and devs who lived on golden handshakes are suddenly finding their ego bruised and a market where they're fighting for low-pay jobs.


> Their comment has little correlation with reality

Or you know, we live and experience different parts of the world? Where you are, you might be right, and where I am, I might be right.

But nuance tends to be harder than trying to find some absolute truth and finding out it doesn't match your preconceived notion about the whole world.


Again, compare this to other professions, don't look at in isolation, and you'll see why you're still (or will have, seems you're a student still) having a much more pleasant life than others.


This is completely irrelevant. The point is that the profession is being devalued, i.e. losing value relative to where it was. If, for example, the US dollar loses value, it's not a "counterargument" to point out that it's still much more valuable than the Zimbabwe dollar.


It isn't though, none of our lives are happening in isolation, even if you don't believe it, there are other humans out there, with real responsibilities outside of computers.

Even if the competition is fierce, do you think it isn't for other professions, or what's the point? Of course a job that is well-paid, has few drawbacks and let you sit indoors in front of computer, probably doing something you enjoy in general, is popular and has competition.


> even if you don't believe it, there are other humans out there, with real responsibilities outside of computers.

Are those people's lives getting better because the capital class is able to devalue more skilled jobs every year?


Do other professions expect you to work during personal time? At least blue collar people are done when they get told they're done

I get your viewpoint though, physically exhausting work is probably much worse. I do want to point out that 40 hours has always been above average, and right now its the default


> Do other professions expect you to work during personal time? At least blue collar people are done when they get told they're done

No, and after my first programming job, neither does it happen in development. Make sure you join the right place, have the right boss, and set expectations up front, and you too can surely avoid it if it's important to you :) Usually you can throw in "work/life balance" somehow to gauge how they feel about it.

And yes, plenty of blue collar people are expected to be available during your personal time, for various reasons. Sometimes just quick questions (especially if you're a manager and you're having time off), sometimes emergencies that requires you to head on over to the place. Ask anyone who owned or even just managed a restaurant about that specific thing, and maybe you'll be surprised.


This “compare it to other professions” thing doesn’t really work when those other professions are not the one you actually do. The idea that someone should never be miserable in their job because other more miserable jobs exist is not realistic.


It's a useful thing to look at when you feel like all hope is lost and "wow is so difficult being a programmer" strikes, because it'll make you realize how easy you have it compared to non-programmers/nom-tech people.


Realizing how supposedly “easy” you have it compared to other people is not as encouraging or motivational as you’re implying it is. And how “easy” do you have it if you can’t find a job in your field?


Might be worth investigating why it isn't if so. People stressed about their situation usually find some solace in being helped realize what their position in the world actually is, as everything is always relative, not absolute.


This would benefit from combining the literal rules from TrumpScript [0]:

> All numbers must be strictly greater than 1 million. The small stuff is inconsequential to us.

[0]: https://github.com/samshadwell/TrumpScript


This will soon gets cumbersome if we're trying to construct some large struct literals (rather than arrays) directly on heap. Rust should be able to elide the unnecessary stack allocation here.


From the paper [0], they're using a specialized model structure, so at least they are not part of the LLM hype. That's good. But I still wonder how this compares to existing rule- and manual heuristics-based approaches like github/linguist.

[0]: https://securityresearch.google/magika/2025_icse_magika.pdf


I just compared it to "file" on my downloads folder and here's a summary of the differences (in essentially no particular order, but alphabetically by filename [not extension] if that matters):

1. Magikia identified more csv/tsv than file, but also had a few false-positives, most of which were one-word-per line (e.g. m3u, dictionaries, a list of checksums), so I guess technically a CSV with only one entry per line if you want to stretch the definition

2. A few PDF files were missed by file but flagged correctly by Magika; Zathura (muPDF backend) printed warnings about "repairing document" when opening, so these were malformed, but not so much that they wouldn't render. There was one PDF file that file recognized and appears to be well-formed but missed by Magika

3. Magika completely failed to recognize any old "mod" tracker audio files (.xm .mod and .it .s3m extensions) while file caught them perfectly. Maybe a hole in the training data?

4. Magika completely messed up on an old-school fixed-column field ALL CAPS data file, tagging it as vbscript. File just called it "plain text"

5. Slight nitpick, but magika flagged all ELF files as executables, even non-executable .ko and .o files

6. File did not flag a single YAML file (instead marked as plain text), magika caught at least some of them

7. Magika mis-flagged 3 out of 4 ssh public keyfiles; one as javascript, two as powershell

8. Magika was much better at specifically identifying zip-wrapped formats. One JAR file was missed by file, but caught by magika and an android .jar was identified as android by magika but just JAR by file.

9. File incorrectly tagged several raw disk images with a file-format that was near the beginning of the image; magika just gave up and called it an octet-stream

10. Magika missed the only .mobi file, calling it octet-stream while file got it correct

11. Magika missed all of the djvu files (maybe another hole in the trainign data?); file got them all correct.

12. A source file with a .jsx extension[A] that is definitely not the well-known jsx (no XML-like syntax and "final class Foo" declarations). Magika just said plain-text and file said C++ which is definitely wrong, so Magika wins here.

13. A very short asciidoctor file was misidentified as TCL by Magika and plain-text by file

14. An html snippet was called "Twig template" by Magika and plain-text by file

15. A Wikipedia markup file was called "javascript" by Magika and plain-text by file

16. A unix .mbox style e-mail file was correctly identified by Magika and called plain-text by file

17. Magika caught a .mp4 file that file missed; which should have been a slam-dunk for file; I'll dig into this one later and file a PR for file, since it doesn't appear to be malformed at all

18. A project gutenberg .txt file (The Divine Comedy) was tagged as vbscript; looking above at other issues, being aggressive about thinking things are programming files is a bias of Magika

19. A pk-zip file containing only .jpg files was misidentified as a TIFF file by magika. File got it correct; not sure what went wrong here

20. Magika misidentified an HEIC as a video file, which makes some sense

21. File correctly identified an old ASF video, Magika gave up and just called it an octet-stream

22. Magika correctly identified every iso-9660 file I manually looked for; file missed a couple

23. An html file misidentified as a ruby file; not a small one either, but missing the <html> header

24. What appears to be a whitespace-separated file with "#" line comments was identified as CSV by Magika and plain-text by file

25. several Gentoo ebuild files were identified as such by file, but as a shell-script by Magika (syntax is very similar)

26. Magika did not appear to identify any of the zstandard compressed files, file caught all of them

27. A brotli compressed json file was identified by Magika but just as octets by file

28. Two shell scripts were marked plain-text by Magika but caught by file

29. Another plain-text novel was identified as .csv by Magika

30. A zip file was missed by file, but caught by Magika

31. 3 .html files missing html headers were recognized by Magika but not by file

A: https://gist.github.com/jasom/973c12f37d7b8f7fc92463543f0da0... if anyone wants to take a crack at figuring out what it is


This is a single-cycle, architecture-level simulator with no microarch details or "complex" features (privileged infra, mapped memory, etc). But it is a good starting demo.

BTW, why invent Yet-Another-Toy-Arch(tm)? If a more established architecture is used, existing toolchains can be utilized to produce images for running. Many popular RISC ISAs have "simplified" editions which are void of many complex features, so they hardly need more efforts to implement with respect to this one.


I think innerText and setHTML() have different purposes. The former inserts the whole string as a text leaf, while the latter tries to preserve structures that are meaningful in context.

---

Libraries can surely do the same job, but then the exact behavior would vary among a sea of those libs. Having specs defined [0] for such an interface would hopefully iron out much of these variations, as well as enabling some performance gains.

[0]: https://wicg.github.io/sanitizer-api/#dom-element-sethtml


And if you need something that is not in a spec, you have to use a library anyway. Also the point was that browser should be as simple as possible and not like a whole new OS.


Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: