Hacker Newsnew | past | comments | ask | show | jobs | submit | e1g's commentslogin

+1 from another happy Whispr Flow power user. I tried 4-5 similar apps and even built one with Assembly AI, but Whispr is a significant upgrade above the rest for correctly recognizing my accent and jargon. Having the custom vocabulary helps.


Do you happen to have a link with a more nuanced technical analysis of that (emergent) behavior? I’ve read only the pop-news version of that “escaping” story.


There is none. We don't understand LLMs well enough to be able to conduct a full fault analysis like this.

We can't trace the thoughts of an LLM the way we can trace code execution - the best mechanistic interpretability has to offer is being able to get glimpses occasionally. The reasoning traces help, but they're still incomplete.

Is it pattern-matching? Is it acting on its own internal goals? Is it acting out fictional tropes? Were the circumstances of the test scenarios intentionally designed to be extreme? Would this behavior have happened in a real world deployment, under the right circumstances?

The answer is "yes", to all of the above. LLMs are like that.


You might have missed the appendix the Anthropic blog post linked to, which has additional detail.

https://www.anthropic.com/research/agentic-misalignment

https://assets.anthropic.com/m/6d46dac66e1a132a/original/Age...


Knowing who is behind this campaign, 90% chance the extra white space, the graffiti, and this article were all commissioned by them intentionally.

Luckily this did not translate to sales, or we’d have another wave of Cluely BS copycats.


The cynic always sounds smart and remains poor.


I think the issue is more that a lot of people weren't cynical enough. I knew Bitcoin was a shit currency when I first heard about it, and thought that was all there was to it. I didn't understand that while it was a shit currency, it was a great speculative asset. I thought people would look at it, go "that's dumb", and move on. Apparently I hadn't heard of, or understood the Dutch tulip mania and similar historical events. I presumed people would be better than they turned out to be, and that cost me a lot of potential capital gains.


> The cynic always sounds smart and remains poor.

Why would cynics be poor? The OP mentioned "techies", many of whom have jobs paying 6 figures a year.


The Cynic on the other hand knows how to enjoy life with just enough. He is free, a spy for the gods.


But he keeps writing and talking about people who have more than enough, and how they are wrong.


He points out their foolishness - he has what they will never have. Enough.


It takes no imagination or insight to see reasons why something wouldn’t work. It’s the default mental pathway for every risk-averse beast. Skepticism is not born out of contentment and abundance but out of self-preservation. It’s not correlated with feeling enough, but with feeling bitterness and envy of those who took risks and gained an advantage instead of suffering consequences.


People who are content feel less need to take risks by accepting dubious statements without proof. They have what they need so why risk it for more?

Sceptical people will be grounded by what we know to be true. They will explore new ideas but will not be swept up by them. We need people like that or we'll waste our time on flights of fancy. But we need the irrational optimists to explore new ideas too. It's a classic exploration vs exploitation trade-off.


Many people who have risked their money by placing it on Bitcoin likely had enough, and they risked the extra money that they had lying around. Why not place bets on something you think might be probable? Is there something morally wrong in making some extra buck? Is it morally superior just to keep your money lying on bank account or what?


To have enough by your definition and to feel like one has enough are two very different ideas of enough.

The Cynic has enough if he has his cloak and found some food in the garbage can. He feels like he has enough. You might feel like that's not enough.

Conversely I might think the richest man in the world (by net worth) has enough. He feels like he needs more.


I'm pretty sure these peeps who hang out at /r/buttcoin are going to work like regular people to get some fiat currency to their beloved government blessed bank accounts. So I guess they don't feel like they have enough.


I have no idea what a buttcoin is, sorry.


To be honest I don't think the skeptical people thought bitcoin's success was probable and that's why they didn't bet on it. It's not really anything to do with them being content with what they have.

But it could be this too in some cases.

Some people do things unless they find a reason not to but so a skeptical person will only do things if they find a reason.

People who really feel they have enough might not see any reason to spend their time or effort placing bets, even on things they think are probable. But I don't think many people think that way.


> It takes no imagination or insight to see reasons why something wouldn’t work. It’s the default mental pathway for every risk-averse beast.

Quite the opposite: it takes a lot of strong will and risk to talk against a hype. A kind of risk-affinity that unluckily rarely makes you rich. :-(


To remove resulting notifications, see instructions here https://github.com/orgs/community/discussions/174283#discuss...

These spam repositories have been deleted, but I still had lingering notifications stuck on GitHub, and I couldn't see them in the UI to remove them (but the small blue notification dot was constantly on). The API hack resolved this problem.


Came here looking for this. Thank you - removed the annoying blue notification now.


Extremely slow for me - takes minutes to get anything done. Regular GPT5 was much faster. Hoping it’s mostly due to the launch day.


I've been using gpt-5 on effort=high but for gpt-5-codex, try: `-c model_reasoning_effort=medium`.

On high it is totally unusable.


even on medium ... gpt-5 was way faster, at least that's my first impression


> clean/erase/undo/soft-delete/hard-delete mistakes[...] make the change tracking capable of time travel itself [...] Transitioning to an EAV

I just finished building out all of that + more (e.g., data lineage, multi-verse, local overrides, etc), also on PG. Reach out if you want to chat and get nerd sniped!


For “Organizational Use”: the people who designed and own Myers-Briggs say this is a bad idea -

> The MBTI® assessment is not intended for use in selection of job candidates, nor for making internal decisions regarding job placement, selection for teams or task forces, or other similar activities. […] Given that it is not appropriate for selection, there have been no meaningful studies evaluating the MBTI’s ability to predict job performance. Established researchers in the field of predicting job performance would not use the MBTI assessment for this purpose.

[1] https://www.themyersbriggs.com/en-US/Support/MBTI-Facts


This is accurate for three reasons:

1) MBTI cannot be used adversarially - if people know certain types are preferred, they will answer to the test.

2) MBTI is strictly worse than Big 5 in study design since it doesnt have continuous variables and they correlate anyway.

3) There are huge liability issues if you say your product will help you hire or promote in the united states. Like q tips saying you can't ever use them in your ears.


You need to be generating >100M of them within the same millisecond before even remembering that collisions can theoretically happen.


Apparently there's 500 hours of video uploaded to YouTube every minute (30 seconds every millisecond). Assuming 4K@60fps, that works out to 14,929,920,000 pixels per millisecond.

If YouTube wanted to give every incoming pixel its own UUIDv7, they'd see a collision rate just under 0.6%.


    > Assuming 4K@60fps [...] they'd see a collision rate just under 0.6%
This doesn't detract from your point of collisions like that being viable at that scale, but assuming an average of 4K@60fps is assuming a lot. The average video upload there is probably south of 1080p@30fps.


You're glossing over the fact that they assumed youtube would want to assign a UUID to each pixel in a 4k@60fps video as the use case that this would fail for...


Excellent example. And at that scale, you are generating 100TB/s in UUIDs so if you need to store them, you have much bigger problems than collisions.


>You

The entire universe. Else it's not universally unique.


I like UUIDv7s as database IDs since they sort chronologically, are unique, and are efficient to generate. My system chooses the UUIDs; I don't allow externally generated IDs in. If I did, then an attacker could easily force a collision. As such, I only care about how fast I create IDs. This is a common pattern.

If your system does need to worry about UUIDv7s generated by the rest of the universe, you likely also need to worry about maliciously created IDs, software bugs, clocks that reset to unix epoch, etc. I worry about those more than a bonefide collision.


Your app is must be popular to be having an entire universe "amount" of users lol

joke aside all of this is theorical, in practical application its literally impossible to hit it that it doesn't matters if its possible or not since you are not google scale anyway


It's not just your app. It's any other app or data provider that you may now or in the future interact with.


Only if the other side uses your key as theirs, and uses it to store data from many sources. I, personally, don't feel it's hardly worth considering. A primary key under your own control doesn't cost much, and is a better choice.


That's not how namespacing works though, is it?

Getting UUID 'A' from app 'X' is easily distinguishable from UUID 'A' from app 'Y'.


The point of the first U in UUID, universal, is that you don't need to use namespacing.


Universal mean unique that uid wouldn't be used anyone else in any point in history or just universal available in one app????

because you just overreach at this point, if you can develop a better one. be my guest


Obviously, just the part within our light cone.


Another strategy is to pass your values as an array param (e.g., text[] or int[] etc) - PG is perfectly happy to handle those. Using ANY() is marginally slower than IN(), but you have a single param with many IDs inside it. Maybe their ORM didn’t support that.


Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: