I wish they'd release some data or evaluation methodology alongside such claims.... | Hacker News

Hacker Newsnew | past | comments | ask | show | jobs | submit

		padolsey 50 days ago \| parent \| context \| favorite \| on: Claude Memory I wish they'd release some data or evaluation methodology alongside such claims. It just seems like empty words otherwise. If they did 'extensive safety testing' and don't release material, I'm gonna say with 90% certainty that they just 'vibe-red-teamed' the LLM.

Agentlien 50 days ago [–]

I really hope they release something as well, because I loved their research papers on analyzing how Claude thinks[0] and how they analyzed it[1] and I'm eager for more.

[0] https://transformer-circuits.pub/2025/attribution-graphs/bio...

[1] https://transformer-circuits.pub/2025/attribution-graphs/met...

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact