randomwalker's submissions

1.		Towards a science of AI agent reliability (normaltech.ai)
		1 point by randomwalker 46 days ago \| past
2.		When AI Builds AI – Findings from a Workshop on Automation of AI R&D [pdf] (georgetown.edu)
		1 point by randomwalker 75 days ago \| past
3.		The Longitudinal Expert AI Panel: Understanding Expert Views on AI [pdf] (static1.squarespace.com)
		1 point by randomwalker 5 months ago \| past
4.		Holistic Agent Leaderboard: The Missing Infrastructure for AI Agent Evaluation (arxiv.org)
		1 point by randomwalker 5 months ago \| past
5.		America's AI Action Plan [pdf] (whitehouse.gov)
		11 points by randomwalker 8 months ago \| past
6.		Could AI slow science? Confronting the production-progress paradox (aisnakeoil.com)
		2 points by randomwalker 8 months ago \| past
7.		AI as Normal Technology (knightcolumbia.org)
		239 points by randomwalker 12 months ago \| past \| 92 comments
8.		Why an overreliance on AI-driven modelling is bad for science (nature.com)
		1 point by randomwalker on April 9, 2025 \| past
9.		Is AI progress slowing down? (aisnakeoil.com)
		5 points by randomwalker on Dec 19, 2024 \| past \| 1 comment
10.		We Looked at 78 Election Deepfakes. Political Misinformation Isn't an AI Problem (knightcolumbia.org)
		5 points by randomwalker on Dec 13, 2024 \| past
11.		Inference Scaling FLaws: The Limits of LLM Resampling with Imperfect Verifiers (arxiv.org)
		3 points by randomwalker on Nov 27, 2024 \| past
12.		Is the UK's liver transplant matching algorithm biased against younger patients? (aisnakeoil.com)
		93 points by randomwalker on Nov 11, 2024 \| past \| 62 comments
13.		Core-Bench: Computational Reproducibility Agent Benchmark (arxiv.org)
		1 point by randomwalker on Sept 18, 2024 \| past
14.		AI companies are pivoting from creating gods to building products (aisnakeoil.com)
		133 points by randomwalker on Aug 19, 2024 \| past \| 195 comments
15.		AI Agents That Matter (aisnakeoil.com)
		35 points by randomwalker on July 3, 2024 \| past \| 10 comments
16.		AI Agents That Matter (arxiv.org)
		4 points by randomwalker on July 2, 2024 \| past
17.		Scientists should use AI as a tool, not an oracle (aisnakeoil.com)
		124 points by randomwalker on June 3, 2024 \| past \| 106 comments
18.		AI safety is not a model property (aisnakeoil.com)
		2 points by randomwalker on April 8, 2024 \| past
19.		AI safety is not a model property (aisnakeoil.com)
		3 points by randomwalker on March 13, 2024 \| past
20.		On the Societal Impact of Open Foundation Models [pdf] (stanford.edu)
		2 points by randomwalker on Feb 27, 2024 \| past
21.		Will AI transform law? The hype is not supported by current evidence (aisnakeoil.com)
		2 points by randomwalker on Jan 25, 2024 \| past
22.		Generative AI's end-run around copyright won't be resolved by the courts (aisnakeoil.com)
		4 points by randomwalker on Jan 22, 2024 \| past \| 2 comments
23.		Model alignment protects against accidental harms, not intentional ones (aisnakeoil.com)
		1 point by randomwalker on Dec 1, 2023 \| past
24.		What the executive order means for openness in AI (aisnakeoil.com)
		2 points by randomwalker on Oct 31, 2023 \| past
25.		The Foundation Model Transparency Index (stanford.edu)
		47 points by randomwalker on Oct 18, 2023 \| past \| 16 comments
26.		Evaluating LLMs Is a Minefield (princeton.edu)
		3 points by randomwalker on Oct 5, 2023 \| past
27.		Does ChatGPT have a liberal bias? (aisnakeoil.com)
		4 points by randomwalker on Aug 18, 2023 \| past \| 2 comments
28.		The REFORMS checklist for ML-based science (aisnakeoil.com)
		2 points by randomwalker on Aug 17, 2023 \| past
29.		ML is useful for many things, but not for predicting scientific replicability (aisnakeoil.com)
		116 points by randomwalker on Aug 11, 2023 \| past \| 37 comments
30.		Is GPT-4 getting worse over time? (aisnakeoil.com)
		7 points by randomwalker on July 19, 2023 \| past \| 1 comment
		More