bmc7505's comments | Hacker News

The difference is that SAT/SMT solvers have primarily relied on single-threaded algorithmic improvements [1] and unlike neural networks, we have not [yet] discovered a uniformly effective strategy for leveraging additional computation to accelerate wall-clock runtime. [2]

[1]: https://arxiv.org/pdf/2008.02215

[2]: https://news.ycombinator.com/item?id=36081350


RETE family algorithms did turn out to be somewhat parallelizable, enough to get a real speed-up on ordinary multicore CPUs. There was an idea in the 1980s that symbolic AI would be massively parallelizable that turned out to be a disappointment.

https://en.wikipedia.org/wiki/Fifth_Generation_Computer_Syst...


You could argue that since automatic differentiation and symbolic differentiation are equivalent, [1] symbolic AI did succeed by becoming massively parallelizable; we just needed to scale up the data and hardware in kind.
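To make the equivalence concrete in miniature: forward-mode autodiff with dual numbers mechanically produces the same value the hand-derived symbolic derivative does, because the product and sum rules fall out of the dual-number arithmetic. A toy sketch (not the construction from the paper, just an illustration):

```python
# Minimal forward-mode autodiff via dual numbers.
class Dual:
    def __init__(self, val, eps=0.0):
        self.val, self.eps = val, eps

    def __add__(self, other):
        other = other if isinstance(other, Dual) else Dual(other)
        return Dual(self.val + other.val, self.eps + other.eps)
    __radd__ = __add__

    def __mul__(self, other):
        other = other if isinstance(other, Dual) else Dual(other)
        # the product rule, encoded in the epsilon component
        return Dual(self.val * other.val,
                    self.val * other.eps + self.eps * other.val)
    __rmul__ = __mul__

def f(x):
    return 3 * x * x + 2 * x + 1   # f(x) = 3x^2 + 2x + 1

def df_symbolic(x):
    return 6 * x + 2               # derivative computed symbolically

x = 5.0
ad = f(Dual(x, 1.0)).eps           # seed dx/dx = 1
assert ad == df_symbolic(x)        # both evaluate to 32.0
```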

[1]: https://arxiv.org/pdf/1904.02990


> [2]

In the comments, zero_k posted a link to the SAT competition's parallel track. The 2025 results page is here: https://satcompetition.github.io/2025/results.html Parallel solvers consistently score lower (take less time) than single-threaded solvers, and solve more instances within the time limit. Probably the speedup is nowhere near proportional to the amount of parallelism, but if you just want to get results a little bit faster, throwing more cores at the problem does seem like it generally works.


> The solvers participating in this track will be executed with a wall-clock time limit of 1000 seconds. Each solver will be run on a single AWS machine of the type m6i.16xlarge, which has 64 virtual cores and 256GB of memory.

For comparison, a single H100 has 14,592 CUDA cores, and GPU training clusters are measured in exaflops. The scaling exponents are clearly favorable for LLM training and inference, but whether the algorithms used for parallel SAT would benefit from compute scaling at that magnitude is unclear. I maintain that either (1) SAT researchers have not yet learned the bitter lesson, or (2) it is not applicable across all of AI as Sutton claims.


The correct way to do this is with finite model theory but we're not there yet.


Ginsberg stole it from Yeats — “the best lack all conviction…” / “the best minds of my generation…” — many similar verses, e.g., “what rough beast…” / “what sphinx of cement…”

https://www.poetryfoundation.org/poems/43290/the-second-comi...


Those aren't nearly close enough to be considered stolen. Possibly allusions (which is not stealing), but even then, the only similarity between the two "best" lines is the phrase "the best" itself. Nothing in the rest of the lines, before or after, is similar enough to be "stolen" (at most, Ginsberg tropes Yeats's "full of passionate intensity" of the worst into his best's "madness, starving hysterical", but that too is allusion, not stealing).

The best lack all conviction, while the worst // Are full of passionate intensity.

vs

I saw the best minds of my generation destroyed by madness, starving hysterical naked, // dragging themselves through the negro streets at dawn looking for an angry fix,

How is this stealing in any form?


He stole the concept of poetry from Yeats?





Although Wolfram doesn't mention it by name, this is closely related to what he is trying to do: https://en.wikipedia.org/wiki/Reverse_mathematics


Wanted to thank you for the share. I added "Reverse Mathematics: Proofs from the Inside Out", which is referenced in the article you posted, to my reading list: https://www.goodreads.com/book/show/34928283-reverse-mathema...


Depending on how comfortable you are with model theory you might also enjoy Dzhafarov and Mummert’s textbook, which first brought the subject to my attention.


This is roughly the intuition I have developed -- any computational function requires time and space to evaluate. Most computations carry with them some epistemic or aleatoric modeling uncertainty, but sometimes even a perfectly deterministic function with a worst case constant time complexity is worth approximating, as the constant factor may be prohibitive.

Given an exact decision procedure with astronomical lower bounds, and an approximate one that is identical on 99.99% of IID sampled inputs that takes a second to evaluate, which would you prefer? Given a low latency, high variance approximation, would you be willing to exchange latency for lower variance? Engineering is all about such tradeoffs.
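A concrete instance of the latency/variance exchange: a Monte Carlo estimator whose variance shrinks as you spend more samples, i.e., more wall-clock time. A minimal sketch, estimating pi by sampling the unit square (seeded, so the numbers below are reproducible):

```python
# Trade latency for variance: more samples -> slower, tighter estimate.
import math
import random

def estimate_pi(n_samples, seed=0):
    rng = random.Random(seed)
    hits = sum(rng.random() ** 2 + rng.random() ** 2 <= 1.0
               for _ in range(n_samples))
    return 4.0 * hits / n_samples

cheap = estimate_pi(100)        # fast, high variance
costly = estimate_pi(100_000)   # ~1000x the work, ~30x less standard error
assert abs(costly - math.pi) < 0.05
```

The standard error falls as 1/sqrt(n), so every extra digit of precision costs 100x the latency, which is exactly the kind of tradeoff curve you have to price when choosing between an exact procedure and an approximation.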

There is a neat picture [1] in GEB that captures a similar idea.

[1]: https://miro.medium.com/v2/resize:fit:4800/format:webp/1*VU1...


FWIW, I’ve had a very similar encounter with another famous AI influencer who started lecturing me on fake automata theory that any CS undergrad would have picked up on. 140k+ followers, featured on all the big podcasts (Lex, MLST). I never corrected him but made a mental note not to trust the guy.


This is a handy tool, but I wish it supported edge snapping. If you inspect the generated LaTeX it doesn't actually link up the FSM states, it just anchors them to raw TikZ coordinates.
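For the curious, the difference is whether the edge references the state nodes by name or hardcodes positions. A sketch using TikZ's automata library (my illustration, not the tool's actual output):

```latex
\begin{tikzpicture}[->, node distance=2cm]
  \node[state, initial]                (q0) {$q_0$};
  \node[state, accepting, right of=q0] (q1) {$q_1$};
  \draw (q0) edge node[above] {a} (q1); % snaps to the states if they move
  % \draw (0,0) -- (2,0);               % raw coordinates: breaks silently
\end{tikzpicture}
```

With named nodes, repositioning a state drags its edges along; with raw coordinates, every manual tweak leaves dangling arrows.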

