More

nl · 2026-05-01T03:33:01 1777606381

Has anyone tried the InvisiLight kits? They give you a 600 micron fiber optical link in the kit which you can hide down the side of carpet etc. I haven't tried it but a few YT vids made it look interesting.

https://www.amazon.com/stores/InvisiLight/page/DFF3C042-0D2E...

hamdingers · 2026-05-01T03:49:41 1777607381

Looks like a complicated and expensive way to avoid running copper. "Up to 1Gbps" is not confidence inspiring, if you really can't put holes in your walls I'd try a powerline kit first for 1/5th the price

nl · 2026-05-01T05:15:03 1777612503

This is the YT video I saw about it: https://www.youtube.com/watch?v=Z2FbzCyiNr4

He used different transceivers and got 10Gbs

nl · 2026-04-30T23:41:40 1777592500

Simon Willison wonderful Datasette plus the mbox-to-sqlite extension is exactly what you want.

https://datasette.io/

https://github.com/simonw/mbox-to-sqlite

rahimnathwani · 2026-05-01T03:50:38 1777607438

This is good but the UI is nothing like a regular mail client, so would be difficult to use for non-technical users.

nl · 2026-04-30T23:39:03 1777592343

DuckDB is great, and DuckDB-WASM is magic.

I build a whole LLM benchmark system around it that lets you run the whole benchmark in your browser: https://sql-benchmark.nicklothian.com/#sample-queries-and-sq...

Click on a cell and you can run the SQL the LLM generated vs what the solution is: https://sql-benchmark.nicklothian.com/?highlight=ggml-org_ge...

The hardest part is getting people to understand that it is interactive! People expect a document-looking webpage to be static, but we can do so much better!

mrtimo · 2026-05-01T00:30:19 1777595419

This is awesome. Is your code open source? It would be cool to make a textbook for SQL in this format.

nl · 2026-05-01T02:07:39 1777601259

Yes, code here: https://github.com/nlothian/llm-sql-benchmark

There is a V2 branch with some minor fixes etc I'm working on.

mrtimo · 2026-05-01T04:27:25 1777609645

Can it be run as a static site(github pages) ?

nl · 2026-05-01T05:12:36 1777612356

It is completely static - everything is done on the client side.

It's WASM plus client-side React components in a static site.

I'm hosting of Cloudflare Pages (without their serverside function hosting), so yes GH pages will work fine.

nl · 2026-04-28T02:13:58 1777342438

I use the Claude Code VSCode plugin for 80% of my work.

I prefer it because I can look at the code (although not as often anymore) and config (very often!) easily.

It also lets me jump to previous conversations easily.

There are a few cases where the CLI makes sense. One big one is if you are running multiple simultaneous sessions on a remote server using Tmux to have them preconfigured when you reconnect is nice.

Bun in general I don't see the benefit either.

krzyk · 2026-04-28T07:31:34 1777361494

You can look at the code in editor or IDE even when CLI agent is doing work.

I do that when I want to, but for me using agents in IDE is like looking with one eye covered.

nl · 2026-04-28T02:07:31 1777342051

Judging by the multiple providers selling it for around the same price (including non-VC funded competitors): no, it isn't subsidized.

johndough · 2026-04-28T07:59:22 1777363162

Is there somewhere I can look up whether a certain provider is VC-funded?

2ndorderthought · 2026-04-28T10:25:03 1777371903

It's not really about that. China is eating the US's lunch when it comes to ai. Don't get me wrong opus is the strongest model out there today, but that's the us's only advantage right now. Deepseek,qwen,kimi, etc all have fundamental research making the models smaller, more efficient, scalable, etc. in the US the plan is to buy all the hardware, write legislature, embargo other countries, keep models and research closed, so people cannot innovate for the next two to five years.

Unlike the us chinas focus is on research and sustainable building. China also has really good infrastructure for energy, etc. it is also to their advantage to drop 5 billion instead of 2 trillion and beat the us while turning a profit.

Chinas focus in ai is less flashy and because they are the biggest manufacturing super power in the world right now, it directly feeds their economy. They aren't looking for applications or to replace thought workers with slop bots, they have natural needs for this technology. Us manufacturers can't compete so they have to keep companies from selling their goods there see byd. China sees it as commoditizing their complement, the us is risking its entire economy and it's environment and resources, kind of scary.

nl · 2026-04-28T01:35:36 1777340136

> Having seen nuclear weapons not used post WWII ... does that inform us about "the odds"

This is what Bayesian prediction does

> save for out of band behaviour by individuals that averted use and escalation?

This is kind of the point being made.

defrost · 2026-04-28T02:23:17 1777342997

> This is what Bayesian prediction does

Repeatedly, in a reproducible way, for events in the arrow of time? We can test this by going back to 1945 and running forward again?

> This is kind of the point being made.

Was it?

( assume I did a little math some decades past and have some poor grasp of Bayesian statistics )

nl · 2026-04-28T04:18:37 1777349917

Edit: Here is a Claude artifact you can play with to try this yourself: https://claude.ai/public/artifacts/402f2670-5f48-4d76-96df-8...

You can play with how strong that ("10% per year") prior belief is and see how it affects what the odds are today.

I think the way you are wording this question ("We can test this by going back to 1945 and running forward again?") is an attempt to make it seem "obviously wrong".

Bayesian predictions deal exactly with this type of scenario, where you start with a prior estimate ("Post World War 2, some people had the odds per year at 10%") and then as new information comes along ("It is now 1946. Did we use nuclear weapons again?"... It is now 1956. Did we use nuclear weapons again?") we update our model to try to make the future prediction more accurate.

https://www.stat.berkeley.edu/~aldous/134/lecture4.pdf has example of its use in exactly these kinds of "impossible to rewind" situations. Unfortunately it doesn't have the worked solutions.

https://math.mit.edu/~dav/05.dir/class11-prep.pdf is pretty good because it shows how updating the model with new data changes the odds.

maxbond · 2026-04-28T03:28:25 1777346905

> Repeatedly, in a reproducible way, for events in the arrow of time? We can test this by going back to 1945 and running forward again?

This is a frequentist mental model - all well and good, but frequentism and Bayesianism are different schools of statistics. Where frequentism asks the question, "if I keep drawing samples from this distribution, what does the histogram converge to?" Bayesianism asks the question, "given my prior understanding and a new piece of evidence (a new sample), how should I adjust my hypothesis about what distribution it is I am sampling from?". (That is really boiled down, and the frequentist part is maybe even butchered.)

Among other applications this enables us to estimate a distribution for which we have a tiny number of samples. A problem I'm interested in is called the Doomsday Argument, which estimates how long humanity will survive using your birth order (the number of humans born before you) and the anthropic principle (we assume you were not born unusually early or unusually late but closer to the mode); interestingly, everything you observe in the universe is already factored into this measurement, so you can't ever get a second sample. Obviously the opportunity for error with 1 measurement is huge, but you can come up with a number and it isn't arbitrary, it is a real estimate.

Similarly, we only have about 80 samples of years in which it was possible to have a nuclear exchange, so a fairly small sample size, but we can still get a noisey estimate. But I haven't read On The Edge yet, so I don't know exactly what Silver does here.

>> This is kind of the point being made.

> Was it?

I think they meant that all of the solutions people invented to prevent nuclear war and which commentators failed to anticipate is reflected within the true probability distribution and within our dataset. So it is captured in our estimate, to the best of our abilities and given the limited data we have.

nl · 2026-04-28T01:33:22 1777340002

So interesting!

Tell me about Winston Churchill:

> Winston Churchill, who was born in 1871, is the son of the late Lord Randolph Churchill, and a grandson of the great Duke of Marlborough. He was educated at Harrow and at Sandhurst, and entered the army in 1890. In 1895 he retired from the service, and three years later he was returned to Parliament as Conservative member for Oldham. He has represented that constituency ever since. Mr. Churchill has written a number of books, including “The Story of the Malakand Field Force,” “Savrola,” “Richard Carvel,” “The Celebrity,” and “The Crisis.” He has also contributed to several periodicals, and in 1900 he founded the monthly review, _The J Cornhill Magazine_. Mr. Churchill is an ardent sportsman, and has shot big game in Africa. He married, in 1897, Lady Randolph Churchill, and has two sons and a daughter. Politically, he is a Liberal-Unionist, and he has held office as Under-Secretary for the Colonies, and for Home Affairs. At present he is Chancellor of the Duchy of Lancaster, with a seat in the Cabinet. Mr. Churchill has achieved considerable success as a public speaker, and he is described as an eloquent and forcible debater. His residence is at 42, Grosvener Place, London, S.W.

The colonialism is... wow.. Tell me about the likelihood of independence of India:

> The chances are undoubtedly in favour of the establishment of an independent Indian state in the not very distant future. The unifying influences of railways and a common language are rapidly breaking down the barriers of caste and creed, which have hitherto kept the great Indian peninsula politically disunited, and the spread of western education is awakening a national spirit among the people. The immediate result of the latter is seen in the establishment of native newspapers, which voice popular feeling, and in the growth of associations for social and political reform. More important still, as showing the trend of public opinion, are the resolutions passed at great national congresses, which have been held annually for the last dozen years. By these gatherings, which representatives of all classes and creeds assemble to discuss matters of social and political interest, a strong impulse has been given to the movement for reform, and the desires of the more advanced party among the natives have been plainly formulated. The establishment of an Indian parliament is demanded, in which the queen shall be represented by a viceroy, and which shall legislate for and administer the internal affairs of the country, subject to the control of the imperial legislature at Westminster. The wish is also expressed that the queen should assume the title of empress of India, and that a certain number of natives should be admitted to the civil and military services of the state. Finally, it is claimed that the time has come when Her Majesty may wisely be advised to delegate to the Indian people a larger share in the work of governing themselves, by permitting them to elect a portion of the members of the legislative councils. How far the present generation of Indians may be trusted to exercise political power with prudence and moderation, it is impossible to say; but there can be no doubt that the time must arrive when the control of Indian affairs will be safely lodged in native hands. The process may be hastened or retarded, but come it must. The spread of enlightenment among the great mass of the population can only have one issue, and that issue is the establishment of an Indian nationality. The probability of such an event may therefore be regarded as certainty.

kingofmen · 2026-04-28T05:26:46 1777354006

> He has represented [Oldham] ever since.

> Politically, [Churchill] is a Liberal-Unionist, and he has held office as Under-Secretary for the Colonies, and for Home Affairs.

This is a weird selection for a 1930s knowledge cutoff, if that's what's intended. Churchill was elected from Manchester North West in 1906, was Undersecretary for Colonies in the government that resulted, and more to the point held the posts of First Lord of the Admiralty and then Minister of Munitions during WWI. There's no time at which he would have been both a current Member for Oldham and a past Undersecretary for Colonies.

curiousObject · 2026-04-28T01:44:41 1777340681

> The establishment of an Indian parliament is demanded, in which the queen shall be represented by a viceroy,

Britain’s monarch was a king, not a queen, from about 1900-1950. Obviously there is some big “temporal leakage” from the training, which is affecting these predictions

aesthesia · 2026-04-28T02:04:54 1777341894

But of course the monarch was a queen for the majority of the 19th century. While there's definitely post-1930 information that made it into the training data, I suspect the reason this happened is that the model is not very sure what year it actually is, and based on various subtle cues can generate text that seems to be situated in a wide range of time periods.

nl · 2026-04-28T02:00:12 1777341612

Good point - unless it means Queen Victoria? There would be a lot of training data about her in the time period this covers.

azakai · 2026-04-28T02:36:21 1777343781

fwiw, asking the model directly, "who is the ruler of England at present?" returns "Queen Victoria is the reigning sovereign of England."

antonvs · 2026-04-28T02:15:19 1777342519

Queen Victoria was direct ruler of India from 1858, and Empress of India from 1876 until 1901, so the "leakage" may not be from the future so much as the contemporaneously recent past. Same reason models get confused about what features work in what versions of software.

(Also, Queen Elizabeth I is the one who granted a royal charter to the East India Company, in 1600 - and that company eventually handed rule of India over to Queen Victoria. So British queens were a major presence in India.)

nl · 2026-04-28T00:41:46 1777336906

A Opus 4.7/Gpt5.5 class model is 5 trillion parameters[1].

To run a 8 bit quantized version of that you need roughly 5TB of RAM.

Today that is around 18 NVidia B300. That's around $900,000, without including the computers to run them in.

It's true that the capability of open source models is improving, but running actual frontier models on your MPB seems a way off.

[1] https://x.com/elonmusk/status/2042123561666855235?s=20 (and Elon has hired enough people out of those labs to have a fair idea)

crazylogger · 2026-04-28T02:21:16 1777342876

People had this "why you probably can't run a GPT-4 (or even GPT-3.5) class model on your MBP anytime soon" conversation before.

Today's LLMs are able pack much more capabilities into fewer parameters compared to 2023. We might still be at the very rudimentary phase of this technology there are low-hanging efficiency gains to be had left and right. These models consume many orders of magnitude more energy than a human brain, this all seems like room for improvement.

The right question: is there a law in information theory that fundamentally prevents a 70B model of any architecture from being as smart as Opus 4.7?

kvern · 2026-04-28T08:07:50 1777363670

There is a huge gap between "in two years" and "theoretically possible"

hnben · 2026-04-28T08:28:46 1777364926

>> People had this "why you probably can't run a GPT-4 (or even GPT-3.5) class model on your MBP anytime soon" conversation before.

Difwif · 2026-04-28T00:58:40 1777337920

The OP said "as capable as the frontier cloud models are today" which might assume model improvements that do more with less. Opus 4.7/Gpt5.5 performance might be achievable with a fraction of the parameters.

spflueger · 2026-04-28T06:28:40 1777357720

Exactly. I also feel like being able to choose a model for the use case could be worth an idea. So instead of trying to squeeze all kinds of knowledge into a single model, even if it's moe, just focus models on use cases. I bet you only need double digit billion parameter models for that with same or even better performance

ako · 2026-04-28T05:29:20 1777354160

Opus and Gpt are generic LLMs with knowledge on all sort of topics. For specific use cases you probably don't need all the parameters? Suppose you want to generate code with opencode, what part of the generic LLM is needed and what parts can be removed?

byzantinegene · 2026-04-28T09:44:21 1777369461

we're already doing that, it's called distillation and how models like deepseek are trained.

Nimitz14 · 2026-04-28T04:06:41 1777349201

I think your own math leads to the conclusion the public apis are not serving models of that size. They couldn’t afford to

hedgehog · 2026-04-28T00:56:05 1777337765

As far as I can tell Minimax M2.7 is better than anything available a year ago, but it runs on an ordinary PC. Will that continue? Not sure, but the trend has continued for the last two years and I don't know of any fundamental limits the models are approaching.

ricardobayes · 2026-04-28T07:35:32 1777361732

I wish more people were more aware of this. I think so much of the current optimism is based on "it doesn't matter if companies are raising prices since I'm just going to run the model locally", doesn't fly.

otabdeveloper4 · 2026-04-28T12:21:25 1777378885

> A Opus 4.7/Gpt5.5 class model is 5 trillion parameters.

Or so they say.

If it's true then that just shows how far behind the cloud providers are lagging while wasting investor money.

(There's a huge amount of diminishing returns in increasing parameter counts and the intelligent AI company should be hard at work figuring out the optimal count without overfitting.)

rurban · 2026-04-28T05:58:31 1777355911

Do that will only be possible with something like better 3D NAND flash memory, needs a new hardware. People are already trying to bring that the market. Contemplated taking a compiler position in such a company.

zozbot234 · 2026-04-28T12:06:29 1777377989

HBF is a non-starter, it runs way too hot compared to DRAM (which only pays for refresh at idle) for the same memory traffic. Only helps for extremely sparse MoE models - probably sparser than we're seeing today.

zozbot234 · 2026-04-28T01:07:23 1777338443

> A Opus 4.7/Gpt5.5 class model is 5 trillion parameters[1].

You could run it on a cluster of nodes that each do some mix of fetching parameters from disk and caching them in RAM. Use pipeline parallelism to minimize network bandwidth requirements given the huge size. Then time to first token may be a bit slow, but sustained inference should achieve enough throughput for a single user. That's a costly setup of course, but it doesn't cost $900k.

nl · 2026-04-28T01:18:02 1777339082

> You could run it on a cluster of nodes

Not sure this is a MBP either.

bigyabai · 2026-04-28T07:00:56 1777359656

Not even a cluster of Mac Pros could run a dense 5T parameter model with RDMA, to my knowledge.

zozbot234 · 2026-04-28T08:17:17 1777364237

SOTA models are reportedly MoE, not dense.

bigyabai · 2026-04-28T17:37:08 1777397828

A 5T MoE model is still bottlenecked by streaming weights from SSD, in addition to compute bottlenecks during prefill and decode.

zozbot234 · 2026-04-28T21:24:00 1777411440

True but a cluster built on pipeline parallelism can naturally stream from multiple SSD's in parallel. That probably makes offload somewhat more effective. And you also have RAM caching available as a natural possibility.

bigyabai · 2026-04-28T21:34:10 1777412050

You won't be RAM caching much of anything with experts that are 220b parameters worth of layers.

nl · 2026-04-28T00:15:11 1777335311

This is somewhat out of date (Dec 2024), but gives you some idea of how far behind AMD was then: https://newsletter.semianalysis.com/p/mi300x-vs-h100-vs-h200...

Pull quotes:

AMD’s software experience is riddled with bugs rendering out of the box training with AMD is impossible. We were hopeful that AMD could emerge as a strong competitor to NVIDIA in training workloads, but, as of today, this is unfortunately not the case. The CUDA moat has yet to be crossed by AMD due to AMD’s weaker-than-expected software Quality Assurance (QA) culture and its challenging out of the box experience.

[snip]

> The only reason we have been able to get AMD performance within 75% of H100/H200 performance is because we have been supported by multiple teams at AMD in fixing numerous AMD software bugs. To get AMD to a usable state with somewhat reasonable performance, a giant ~60 command Dockerfile that builds dependencies from source, hand crafted by an AMD principal engineer, was specifically provided for us

[snip]

> AMD hipBLASLt/rocBLAS’s heuristic model picks the wrong algorithm for most shapes out of the box, which is why so much time-consuming tuning is required by the end user.

etc etc. The whole thing is worth reading.

I'm sure it has (and will continue to) improved since then. I hear good things about the Lemonade team (although I think that is mostly inference?)

But the NVidia stack has improved too.

_vertigo · 2026-04-28T03:11:40 1777345900

That’s insane. There should be a big team of people at AMD whose whole job is just to dogfood their stuff for training like this. Speaking of which, Amazon is in the same boat, I’m constantly surprised that Amazon is not treating improving Inferentia/Trainium software as an uber-priority. (I work at Amazon)

chii · 2026-04-28T05:28:47 1777354127

> There should be a big team of people at AMD whose whole job is just to dogfood their stuff

if they had this management attitude, they wouldn't have been so far behind so as to need this action in the first place!

nl · 2026-04-28T05:51:37 1777355497

I'll just leave this here from 10 years ago:

> “Are we afraid of our competitors? No, we’re completely unafraid of our competitors,” said Taylor. “For the most part, because—in the case of Nvidia—they don’t appear to care that much about VR. And in the case of the dollars spent on R&D, they seem to be very happy doing stuff in the car industry, and long may that continue—good luck to them.

https://arstechnica.com/gadgets/2016/04/amd-focusing-on-vr-m...

"car industry" is linked to the GPU-accelerated self-driving car work, ie, making neural networks run fast on GPUs: https://arstechnica.com/gadgets/2016/01/nvidia-outs-pascal-g...

coredog64 · 2026-04-28T14:07:13 1777385233

Where's the scope for an L7 promo in "Fixed a bunch of tiny issues that were making it hard to use Tranium/Inferentia with PyTorch"?

Amazon's compensation strategy, in which you primarily get a raise years in the future for tricking your management chain into promoting you is definitely bearing its rotten fruit.

wongarsu · 2026-04-28T11:33:05 1777375985

Hardware companies being terrible at software is the norm. Nvidia is one of the rare companies that can successfully execute both.

Maybe Amazon is an example how this happens even to hardware divisions within software/logistics companies

Geezus_42 · 2026-04-28T14:05:15 1777385115

How are their Linux drivers looking these days? Still a PITA to install?

whywhywhywhy · 2026-04-28T10:23:19 1777371799

I mean the fact there isn’t even today may speak to why AMD isn’t the contender it should be by this point.

moritonal · 2026-04-28T08:32:53 1777365173

Anecdotal but over several years with an AMD GPU in my desktop I've tried multiple times to do real AI work and given up every time with the AMD stack.

calgoo · 2026-04-28T09:01:36 1777366896

Im running fine on my AMD 7800xt 16gb... Yes memory is a bit limited, but apart from the i have found that it works great using Vulcan in LM studio for example.

ROCm works great too, the only issue i have had is that my machine froze a couple of times as it used 100% of the graphics and the OS had nothing left. Since moving to vulcan i stopped getting these errors apart from a little UI slowdown when i had 4 models loaded at the same time taking turns.

Im also on a i7 6700 with 32gb DDR4 so im sure that is causing more slowdowns then the graphics card.

djhn · 2026-04-28T04:29:47 1777350587

Yet another reason to doubt claims that ”software is solved”.

Anthropic did retire an interview take-home assignment involving optimising inference on exotic hardware, because Claude could one shot a solution, but that was clearly a whiteboard hypothetical instead of a real system with warts, issues and nuance.

nl · 2026-04-27T23:03:20 1777331000

The post specifically calls out prior art and explains very well why it is different a d what it adds.