When using low-precision formats like float8 you usually have to upcast the activations to BF16 before normalising. So the normalisation layers account for a proportionally larger share of the compute as you move to lower precision. Replacing these layers would help reduce the compute cost significantly.
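A minimal sketch of the upcast-then-normalise pattern, assuming a recent PyTorch (2.1+) where torch.float8_e4m3fn exists; fp8_layer_norm is just an illustrative helper name, not anyone's actual API:

    import torch
    import torch.nn.functional as F

    def fp8_layer_norm(x_fp8, weight, bias):
        # Normalisation wants the range/precision of a wider type,
        # so upcast the float8 activations to bfloat16 first.
        x = x_fp8.to(torch.bfloat16)
        y = F.layer_norm(x, x.shape[-1:], weight, bias)
        # Cast back down so the next matmul can run in float8 again.
        return y.to(torch.float8_e4m3fn)

    x = torch.randn(4, 1024, dtype=torch.bfloat16).to(torch.float8_e4m3fn)
    w = torch.ones(1024, dtype=torch.bfloat16)
    b = torch.zeros(1024, dtype=torch.bfloat16)
    out = fp8_layer_norm(x, w, b)

The matmuls stay in float8, but the norm itself runs in BF16, which is why its relative cost grows as everything else gets cheaper.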
That's mostly because Julia questions get answered on its Discourse or Slack. The sharp decline is due to an automatic cross-post bot that stopped working.
No one bothered fixing it, in large part because Discourse is the main place of discussion, as far as I know.
Even languages like Python and JavaScript, which are huge, show a decline after 2022, which suggests ChatGPT is probably responsible. It would be better to have some other measure imo.
It measures the proportion of questions for that language out of all languages. So, if there is a general decline in Stack Overflow questions, it's already accounted for in the metric.
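A toy illustration with made-up numbers of why a site-wide drop cancels out:

    # Hypothetical question counts before and after a site-wide decline.
    julia_before, total_before = 2_000, 1_000_000
    julia_after,  total_after  = 1_000,   500_000   # everything halves

    share_before = julia_before / total_before   # 0.002
    share_after  = julia_after  / total_after    # 0.002

    # The share only moves if Julia declines faster (or slower) than the site overall.
    print(share_before == share_after)           # True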
Can you use this to launch an Intel VM on Apple Silicon and vice versa? I'm interested in doing this so I can compile C++ applications for different architectures on macOS. Do you know of any other "easy" methods?
You can do this without virtualization/emulation: pass '-arch x86_64' or '-arch arm64' to clang. Or both, for a universal binary. And on Apple Silicon, you can run and test both thanks to Rosetta.
Tensors in deep learning are not the same thing as the tensors physicists use - blame the DL community for overloading the name :). DL tensors are just N-dimensional arrays of data, and there is no concept of covariance and contravariance of the dimensions. You could think of DL tensors as Cartesian tensors, except they don't need to conform to the transformation laws that physics tensors do.
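For example (plain NumPy, nothing specific to any DL framework):

    import numpy as np

    # A "tensor" in the DL sense: just a 3-dimensional array of numbers.
    t = np.random.rand(2, 3, 4)

    # The axes are interchangeable labels, not covariant/contravariant indices:
    # you can permute or reshape them freely, and no transformation law
    # (change of basis, metric, etc.) is implied or enforced.
    t2 = t.transpose(2, 0, 1)   # shape (4, 2, 3)
    t3 = t.reshape(6, 4)        # shape (6, 4)
    print(t.shape, t2.shape, t3.shape)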
Warp is great - I use it as my daily terminal. The best features are being able to edit commands, having the output chunked into blocks, and AI-generated commands at your fingertips.
Another implementation is still unlikely to have the exact same bugs. A rewrite in Rust especially will force the code to be structured differently (Rust is very opinionated about that).
The spec is big enough that the team won't be able to just write the exact same implementation from memory.
I don't disagree that it's sufficient, but ideally different people would implement the spec. If you have a particular mental model or understanding of a part of the spec that doesn't match what the spec actually says, that misunderstanding is likely to carry over unchanged into a second implementation.