I’m not so sure it’s negligible. My anecdotal experience is that since Apple Silicon chips were found to be “ok” enough to run inference with MLX, more non-technical people in my circle have asked me how they can run LLMs on their Macs.
A smaller market than gamers or datacenters, for sure.
It's annoying. I do LLMs for work and have a bit of an interest in them, plus doing stuff with GANs etc.
I have a bit of an interest in games too.
If I could get one platform for both, I could justify 2k, maybe a bit more.
I can't justify that for just one half: running games on a Mac, which right now means going through Linux? No thanks.
And on the PC side, Nvidia consumer cards only go up to 24GB, which is a bit limiting for LLMs while also being very expensive - and I only play games every few months.
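For rough context on why 24GB feels limiting, here's a back-of-envelope sketch (the 4-bit quantization and ~1.2x runtime overhead figures below are illustrative assumptions, not measurements):

```python
# Rough memory needed to hold a quantized model's weights (illustrative only).
# Assumes 4-bit weights plus a ~1.2x overhead for KV cache and activations.
def vram_gb(params_billion: float, bits: float = 4.0, overhead: float = 1.2) -> float:
    weights_gb = params_billion * bits / 8  # billions of params x (bits/8) bytes each ~= GB
    return weights_gb * overhead

for size in (8, 24, 70):
    print(f"{size}B @ 4-bit: ~{vram_gb(size):.0f} GB")
# 8B (~5 GB) fits a 24GB card with room to spare; 70B (~42 GB) does not,
# which is where 64GB+ of memory starts to matter.
```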
The new $2k card from Nvidia will be 32GB but your point stands. AMD is planning a unified chiplet based GPU architecture (AI/data center/workstation/gaming) called UDNA, which might alleviate some of these issues. It's been delayed and delayed though - hence the lackluster GPU offerings from team Red this cycle - so I haven't been getting my hopes up.
Maybe (LP)CAMM2 memory will make running models just cheap enough that I can set up a hosting server for them and keep doing my usual midrange gaming GPU thing before then.
I mean negligible to their bottom line. Tons of units may or may not get bought, but the margin on a single datacenter system would buy tens of these.
It’s purely an ecosystem play imho. It benefits the kind of people who will go on to make potentially cool things and will stay loyal.
> It’s purely an ecosystem play imho. It benefits the kind of people who will go on to make potentially cool things and will stay loyal.
It will be massive for research labs. Most academics have to jump through a lot of hoops to get to play with not just CUDA, but also GPUDirect/RDMA/InfiniBand etc. If you get older/donated hardware, you may have a large cluster but not the newer features.
They have, because until now Apple Silicon was the only practical way for many to work with larger models at home: the machines can be configured with 64-192GB of unified memory. Even the laptops can be configured with up to 128GB of unified memory.
Performance is not amazing (roughly 4060 level, I think?) but in many ways it was the only game in town unless you were willing and able to build a multi-3090/4090 rig.
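For anyone curious, here's a minimal sketch of what running a model locally with MLX looks like, using the mlx-lm package (the model name is just an example from the mlx-community hub, and the API details may shift between versions):

```python
# Minimal local LLM inference on Apple Silicon via MLX.
# Requires: pip install mlx-lm  (Apple Silicon Macs only)
from mlx_lm import load, generate

# Example quantized model; any mlx-community model should load the same way.
model, tokenizer = load("mlx-community/Mistral-7B-Instruct-v0.3-4bit")

response = generate(
    model,
    tokenizer,
    prompt="Explain unified memory in one paragraph.",
    max_tokens=256,
)
print(response)
```

The weights sit in the same unified memory the CPU uses, which is why a 128GB laptop can load models that no single consumer GPU can.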
I'm currently wondering how likely it is I'll get into deeper LLM usage, and therefore how much Apple Silicon I need (because I'm addicted to macOS). So I'm some way closer to your steel man than you'd expect. But I'm probably a niche within a niche.
Doubt it; a year ago, useful local LLMs on a Mac (via something like ollama) were barely taking off.
If what you say is true, you were among the first 100 people on the planet doing this, which, btw, further supports my argument about how extremely rare that use case is for Mac users.
People were running llama.cpp on Mac laptops in March 2023 and Llama2 was released in July 2023. People were buying Macs to run LLMs months before M3 machines became available in November 2023.