the question is: how does the prompt processing time on this compare to M3 Ultra... | Hacker News

Hacker Newsnew | past | comments | ask | show | jobs | submit

		behnamoh 85 days ago \| parent \| context \| favorite \| on: Nvidia DGX Spark: great hardware, early days for t... the question is: how does the prompt processing time on this compare to M3 Ultra because that one sucks at RAG even though it can technically handle huge models and long contexts...

zozbot234 85 days ago [–]

Prompt processing time on Apple Silicon might benefit from making use of the NPU/Apple Neural Engine. (Note, the NPU is bad if you're limited by memory bandwidth, but prompt processing is compute limited.) Just needs someone to do the work.

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact