PyTorch actually has surprisingly good support for Apple Silicon through its MPS backend. Occasionally an operation needs to fall back to the CPU, but many applications can run inference entirely on the GPU cores.
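A minimal sketch of what that looks like (PYTORCH_ENABLE_MPS_FALLBACK is PyTorch's real opt-in for the CPU fallback, and it has to be set before torch is imported):

    import os

    # Opt in to CPU fallback for ops the MPS backend doesn't implement yet.
    # Must be set before importing torch; fallback ops run noticeably slower.
    os.environ.setdefault("PYTORCH_ENABLE_MPS_FALLBACK", "1")

    import torch

    device = torch.device("mps" if torch.backends.mps.is_available() else "cpu")

    x = torch.randn(1024, 1024, device=device)
    y = x @ x  # matmul dispatched to the GPU via Metal Performance Shaders
    print(y.device)  # mps:0 on Apple Silicon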
I’ve found it to be pretty terrible compared to CUDA, especially with Hugging Face transformers. There’s no technical reason it has to be that way, though. Apple should fix that.
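To be fair, getting a transformers pipeline onto the MPS backend is just a device string, so it's easy to try yourself (a sketch, with gpt2 as a stand-in model; the slowness shows up per generated token, not in the setup):

    import torch
    from transformers import pipeline

    device = "mps" if torch.backends.mps.is_available() else "cpu"

    # gpt2 is just a small stand-in; any causal LM works the same way.
    pipe = pipeline("text-generation", model="gpt2", device=device)
    print(pipe("Apple Silicon is", max_new_tokens=20)[0]["generated_text"])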
MLX will probably be even faster than that, if the model is already ported. Faster startup time too. That’s my main pet peeve though: there’s no technical reason why PyTorch couldn’t be just as good. It’s just underfunding and neglect.
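If the model has been ported, the mlx-lm package makes the comparison easy to run (a sketch; the mlx-community checkpoint name is just an example of a pre-converted model):

    # Requires the mlx-lm package; the checkpoint is one of the
    # pre-converted models published under mlx-community on the HF Hub.
    from mlx_lm import load, generate

    model, tokenizer = load("mlx-community/Mistral-7B-Instruct-v0.2-4bit")
    text = generate(model, tokenizer, prompt="Hello,", max_tokens=64)
    print(text)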