Hacker Newsnew | past | comments | ask | show | jobs | submitlogin
A handy metric is needed for gauging if GPUs are being used optimally (theregister.com)
8 points by cyberhost 4 months ago | hide | past | favorite | 2 comments


MFU is probably the best but requires application logic. You can export metrics at the infra level like SM efficiency. We explain it a bit how we used it to do some optimization.

https://www.trainy.ai/blog/gpu-utilization-misleading


MFU is indeed very useful. Today we found that while scaling Karpathy’s nanoGPT to multiple H100 nodes the MFU calculation itself was dropping MFU performance![1]

Commenting it out improved iter performance by almost 30%

1. https://github.com/modal-labs/multinode-training-guide/blob/...




Consider applying for YC's Winter 2026 batch! Applications are open till Nov 10

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: