Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

According to NVIDIA. > a single server with eight H200 GPUs connected using NVLink and NVLink Switch can run the full, 671-billion-parameter DeepSeek-R1 model at up to 3,872 tokens per second.

You can rent a single H200 for 3$/hour.



Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: