Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

Actual title: "Solve the GPU Cost Crisis with kvcached: A library to enable virtualized, elastic KV cache for LLM serving on shared GPUs"


Yes, we've put that in the title above (shortened to fit HN's 80 char limit). Submitted title was "Time to build a GPU OS? Here is the first step".




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: