I want to build a private AI setup for my company. I'm thinking of hosting our model locally instead of in the cloud, on a server at the office that my team can access. Has anyone else done this successfully?
This setup will be used internally for uncensored chat, coding, image gen and analysis.
We're thinking of using a combo of hardware:
- RTX 4090 GPU (heard it's a beast)
- Threadripper Pro 5955WX (anyone used this one before?)
- 1TB NVMe SSD
What are your picks for a local AI setup? And what’s the minimum budget to achieve it?
How many GPUs you need depends entirely on the size of your team, how often they use it, and the size of the models you are comfortable running.
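To put rough numbers on the model-size part, here is a back-of-envelope sketch. It assumes weights dominate memory use and adds a flat 20% overhead for KV cache and activations; that factor is my guess, not a measured value, and it grows with context length and the number of concurrent users. The function names are made up for illustration. The 24 GB figure is the VRAM on a single RTX 4090.

```python
import math

def estimate_vram_gb(params_billion, bits_per_weight, overhead=1.2):
    """Rough VRAM estimate in GB: weight size times an assumed overhead factor."""
    # 1 billion parameters at 8 bits per weight is roughly 1 GB
    weight_gb = params_billion * bits_per_weight / 8
    return weight_gb * overhead

def gpus_needed(vram_gb, gpu_vram_gb=24):
    """Number of GPUs needed if the model is split evenly (24 GB = one RTX 4090)."""
    return math.ceil(vram_gb / gpu_vram_gb)

if __name__ == "__main__":
    # Example model sizes and quantization levels, chosen only for illustration
    for name, params, bits in [("7B fp16", 7, 16), ("13B 4-bit", 13, 4), ("70B 4-bit", 70, 4)]:
        need = estimate_vram_gb(params, bits)
        print(f"{name}: ~{need:.1f} GB -> {gpus_needed(need)} x 24 GB GPU(s)")
```

This only covers fitting one copy of a model in memory. Team size and usage frequency affect throughput, which is better measured than guessed.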
I generally recommend renting instances on something like RunPod to build a good estimate of your actual usage before committing a bunch of money to hardware.
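Once you have real usage numbers from renting, a simple break-even calculation can tell you whether buying makes sense. This is a minimal sketch: the function name is hypothetical, and every dollar figure in the example is a made-up placeholder, not a quote from any provider or retailer. Plug in your own prices.

```python
def break_even_hours(hardware_cost, rental_rate_per_hour, power_cost_per_hour=0.0):
    """Hours of use after which owning the hardware is cheaper than renting.

    Ignores resale value, maintenance, and admin time (simplifying assumptions).
    """
    saving_per_hour = rental_rate_per_hour - power_cost_per_hour
    if saving_per_hour <= 0:
        # Running your own box costs at least as much per hour as renting
        return float("inf")
    return hardware_cost / saving_per_hour

if __name__ == "__main__":
    # Placeholder numbers only: substitute your measured usage and real prices
    hours = break_even_hours(hardware_cost=5000, rental_rate_per_hour=0.50,
                             power_cost_per_hour=0.10)
    hours_per_month = 160  # hypothetical team usage measured during the rental trial
    print(f"Break-even after ~{hours:.0f} GPU-hours, "
          f"about {hours / hours_per_month:.0f} months at {hours_per_month} h/month")
```

If the break-even point is years away at your measured usage, renting is probably the better deal.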