GPU VRAM is the bottleneck currently, check out r/localLlama for benchmarks and ...

		Rastonbury on Nov 10, 2024 \| parent \| context \| favorite \| on: Everything I've learned so far about running local... GPU VRAM is the bottleneck currently, check out r/localLlama for benchmarks and calculators for what models can fit into what cards approximately