I put a datacenter GPU in my gaming PC
Tags: DIY hardware, local LLM inference, datacenter GPU, NVIDIA V100, PCIe adapter, NixOS
Author: birdculture
Date: 5/31/2026
Article Summary:
A user installs a datacenter GPU in their gaming PC to run local LLM inference, achieving 32 tokens per second on a 27-billion-parameter model with 32 GB of total VRAM.