GateGPT: 56k tokens per second Transformer (KV cache) on FPGA at 80 MHz
Tags: custom chip, Transformer, KV cache, microGPT, GPT, FPGA
Author: laxmena
Date: 6/16/2026
Article Summary:
A researcher builds GateGPT, a custom chip design that runs a full Transformer model with a KV cache on an FPGA, reaching 56k tokens per second at an 80 MHz clock.