GateGPT: 56k tokens per second Transformer (KV cache) on FPGA at 80 MHz

AI & Machine Learning(twitter.com)view on HackerNews

custom chipTransformerKV cachemicroGPTGPTFPGA

Author: laxmena

Date: 6/16/2026

Article Summary:

A researcher creates a custom chip that can run a full Transformer model at a high speed.