Asymmetric Quantization: Near-Lossless Retrieval with 97% Storage Reduction
asymmetric quantizationlate interaction retrievalstorage reductionint8binary signs
Author: breadislove
Date: 6/29/2026
Article Summary:
The article discusses asymmetric quantization, a technique to reduce storage costs in late interaction retrieval systems by storing document vectors as binary signs and keeping query vectors at higher precision.