ARTFEED — Contemporary Art Intelligence

IVF-TQ: Codebook-Free Residual Compression for Streaming Vector Search

other · 2026-05-25

A new article on arXiv has unveiled IVF-TQ, a novel inverted-file index aimed at approximate nearest neighbor searches, featuring a data-independent residual compression layer. Unlike common methods like PQ, OPQ, and ScaNN, which rely on adjusting a codebook based on a specific dataset and suffer from decreased recall as databases grow, IVF-TQ uses a fixed random rotation along with a pre-calculated Lloyd-Max scalar quantizer that depends only on the bit width b and dimension d. The only part that requires training is the coarse k-means partition of IVF. This method provides a consistent inner-product error bound across the sphere, a significant advantage over learned-codebook approaches. Experiments reveal that product quantization can cause a -3.8pp drop in recall, highlighting the problem that IVF-TQ addresses.

Key facts

  • IVF-TQ uses a data-independent residual compression layer
  • Dominant methods (PQ, OPQ, ScaNN) fit a codebook to an initial sample
  • Product quantization degrades -3.8pp recall under shuffled-i.i.d. ingestion
  • Compression layer: fixed random rotation + Lloyd-Max scalar quantizer
  • Only IVF coarse k-means partition is trained
  • Uniform-over-sphere inner-product error bound depends on (b, d, delta)
  • Paper is on arXiv with ID 2605.17415
  • Method is codebook-free

Entities

Institutions

  • arXiv

Sources