ARTFEED — Contemporary Art Intelligence

Q2D2: A New Quantization Method for Neural Audio Codecs

other · 2026-05-18

Researchers have introduced Two-Dimensional Quantization (Q2D2), a novel quantization scheme for neural audio codecs that projects feature pairs onto structured 2D grids such as hexagonal, rhombic, or rectangular tiling. Unlike conventional methods like Residual Vector Quantization (RVQ), Vector Quantization (VQ), and Finite Scalar Quantization (FSQ), Q2D2 preserves the geometric structure of the latent space and improves correlation capture between features. This leads to enhanced representation learning, better codebook utilization, and lower token rates. The method achieves codebook sizes comparable to traditional approaches while improving audio compression efficiency. The paper is available on arXiv under identifier 2512.01537.

Key facts

  • Q2D2 projects feature pairs onto structured 2D grids
  • Grid types include hexagonal, rhombic, and rectangular tiling
  • Q2D2 improves upon RVQ, VQ, and FSQ methods
  • Enhances geometric structure of latent space
  • Improves correlation capture between features
  • Achieves lower token rates
  • Codebook sizes comparable to conventional methods
  • Paper available on arXiv: 2512.01537

Entities

Institutions

  • arXiv

Sources