Infilling Extraction Reveals Higher Memorization in Diffusion Language Models
A new study has introduced a technique called 'infilling extraction,' which aims to evaluate how diffusion language models (DLMs) retain training data. The findings show that DLMs can reveal up to three times more exact sequences than previously thought. Unlike autoregressive models, DLMs can denoise masked tokens from any location, making prefix-only probing ineffective. This approach uses a binary mask to create a bidirectional inductive bias. When tested on LLaDA-8B and Dream-7B in five different extraction modes, three training pipelines, and three datasets, it was found that edge-conditioned masks provided the best data extraction results. You can find the complete paper on arXiv.
Key facts
- Infilling extraction is a new data-extraction protocol for diffusion language models.
- It uses an arbitrary binary mask that subsumes prefix-only probing.
- Edge-conditioned masks extract up to three times more verbatim sequences.
- Experiments were conducted on LLaDA-8B and Dream-7B.
- Five extraction modes, three training pipelines, and three corpora were tested.
- The study covers verbatim and partial leakage.
- Prefix-only probing underestimates memorization in DLMs.
- The paper is arXiv:2605.24173v1.
Entities
Institutions
- arXiv