RAG-Based EEG-to-Text Decoding Pipeline Outperforms Random Baseline

ai-technology · 2026-05-20

A team of researchers has introduced a retrieval-augmented generation (RAG) framework aimed at translating EEG signals into text at the sentence level. This innovative approach integrates an EEG encoder that aligns with semantic embeddings, a vector retrieval phase, and a large language model (LLM) to enhance the generated outputs. Testing conducted on the ZuCo dataset, which features single-trial EEG recordings taken during silent reading, demonstrates that the system successfully captures linguistic information exceeding random baseline levels, marking a significant advancement as it does so without teacher forcing during inference. This research tackles the challenge of low signal-to-noise ratios in brain-computer interface studies.

Key facts

Proposes RAG-based sentence-level EEG-to-text decoding pipeline
Combines EEG encoder, vector retrieval, and LLM
Uses ZuCo dataset with single-trial EEG recordings during silent reading
Surpasses random baseline performance without teacher forcing
Addresses low signal-to-noise ratio in EEG decoding
First to achieve sentence-level decoding without teacher forcing
Published on arXiv with ID 2605.17503
Research conducted by anonymous authors

Entities

Institutions

arXiv

Locations

Zurich
Switzerland

Sources

arXiv cs.AI — 2026-05-19