RAG-Based EEG-to-Text Decoding Pipeline Outperforms Random Baseline
A team of researchers has introduced a retrieval-augmented generation (RAG) framework aimed at translating EEG signals into text at the sentence level. This innovative approach integrates an EEG encoder that aligns with semantic embeddings, a vector retrieval phase, and a large language model (LLM) to enhance the generated outputs. Testing conducted on the ZuCo dataset, which features single-trial EEG recordings taken during silent reading, demonstrates that the system successfully captures linguistic information exceeding random baseline levels, marking a significant advancement as it does so without teacher forcing during inference. This research tackles the challenge of low signal-to-noise ratios in brain-computer interface studies.
Key facts
- Proposes RAG-based sentence-level EEG-to-text decoding pipeline
- Combines EEG encoder, vector retrieval, and LLM
- Uses ZuCo dataset with single-trial EEG recordings during silent reading
- Surpasses random baseline performance without teacher forcing
- Addresses low signal-to-noise ratio in EEG decoding
- First to achieve sentence-level decoding without teacher forcing
- Published on arXiv with ID 2605.17503
- Research conducted by anonymous authors
Entities
Institutions
- arXiv
Locations
- Zurich
- Switzerland