ARTFEED — Contemporary Art Intelligence

Disco-RAG Framework Enhances AI Knowledge Synthesis with Discourse-Aware Retrieval

ai-technology · 2026-04-20

A new AI framework called Disco-RAG, detailed in arXiv preprint 2601.04377v5, addresses limitations in Retrieval-Augmented Generation (RAG) by incorporating discourse structures to improve knowledge synthesis. Traditional RAG methods often handle retrieved passages in a flat, unstructured manner, which hinders the capture of structural cues and the integration of evidence across multiple documents. Disco-RAG overcomes this by explicitly injecting discourse signals into the generation process. It constructs intra-chunk discourse trees to model local hierarchies and builds inter-chunk rhetorical graphs to represent cross-passage coherence. These structures are combined into a planning blueprint that conditions the generation output. Experiments on benchmarks for question answering and long-document summarization demonstrate the framework's effectiveness, showing improved performance in knowledge-intensive tasks. The approach aims to enhance large language models (LLMs) by enabling better synthesis of dispersed information from various documents. This development is part of ongoing research in AI and machine learning, focusing on advanced retrieval and generation techniques. The preprint was announced as a replacement cross on arXiv, indicating an update to prior versions.

Key facts

  • Disco-RAG is a discourse-aware framework for Retrieval-Augmented Generation (RAG).
  • It addresses flat and unstructured retrieval in existing RAG strategies.
  • The method constructs intra-chunk discourse trees for local hierarchies.
  • It builds inter-chunk rhetorical graphs to model cross-passage coherence.
  • These structures are integrated into a planning blueprint for generation.
  • Experiments were conducted on question answering and long-document summarization benchmarks.
  • The framework aims to enhance large language models (LLMs) in knowledge-intensive tasks.
  • The preprint is arXiv:2601.04377v5, announced as a replace-cross.

Entities

Institutions

  • arXiv

Sources