Nanyang Technological University
SINGAPORE
Tsinghua University
CHINA
University of Illinois Urbana-Champaign
USA
National University of Singapore
SINGAPORE
University of Arizona
USA
Fudan University
CHINA
ABOUT OUR TUTORIAL
Large Language Models (LLMs) excel at zero- and few-shot learning but are constrained by the length of their context windows when processing long documents. Two strategies have emerged to overcome this limitation: (1) Long Context (LC) methods, which extend or compress transformer architectures to accommodate more input text; and (2) Retrieval-Augmented Generation (RAG), which integrates external knowledge sources via embedding- or index-based retrieval. This half-day tutorial offers a unified, beginner-friendly introduction to both approaches. We first review transformer fundamentals (positional encoding and attention complexity) and common LC techniques. Next, we explain the classic RAG pipeline and recent RAG strategies, alongside evaluation metrics and benchmarks. We also analyze recent empirical studies to highlight the strengths, limitations, and trade-offs of LC versus RAG in terms of scalability, computational cost, and retrieval effectiveness. We conclude with best practices for real-world deployments, emerging hybrid architectures, and open research directions, equipping IR researchers and practitioners with actionable guidelines for processing long documents with LLMs.
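To make the "classic RAG pipeline" mentioned above concrete, the sketch below shows the retrieve-then-generate loop in miniature: embed a query, rank a small corpus by similarity, and prepend the top passages to the prompt. The toy corpus, the bag-of-words "embedding", and the stubbed generate() call are illustrative assumptions for this page, not the tutorial's reference implementation; a real deployment would use a neural encoder, a vector index, and an actual LLM.

```python
# Minimal sketch of the classic RAG pipeline: embed -> retrieve top-k -> augment prompt.
# The corpus, the toy embedding, and generate() are placeholders (assumptions).
from collections import Counter
import math

CORPUS = [
    "Positional encodings let transformers represent token order.",
    "Self-attention cost grows quadratically with sequence length.",
    "Retrieval-augmented generation grounds answers in external documents.",
]

def embed(text: str) -> Counter:
    """Toy embedding: lower-cased bag-of-words term counts."""
    return Counter(text.lower().split())

def cosine(a: Counter, b: Counter) -> float:
    """Cosine similarity between two sparse term-count vectors."""
    dot = sum(a[t] * b[t] for t in a)
    norm = math.sqrt(sum(v * v for v in a.values())) * math.sqrt(sum(v * v for v in b.values()))
    return dot / norm if norm else 0.0

def retrieve(query: str, k: int = 2) -> list[str]:
    """Rank corpus passages by similarity to the query and keep the top k."""
    q = embed(query)
    return sorted(CORPUS, key=lambda doc: cosine(q, embed(doc)), reverse=True)[:k]

def generate(prompt: str) -> str:
    """Stand-in for an LLM call; here it simply echoes the augmented prompt."""
    return f"[LLM would answer based on]\n{prompt}"

query = "Why is long-context attention expensive?"
context = "\n".join(retrieve(query))
print(generate(f"Context:\n{context}\n\nQuestion: {query}"))
```

The LC alternative covered in the tutorial would instead feed the full document directly into a model whose attention or positional scheme has been adapted for longer inputs, trading retrieval machinery for higher inference cost.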