Retriever Portfolios: A Principled Approach to Adaptive RAG

arXiv CS, Monday 01 June 2026, 04:00 UTC. By Miltiadis Stouras, Vincent Cohen-Addad, Silvio Lattanzi, Ola Svensson

Key Points

arXiv:2605.31176v1. Abstract: Retrieval-augmented generation (RAG) systems typically rely on a single retriever and a single set of hyperparameters, despite facing highly heterogeneous queries that range from simple factoid questions to complex multi-hop reasoning. We propose a method that automatically selects a small, diverse subset of retrievers (a portfolio) from a large pool of candidates, to cover different regions of the target query distribution. We formalize this setting via an expected best-of-$k$ objective over the query distribution and show that it admits an efficient portfolio construction algorithm with near-optimal guarantees. Across multiple QA benchmarks, our learned portfolios and router pipeline consistently outperform single-retriever and naive multi-retriever baselines on both retrieval metrics and answer quality. In addition, compared to inference-time hyperparameter tuning approaches, fixed portfolios enable parallel retrieval and LLM calls, achieving comparable (and sometimes better) accuracy with substantially lower latency and token cost.
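
To make the expected best-of-$k$ objective concrete, here is a minimal Python sketch. It is not the authors' algorithm, which the abstract does not spell out. It assumes the objective is estimated as the average, over sampled queries, of the best score any retriever in the portfolio achieves, and it uses plain greedy selection, a standard heuristic for objectives of this max-coverage shape. The names `best_of_k_value`, `greedy_portfolio`, and the `SCORES` matrix are hypothetical and exist only for illustration.

```python
from typing import List, Sequence


def best_of_k_value(scores: Sequence[Sequence[float]], portfolio: Sequence[int]) -> float:
    """Empirical best-of-k value: mean over queries of the best score in the portfolio.

    scores[r][q] is the (assumed, non-negative) quality of retriever r on query q.
    """
    if not portfolio:
        return 0.0
    n_queries = len(scores[0])
    total = 0.0
    for q in range(n_queries):
        total += max(scores[r][q] for r in portfolio)
    return total / n_queries


def greedy_portfolio(scores: Sequence[Sequence[float]], k: int) -> List[int]:
    """Greedy selection (an assumed stand-in for the paper's construction).

    Repeatedly adds the retriever with the largest marginal gain in
    best-of-k value until k retrievers are chosen.
    """
    portfolio: List[int] = []
    remaining = set(range(len(scores)))
    for _ in range(min(k, len(scores))):
        current = best_of_k_value(scores, portfolio)
        best_r, best_gain = None, -1.0
        for r in sorted(remaining):  # sorted for deterministic tie-breaking
            gain = best_of_k_value(scores, portfolio + [r]) - current
            if gain > best_gain:
                best_r, best_gain = r, gain
        portfolio.append(best_r)
        remaining.remove(best_r)
    return portfolio


# Hypothetical scores: rows are retrievers, columns are sampled queries.
SCORES = [
    [1.0, 1.0, 0.0, 0.0],   # retriever 0: strong on queries 0 and 1
    [1.0, 0.75, 0.0, 0.0],  # retriever 1: nearly redundant with retriever 0
    [0.0, 0.0, 1.0, 0.5],   # retriever 2: covers queries 2 and 3
]

if __name__ == "__main__":
    chosen = greedy_portfolio(SCORES, k=2)
    print(chosen, best_of_k_value(SCORES, chosen))
```

On this toy data, ranking retrievers by their individual average score would pair retrievers 0 and 1, which overlap almost entirely and reach a best-of-2 value of 0.5. The greedy sketch instead picks retrievers 0 and 2, which cover different queries and reach 0.875. This is the sense in which a portfolio is meant to be diverse and not merely a list of the strongest single retrievers.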


Originally published by arXiv CS.

