RECON: Reasoning with Condensation for Efficient Retrieval-Augmented Generation

arXiv CS Tuesday 09 June 2026, 04:00 UTC By Zhichao Xu, Minheng Wang, Yawei Wang, Wenqian Ye, Yuntao Du, Yunpu Ma, Yijun Tian 1 min read

Key Points

Announce Type: replace Abstract: Search agents trained with reinforcement learning (RL) interleave reasoning with tool calls in a multi-turn, tool-integrated reasoning (TIR) loop, where each tool invocation returns an environment observation that is appended to the agent's context. As the rollout proceeds, these raw observations accumulate, inflating token cost and diluting the signal available for downstream reasoning. Unlike single-pass retrieve-then-read pipelines, where context...

arXiv:2510.10448v2 Announce Type: replace Abstract: Search agents trained with reinforcement learning (RL) interleave reasoning with tool calls in a multi-turn, tool-integrated reasoning (TIR) loop, where each tool invocation returns an environment observation that is appended to the agent's context. As the rollout proceeds, these raw observations accumulate, inflating token cost and diluting the signal available for downstream reasoning. Unlike single-pass retrieve-then-read pipelines, where context compression is a one-time postprocessing step, the multi-turn RL setting requires compression that runs at every observation step while remaining decoupled from policy optimization. We introduce RECON (REasoning with CONdensation), a framework that addresses this challenge by inserting a dedicated observation compressor into the reasoning loop. The compressor is trained via a two-stage curriculum: relevance pretraining on QA datasets followed by multi-aspect distillation from proprietary LLMs, and remains frozen during RL training to preserve policy stability. Integrated into the Search-R1 search-agent pipeline, RECON reduces total context length by 35%, improves training speed by 5.4% and inference latency by 30.9%, while boosting average exact-match by 14.5% on the 3B agent and 3.0% on the 7B agent, with particular strength in multi-hop QA. These results establish learned observation compression as a key component for building practical, scalable RL-trained search agents.

RL (ORG) TIR (ORG)

Originally published by arXiv CS Read original →

Knicks fans burning sage outside MSG ahead of Game 4 to purge the bad luck left behind from Trump’s attendance ‘It felt so dark yesterday, I was like, this is not the Garden that I know,’ the fan said - Bookmark - CommentsGo to comments The NBA Finals have New Yorkers desperate for a Knicks victory trying everything in their powers to help the team. Maybe it's the ratcheting tension as the series continues — as of this report, the Knicks are leading 2-1 against the San Antonio Spurs — but...

The Independent World 15m ago

Pope Leo blesses new tower at Spain’s Sagrada Familia

Pope Leo on Wednesday blessed a giant new tower at Barcelona’s famed Sagrada Familia Basilica after celebrating mass inside what is now the world’s tallest church. A choir of 600 singers performed at the service which lasted around 90 minutes and was attended by Spanish Prime Minister Pedro Sanchez as well as King Felipe VI and Queen Letizia. The stained-glass windows in various colours shone brightly in between the treelike columns of the temple as Leo delivered his homily in Spanish,...

South China Morning Post 18m ago

Road trauma victims call for 'overhaul' of Transport Accident Commission

Road trauma victims call for 'overhaul' of Transport Accident Commission in parliamentary inquiry Thu 11 Jun 2026 at 6:34am In short: Victims of road trauma have given evidence to a parliamentary inquiry into the Transport Accident Commission. Melita Parker told the inquiry the TAC system only moves when clients become impossible to ignore. Recommendations from the inquiry are expected to be tabled in parliament in the coming months.

ABC Australia 26m ago

Trump Risks Key Surveillance Authority Over ‘Unqualified’ Spy-Chief Pick

A sweeping warrantless surveillance authority remains on track to expire Friday, with no clear path to a deal, after President Donald Trump refused this week to abandon his pick of housing official Bill Pulte to temporarily lead the US intelligence community—even tasking Pulte with gutting the Office of the Director of National Intelligence in a DOGE-style “downsizing“ before a permanent director is named. In a Truth Social post after his second White House meeting in two days with House...

Wired 32m ago

RECON: Reasoning with Condensation for Efficient Retrieval-Augmented Generation

Related Stories

Knicks fans burning sage outside MSG ahead of Game 4 to purge the bad luck left behind from Trump’s attendance

Pope Leo blesses new tower at Spain’s Sagrada Familia

Road trauma victims call for 'overhaul' of Transport Accident Commission

Trump Risks Key Surveillance Authority Over ‘Unqualified’ Spy-Chief Pick