Efficient and Uncertainty-Aware Diffusion Framework for Offline-to-Online Reinforcement Learning

arXiv CS, Monday 01 June 2026, 04:00 UTC. By Ha Manh Bui, Metod Jazbec, Eric Nalisnick, Anqi Liu.

Key Points

arXiv:2605.30776v1 (Announce Type: new)

Abstract: Offline-to-Online Reinforcement Learning (O2O-RL) leverages an offline, pre-trained policy to minimize costly online interactions. Although data-efficient, O2O-RL is susceptible to shifts between offline and online distributions. Existing work aims to mitigate the harm of this shift by finetuning the policy on trajectory data sampled from a diffusion model. Inspired by this line of work, we propose DUAL: an efficient Diffusion Uncertainty-Aware framework for offline-to-online reinforcement Learning. DUAL utilizes the prior knowledge of the diffusion model to distill a fast-sampling diffusion actor policy and transition model in the offline phase. DUAL also employs a Laplace approximation and distance transition-state-shift detection, thereby using uncertainty quantification to improve exploration versus exploitation in the online phase. We formally show that our actor loss with the Laplace approximation provides a proxy for a principled estimate of epistemic uncertainty. Empirically, DUAL improves the online expected return over O2O-RL baselines across multiple settings and environments.
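To make the idea of a Laplace approximation as a source of epistemic uncertainty more concrete, here is a minimal sketch on a toy linear-Gaussian model. It is not the DUAL actor loss or any part of the authors' implementation: the model, the function names (`laplace_linear`, `epistemic_variance`), and all parameter values are assumptions chosen purely for illustration. The sketch fits MAP weights, uses the inverse Hessian of the negative log-posterior as the posterior covariance, and compares the predictive variance at a query near the training data with one far from it.

```python
import numpy as np


def laplace_linear(features, targets, noise_var=0.1, prior_precision=1.0):
    """MAP weights and Laplace posterior covariance for a linear-Gaussian model.

    Hypothetical helper for illustration only. For this model the Hessian of
    the negative log-posterior is X^T X / noise_var + prior_precision * I,
    and the Laplace posterior covariance is its inverse.
    """
    dim = features.shape[1]
    hessian = features.T @ features / noise_var + prior_precision * np.eye(dim)
    covariance = np.linalg.inv(hessian)
    w_map = covariance @ features.T @ targets / noise_var
    return w_map, covariance


def epistemic_variance(queries, covariance):
    """Per-query predictive variance x^T Sigma x induced by weight uncertainty."""
    return np.einsum("nd,de,ne->n", queries, covariance, queries)


# Toy "offline" data: inputs drawn near the origin (assumed values).
rng = np.random.default_rng(0)
states = rng.normal(0.0, 1.0, size=(200, 2))
true_w = np.array([1.5, -0.5])
targets = states @ true_w + rng.normal(0.0, 0.3, size=200)

w_map, cov = laplace_linear(states, targets, noise_var=0.09)

near = np.array([[0.1, -0.2]])  # resembles the offline data
far = np.array([[8.0, 8.0]])    # far outside the offline data
var_near = epistemic_variance(near, cov)[0]
var_far = epistemic_variance(far, cov)[0]

print("epistemic variance near data:", var_near)
print("epistemic variance far from data:", var_far)
```

In this toy setting the variance grows for inputs far from the training data, which is the kind of signal an agent could use to decide when to explore. How DUAL combines such an estimate with its shift detection is described in the paper, not here.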


Originally published by arXiv CS.
