Mean Flow Policy Optimization

arXiv CS, Tuesday 02 June 2026, 04:00 UTC. By Xiaoyi Dong, Xi Sheryl Zhang, Jian Cheng

arXiv:2604.14698v2

Abstract: Diffusion models have recently emerged as expressive policy representations for online reinforcement learning (RL). However, their iterative generative processes introduce substantial training and inference overhead. To overcome this limitation, we propose to represent policies using MeanFlow models, a class of few-step flow-based generative models, to improve training and inference efficiency over diffusion-based RL approaches. To promote exploration, we optimize MeanFlow policies under the maximum entropy RL framework via soft policy iteration, and address two key challenges specific to MeanFlow policies: action likelihood evaluation and soft policy improvement. Experiments on MuJoCo, DeepMind Control Suite and HumanoidBench benchmarks demonstrate that our method, Mean Flow Policy Optimization (MFPO), achieves performance comparable to or exceeding current diffusion-based baselines while considerably reducing training and inference time. Our code is available at https://github.com/dongxiaoyi-xyz/MFPO.
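To make the "few-step" idea concrete, here is a minimal sketch of how a MeanFlow-style policy might draw an action. It is not the MFPO implementation. It assumes the sampling rule commonly used for MeanFlow models, in which a model predicts an average velocity u(z_t, r, t) and a jump from time t to time r is z_r = z_t - (t - r) * u(z_t, r, t). The functions `target_action`, `mean_velocity`, and `sample_action` are hypothetical names, and the closed-form velocity stands in for a trained network: it is exact only for this toy case, where the target action for each state is a single point.

```python
import math
import random


def target_action(state):
    """Hypothetical toy target action for a state (not from the paper)."""
    return [math.tanh(s) for s in state]


def mean_velocity(z, r, t, state):
    """Toy closed-form average velocity u(z_t, r, t | state).

    Assumes the linear path z_t = (1 - t) * a + t * noise with a point-mass
    target a = target_action(state). The velocity along that path is the
    constant (noise - a) = (z_t - a) / t, so it does not depend on r here.
    In a real policy this function would be a trained neural network.
    """
    a = target_action(state)
    return [(zi - ai) / t for zi, ai in zip(z, a)]


def sample_action(state, num_steps=1, rng=random):
    """Few-step sampling with the rule z_r = z_t - (t - r) * u(z_t, r, t)."""
    # Start from Gaussian noise at t = 1 (toy: action dim equals state dim).
    z = [rng.gauss(0.0, 1.0) for _ in state]
    # Evenly spaced times from 1 down to 0, e.g. [1.0, 0.0] for one step.
    times = [1.0 - k / num_steps for k in range(num_steps + 1)]
    for t, r in zip(times[:-1], times[1:]):
        u = mean_velocity(z, r, t, state)
        z = [zi - (t - r) * ui for zi, ui in zip(z, u)]
    return z


state = [0.5, -1.0, 2.0]
one_step = sample_action(state, num_steps=1)
four_step = sample_action(state, num_steps=4)
```

In this toy setting one step and four steps land on the same action, which illustrates why an accurate average-velocity model can reduce the number of network evaluations per action compared with a many-step diffusion sampler. How MFPO evaluates action likelihoods and performs soft policy improvement is not specified in the abstract and is not sketched here.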

Originally published by arXiv CS.
