Home › Politics › Faster Synchronous On-Policy RL via Straggler-Aware Group Sizing

Politics

Faster Synchronous On-Policy RL via Straggler-Aware Group Sizing

arXiv CS Tuesday 02 June 2026, 04:00 UTC By Azal Ahmad Khan, Ammar Ahmed, Zeshan Fayyaz, Sheng Di, Mingyi Hong, Ali Anwar 1 min read

Key Points

arXiv:2606.02218v1 Announce Type: new Abstract: Synchronous reinforcement learning methods such as Group Relative Policy Optimization (GRPO) provide stable and reproducible on-policy training, but they are highly vulnerable to stragglers, a single unusually long rollout can delay reward computation and parameter updates for the entire group. This problem becomes more severe as group size increases, creating a tension between the benefits of larger groups and the wall-clock cost of synchronization stalls. We propose Straggler-Aware Group Control (SAGC), a dynamic group-size controller that adapts the training group online based on observed rollout behavior. SAGC formulates group-size selection as an online constrained optimization problem, seeking to retain the benefits of larger groups while controlling the long-term rate of straggler events. Across synchronous GRPO and DAPO training, and on top of both vanilla and strong engineered baselines, SAGC consistently reduces straggler incidence and improves wall-clock efficiency while achieving competitive or better training reward. We further show that these gains transfer to final model quality: SAGC is competitive with or better than the strongest static group-size baseline on downstream reasoning benchmarks, and often produces shorter outputs without any explicit length penalty. These results position dynamic group control as a practical way to make synchronous on-policy RL more efficient and robust.

Straggler-Aware Group Sizing arXiv:2606.02218v1 (ORG) Group Relative Policy Optimization (ORG) GRPO (ORG) Straggler-Aware Group Control (ORG) SAGC (ORG) RL (ORG)

Originally published by arXiv CS Read original →

A sweeping warrantless surveillance authority remains on track to expire Friday, with no clear path to a deal, after President Donald Trump refused this week to abandon his pick of housing official Bill Pulte to temporarily lead the US intelligence community—even tasking Pulte with gutting the Office of the Director of National Intelligence in a DOGE-style “downsizing“ before a permanent director is named. In a Truth Social post after his second White House meeting in two days with House...

Wired 4m ago

Veterans and relatives see no place for Trump's arch near Arlington National Cemetery

Three Vietnam War veterans are suing to stop President Trump from building an arch just steps from Arlington National Cemetery, where 400,000 service members, veterans and their relatives are buried.(Image credit: Eric Lee for NPR)

NPR News 6m ago

California's 'leisurely' ballot counting faces backlash, Dems ripped for 'defending the indefensible'

California's "leisurely" ballot counting process is facing backlash from The New York Times editorial board, which ripped Democrats for defending the "indefensible" in a piece published Wednesday. "This slowness is a failure of governance, and it should help inspire the creation of a better system," the editorial board wrote. "There is no good reason that California takes so long to count votes.

Fox News 9m ago

More child health nurse visits for Victorian kids amid NDIS shake-up

Extra maternal and child health nurse visits for children in Victoria under Thriving Kids program Thu 11 Jun 2026 at 6:14am All Victorian children will get two extra visits with maternal and child health nurses as the state prepares to launch its Thriving Kids program for those to be shifted off the National Disability Insurance Scheme (NDIS). Minister for Children Lizzie Blandthorn said the state would also review the existing 10 visits available for children from when they are born to the...

ABC Australia 18m ago

Faster Synchronous On-Policy RL via Straggler-Aware Group Sizing

Related Stories

Trump Risks Key Surveillance Authority Over ‘Unqualified’ Spy-Chief Pick

Veterans and relatives see no place for Trump's arch near Arlington National Cemetery

California's 'leisurely' ballot counting faces backlash, Dems ripped for 'defending the indefensible'

More child health nurse visits for Victorian kids amid NDIS shake-up