A Unified Framework for Locality in Scalable MARL

arXiv CS Thursday 04 June 2026, 04:00 UTC By Sourav Chakraborty, Amit Kiran Rege, Claire Monteleoni, Lijun Chen 2 min read

Key Points

arXiv:2602.16966v2 Announce Type: replace Abstract: Scalable methods for networked multi-agent reinforcement learning let each agent plan using only a small neighborhood of the agent graph. This works only when the system is value-local, meaning a perturbation at one agent affects the long-run value at another agent weakly when the two are far apart. In the average-reward setting, the standard way to certify locality is the Dobrushin row-sum bound on a single matrix $C^\pi$ that captures how each agent's next state depends on each other agent's current state. To make this matrix easy to work with, prior work bounds it by a supremum over joint actions. The resulting bound is independent of the policy, but it is loose whenever the policy never picks the worst-case action. We split $C^\pi$ into pieces that separately track environment sensitivity and policy sensitivity, $C^\pi \preceq E^{\mathrm s}+E^{\mathrm a}\Pi(\pi)$, where $E^{\mathrm s}$ measures how the next state moves with the current state, $E^{\mathrm a}$ measures how it moves with the current action, and $\Pi(\pi)$ measures how reactive the policy is to changes in state. The spectral radius of $H^\pi := E^{\mathrm s}+E^{\mathrm a}\Pi(\pi)$ then controls the decay of the average-reward Poisson solution, and the spectral certificate $\rho(H^\pi)<1$ is strictly weaker than the row-sum condition $\|H^\pi\|_\infty<1$ on the same matrix and applies in regimes where policy-independent action-supremum bounds used in prior Dobrushin-style work cannot. For temperature-$\tau$ softmax policies we get $\Pi(\pi)\le L/(2\tau)$, so the softmax temperature directly controls locality. We use this decay result to give a deterministic oracle guarantee for a block-coordinate KL-proximal policy-improvement template whose truncation bias decays exponentially in the message-passing radius $\kappa$.

Unified Framework for Locality (ORG) C^\pi$ (ORG) Poisson (ORG) softmax (PERSON)

Originally published by arXiv CS Read original →

The NHLPA expects a full NHL investigation of coach Mike Babcock before the Edmonton Oilers can hire him, sources told ESPN on Tuesday. The investigation would cover Babcock's time with the Columbus Blue Jackets in 2023, when he was hired but never coached a game for the team. Hired in July 2023, Babcock resigned that September after an NHLPA investigation into claims that he violated players' privacy when he asked to see photos on their cellphones.

ESPN 43m ago

Trump signs $70 billion immigration funding bill after months of delay

President Donald Trump on Wednesday signed a $70 billion bill to fund immigration enforcement agencies through the end of his term. The package to fund Immigration and Customs Enforcement and Customs and Border Protection passed out of Congress in the last week after months of debate and delays amid Democratic concerns about overly aggressive immigration enforcement. At a signing ceremony in the Oval Office on Wednesday, Trump said the bill would "give the heroes of ICE and border patrol ......

CNBC 57m ago

Emergency action seeks to prevent erasure of 'mother' and 'father' in code of largest US town

Officials in America's most populated township are taking urgent action to stop a Democrat-backed bill that would replace "mother" and "father" in New York State law with gender-neutral parental terms. The emergency resolution from Hempstead Township comes just days after New York State Legislature passed a bill that would replace "mother" with the term "gestating parent" and "father" with "non-gestating parent." It would also change "paternity" to "parentage.

Fox News Politics 1h ago

Uber sues New York City over 'reckless' driver protection law

Uber sues New York City over 'reckless' driver protection law NEW YORK, June 10 : Uber Technologies sued New York City to block enforcement of a new law that it said would unconstitutionally force it to keep drivers it does not want on its platform. In a complaint filed late on Tuesday night, Uber said the law against "wrongful deactivations" would improperly shield drivers who engage in dangerous, threatening or other inappropriate behavior, threatening public safety and causing "immediate...

Channel News Asia 1h ago

A Unified Framework for Locality in Scalable MARL

Related Stories

Sources: NHLPA eyes Babcock inquiry on '23 case

Trump signs $70 billion immigration funding bill after months of delay

Emergency action seeks to prevent erasure of 'mother' and 'father' in code of largest US town

Uber sues New York City over 'reckless' driver protection law