VeRO: A Harness for Agents to Optimize Agents

arXiv CS, Wednesday 03 June 2026, 04:00 UTC. By Varun Ursekar, Apaar Shanker, Veronica Chatrath, Yuan Xue, Samuel Marc Denton

Key Points

arXiv:2602.22480v4

Abstract: An important emerging application of coding agents is agent harness optimization: the iterative improvement of a target agent by editing and evaluating its code. Despite its relevance, the community lacks a systematic understanding of coding agent performance on this task. Harness optimization differs from conventional software engineering: agent harnesses interleave deterministic code with stochastic LLM completions, requiring structured capture of both intermediate execution traces and downstream outcomes. To address these challenges, we introduce (1) VeRO (Versioning, Rewards, and Observations), an outer harness that provides versioned snapshots, budget-controlled evaluation, and structured execution traces of target harnesses, and (2) VeRO-Bench, a benchmark suite of target agents and tasks with reference evaluation procedures. Using VeRO, we conduct an empirical study comparing optimizers across tasks and analyzing which modifications reliably improve target agent harnesses. We release VeRO to support research on agent optimization as a core capability for coding agents. Code is available at https://github.com/scaleapi/vero.
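To make the three ingredients named in the abstract concrete (versioned snapshots, budget-controlled evaluation, structured traces), here is a minimal Python sketch of an outer harness loop. It is an illustration only and does not reflect the actual VeRO code or API: every name in it (`OuterHarness`, `Trace`, `toy_agent`, `optimize`, the `max_len` setting) is hypothetical, and the target agent is a deterministic toy rather than an LLM-backed harness.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, List, Tuple

# All names below are hypothetical; this is NOT the VeRO API.

@dataclass
class Trace:
    """Structured record of one task run: intermediate steps plus outcome."""
    version: int
    task: str
    steps: List[str]
    reward: float

# A target agent maps (config, task) to (intermediate steps, reward).
Agent = Callable[[Dict[str, int], str], Tuple[List[str], float]]

@dataclass
class OuterHarness:
    agent: Agent
    budget: int  # maximum number of task evaluations allowed
    snapshots: List[Dict[str, int]] = field(default_factory=list)
    traces: List[Trace] = field(default_factory=list)
    spent: int = 0

    def commit(self, config: Dict[str, int]) -> int:
        """Store a copy of the config and return its version id."""
        self.snapshots.append(dict(config))
        return len(self.snapshots) - 1

    def evaluate(self, version: int, tasks: List[str]) -> float:
        """Run one snapshot on the tasks, charging the budget and logging traces."""
        if not tasks:
            raise ValueError("no tasks to evaluate")
        if self.spent + len(tasks) > self.budget:
            raise RuntimeError("evaluation budget exhausted")
        config = self.snapshots[version]
        rewards = []
        for task in tasks:
            steps, reward = self.agent(config, task)
            self.traces.append(Trace(version, task, steps, reward))
            rewards.append(reward)
        self.spent += len(tasks)
        return sum(rewards) / len(rewards)

def toy_agent(config: Dict[str, int], task: str) -> Tuple[List[str], float]:
    """Toy stand-in for a target agent: succeeds if the task is short enough."""
    ok = len(task) <= config["max_len"]
    return [f"read:{task}", f"decide:{ok}"], 1.0 if ok else 0.0

def optimize(harness: OuterHarness, tasks: List[str],
             candidates: List[Dict[str, int]]) -> Tuple[int, float]:
    """Try candidate configs in order until the budget runs out; keep the best."""
    best_version, best_score = -1, float("-inf")
    for config in candidates:
        version = harness.commit(config)
        try:
            score = harness.evaluate(version, tasks)
        except RuntimeError:
            break  # budget spent: stop proposing edits
        if score > best_score:
            best_version, best_score = version, score
    return best_version, best_score
```

In this sketch an optimizer would call `optimize(OuterHarness(toy_agent, budget=6), tasks, candidates)` and then inspect `harness.traces` to see which intermediate steps preceded each reward. In a real setting the candidates would be code edits proposed by a coding agent rather than a fixed list.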


Originally published by arXiv CS.
