The Consistency Illusion: How Multi-Agent Debate Hides Reasoning Misalignment

arXiv CS, Tuesday 09 June 2026, 04:00 UTC. By Xiaoyang Wang, Christopher C. Yang

Key Points

arXiv:2606.08457v1. Abstract: Multi-agent LLM systems for medical question answering often treat consensus as a reliability signal: if multiple agents agree on an answer, it is presumed trustworthy. However, answer-level consensus does not entail reasoning-level alignment. We introduce CARA (Cross-Agent Reasoning Alignment), a family of automated metrics that measure whether agents who agree on an answer also agree on the reasoning. Applying CARA to a standard debate system on two medical QA benchmarks, MedQA-USMLE and MedThink-Bench, we identify the consistency illusion: a failure mode where debate reduces detectable contradictions between agents while simultaneously decreasing the semantic similarity of their reasoning chains. Agents appear to agree more but reason less consistently. To reduce this misalignment, we propose the Grounded Debate Protocol (GDP), a prompt-level intervention that requires agents to commit to named medical facts and take explicit stances on other agents' claims. GDP produces large, consistent alignment improvements, with Cohen's d ranging from +1.43 to +1.99, across two datasets and two backbone models, without adding LLM calls or modifying system architecture. Our results motivate cross-agent reasoning alignment as a quantity to audit alongside accuracy in safety-critical domains.
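To make the two quantities in the abstract concrete, here is a minimal Python sketch. It is not the paper's CARA definition, which the abstract does not spell out. It assumes a bag-of-words cosine as a crude stand-in for semantic similarity, compares only agents who share the majority answer, and computes Cohen's d with a pooled standard deviation (the paper may use a different variant). The function names `reasoning_alignment` and `cohens_d` are hypothetical.

```python
import math
from collections import Counter
from itertools import combinations
from statistics import mean, stdev


def cosine(a: Counter, b: Counter) -> float:
    """Cosine similarity between two bag-of-words count vectors."""
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def reasoning_alignment(agents):
    """Mean pairwise reasoning similarity among agents sharing the majority answer.

    `agents` is a list of (answer, reasoning_text) pairs. Returns None when
    fewer than two agents agree, since there is no pair to compare.
    ASSUMPTION: token overlap stands in for a real semantic similarity model.
    """
    majority, _ = Counter(ans for ans, _ in agents).most_common(1)[0]
    bags = [Counter(text.lower().split()) for ans, text in agents if ans == majority]
    if len(bags) < 2:
        return None
    return mean(cosine(x, y) for x, y in combinations(bags, 2))


def cohens_d(treatment, control):
    """Cohen's d using the pooled sample standard deviation (assumed variant)."""
    n1, n2 = len(treatment), len(control)
    pooled_var = (
        (n1 - 1) * stdev(treatment) ** 2 + (n2 - 1) * stdev(control) ** 2
    ) / (n1 + n2 - 2)
    return (mean(treatment) - mean(control)) / math.sqrt(pooled_var)
```

In this toy setup, two agents can both answer "B" and still score near zero on `reasoning_alignment` if their justifications share no vocabulary, which is the kind of gap between answer-level consensus and reasoning-level alignment the abstract describes. Comparing per-question alignment scores with and without an intervention via `cohens_d` then gives an effect size on the same scale as the reported +1.43 to +1.99.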

Originally published by arXiv CS
