CRANE: Knowledge Editing for Reasoning MLLMs

arXiv CS Tuesday 09 June 2026, 04:00 UTC By Han Huang, Hao Wang, Mengqi Zhang, Shu Wu, Qiang Liu, Liang Wang 1 min read

Key Points

arXiv:2606.09033v1 Announce Type: new Abstract: The emergence of reasoning multimodal large language models (MLLMs), which generate explicit chain-of-thought (CoT) reasoning before producing answers, has introduced a new challenge for knowledge editing: methods that appear successful under traditional metrics (teacher-forcing accuracy up to 100%) can fail severely when the model's reasoning process is examined (Grounded Success as low as 0%). We identify three failure modes: (1) Structural Collapse, where weight-modifying methods destroy the CoT format; (2) Cognitive Dissonance, where the model's reasoning chain actively rejects the injected edit fact based on visual evidence; and (3) Shallow Internalization, where methods succeed on exact queries but fail on rephrase or multi-hop variants. On reasoning MLLMs, these modes interact: methods that generalize (FT, LoRA) trigger format collapse, while methods without deep modification cannot generalize. To expose these failures, we propose a CoT-aware evaluation protocol and construct ReasonEdit-Bench, with conflict stratification, multi-level probes, and multi-hop portability tests. We propose CRANE, a retrieval-augmented framework that requires no per-edit parameter modification. CRANE combines a modality-aware dual-library retrieval system with a two-phase training strategy: Supervised Fine-Tuning (SFT) for structural initialization, followed by GRPO with a Cognitive Routing Reward that trains the model to arbitrate between visual priors and injected edit facts. On ReasonEdit-Bench, CRANE achieves 96.9% Grounded Success on conflict scenarios and 96.9% intermediate entity usage in multi-hop chains, with 97.6% text-locality and 68.1% image-locality Edit Independence. On the out-of-distribution MMEVOKE benchmark, CRANE reaches 87.0% under gold retrieval.

Grounded Success (ORG) ReasonEdit-Bench (ORG) CRANE (ORG) GRPO (ORG)

Originally published by arXiv CS Read original →

CRANE: Knowledge Editing for Reasoning MLLMs

Related Stories

School knife attack suspect girl detained under Mental Health Act

FBI nabs 7 for alleged 'campaign of violence' to pressure University of Michigan, businesses over Israel ties

Cyber gangs access students' personal data in University of Nottingham hack

Five jailed for violence at Henry Nowak police protest