Home Health Med-Banana: Learning Quality-Controlled Medical Image...
Health

Med-Banana: Learning Quality-Controlled Medical Image Editing from Success-and-Failure Trajectories

Key Points

arXiv:2511.00801v4 Announce Type: replace Abstract: Text-guided medical image editing must satisfy the requested pathology while preserving anatomy, modality-specific appearance, and clinical plausibility. However, existing datasets largely supervise editors with final accepted edits and discard the failed attempts produced during generation.

arXiv:2511.00801v4 Announce Type: replace Abstract: Text-guided medical image editing must satisfy the requested pathology while preserving anatomy, modality-specific appearance, and clinical plausibility. However, existing datasets largely supervise editors with final accepted edits and discard the failed attempts produced during generation. We argue that these failures provide essential supervision for quality control: they specify what should be rejected, why an edit is medically or visually invalid, and how the instruction should be revised. We present Med-Banana, a trajectory-supervised framework for quality-controlled medical image editing. We introduce Med-Banana-80K, a large-scale resource of success-and-failure editing trajectories with candidate images, verification outcomes, rejection reasons, and prompt refinements. Building on it, Med-Banana jointly trains an editor, verifier, and refiner, enabling edit--verify--refine inference from accepted and rejected attempts. Experiments across MLLM judges, blind expert assessment, source-preservation and real--synthetic separability probes demonstrate consistent improvements over open medical image editors. Code and data are publicly available.
Med-Banana: Learning Quality-Controlled Medical Image (ORG) Success (ORG) Med-Banana (ORG)
Originally published by arXiv CS Read original →