J-RAS: Mutual Adaptation for Medical Image Segmentation via Contrastive Retrieval-Augmented Joint Optimization

arXiv CS Thursday 04 June 2026, 04:00 UTC By Salma J. Ahmed, Emad A. Mohammed, Azam Asilian Bidgoli 1 min read

Key Points

Announce Type: replace Abstract: Manual medical image segmentation by clinicians, though accurate, is time-consuming and variable across experts, whereas AI-based models automate this process but often underperform with limited data and domain shifts. Inspired by how pathology trainees acquire disease recognition skills through guided comparison with expert-annotated slides and histopathology atlas reference images, we propose Joint Retrieval-Augmented Segmentation (J-RAS). This framework...

arXiv:2510.09953v3 Announce Type: replace Abstract: Manual medical image segmentation by clinicians, though accurate, is time-consuming and variable across experts, whereas AI-based models automate this process but often underperform with limited data and domain shifts. Inspired by how pathology trainees acquire disease recognition skills through guided comparison with expert-annotated slides and histopathology atlas reference images, we propose Joint Retrieval-Augmented Segmentation (J-RAS). This framework enables segmentation networks to learn with guidance. J-RAS jointly optimizes a segmentation model and a retrieval model through alternating contrastive and supervised learning, allowing the retrieval network to discover contextually relevant image-mask pairs that refine the segmentation model's anatomical reasoning. Unlike conventional retrieval-based augmentation that passively provides similar samples, J-RAS establishes a mutual adaptation and optimization loop where the retrieval model learns to emphasize segmentation-relevant cues, while the segmentation model leverages retrieved examples to improve boundary delineation, robustness to rare cases, and cross-dataset generalization. Evaluations on four public benchmarks spanning different imaging modalities, including ACDC and M&Ms (MRI), Breast Cancer Ultrasound, and lung and infection CT, across multiple backbones (U-Net, TransUNet, SAM, and SegFormer) demonstrate the generalizability and effectiveness of J-RAS. For instance, on ACDC, SegFormer improves from a mean Dice of 0.8708$\pm$0.042 and HD of 1.8130$\pm$2.49 to 0.9115$\pm$0.031 and 1.1489$\pm$0.30. These results highlight how retrieval-guided contrastive optimization bridges human-like guidance and machine-learned precision in medical image segmentation.

J-RAS (ORG) Mutual Adaptation for Medical Image Segmentation (ORG) AI (ORG) Joint Retrieval-Augmented Segmentation (ORG) ACDC (ORG) Breast Cancer Ultrasound (ORG) CT (ORG) U-Net (ORG) SAM (PERSON) SegFormer (ORG)

Originally published by arXiv CS Read original →

J-RAS: Mutual Adaptation for Medical Image Segmentation via Contrastive Retrieval-Augmented Joint Optimization

Related Stories

Two babies from Congo orphanage die of Ebola as outbreak death toll climbs

Children who missed out on swimming lessons because of covid among those most at risk of drowning

Deion Sanders declares he's cancer-free, addresses incontinence ahead of Colorado football season

Inspired by Karl-Anthony Towns, funeral home hosts...