Home Health PaCX-MAE: Physiology-Augmented Chest X-Ray Masked Autoencoder
Health

PaCX-MAE: Physiology-Augmented Chest X-Ray Masked Autoencoder

Key Points

arXiv:2606.01537v2 Announce Type: replace Abstract: Clinical diagnosis often requires combining imaging with physiological measurements, yet deployed models typically operate on unimodal data. We present PaCX-MAE, a cross-modal distillation framework that injects physiological priors into chest X-ray (CXR) encoders while remaining strictly unimodal at inference. PaCX-MAE augments in-domain masked autoencoding with a dual contrastive-predictive objective, aligning CXR representations with...

arXiv:2606.01537v2 Announce Type: replace Abstract: Clinical diagnosis often requires combining imaging with physiological measurements, yet deployed models typically operate on unimodal data. We present PaCX-MAE, a cross-modal distillation framework that injects physiological priors into chest X-ray (CXR) encoders while remaining strictly unimodal at inference. PaCX-MAE augments in-domain masked autoencoding with a dual contrastive-predictive objective, aligning CXR representations with paired ECG and laboratory embeddings. Extensive evaluation across nine benchmarks demonstrates consistent improvements over domain-specific MAE, particularly on physiology-dependent tasks (e.g., +2.7 AUROC on MedMod; +6.5 F1 on VinDr). The method proves highly label-efficient in the 1% regime and preserves anatomical fidelity, achieving parity with MAE on segmentation tasks. Zero-shot and attention analyses confirm that PaCX-MAE successfully learns to attend to physiological indicators, such as the cardiac silhouette, absent in standard visual pretraining.
PaCX-MAE: Physiology-Augmented (ORG) PaCX-MAE (ORG) CXR (ORG) ECG (ORG) MAE (ORG) AUROC (ORG) MedMod (ORG) +6.5 F1 (ORG)
Originally published by arXiv CS Read original →