DynaCF: Mitigating Shortcut Learning in Reward Models via Dynamic Counterfactual Sensitivity

arXiv CS Tuesday 09 June 2026, 04:00 UTC By Fengyuan Liu, Yongliang Miao, Zirui He, Yanguang Liu, Fei Sun, Mengnan Du 1 min read

Key Points

arXiv:2606.09043v1 Announce Type: new Abstract: Reward models trained from pairwise preferences often exploit superficial shortcut cues rather than learning true response quality. We propose DynaCF, a dynamic reweighting framework for mitigating shortcut learning in reward model training. Unlike static shortcut heuristics, DynaCF measures shortcut sensitivity online during optimization by applying semantics-preserving counterfactual perturbations and tracking the resulting margin shifts and preference flips under the current model. Samples with higher shortcut sensitivity are dynamically downweighted in the Bradley-Terry objective, encouraging the model to rely less on superficial patterns and more on task-relevant preference signals. Extensive experiments show that DynaCF consistently improves robustness in preference modeling.

Dynamic Counterfactual Sensitivity (ORG) Bradley (PERSON)

Originally published by arXiv CS Read original →

'Voltron: Legendary Defender' turns 10 today, and we think this mecha robot reboot was just as good as 'Power Rangers' and 'Transformers' Voltron may sound like an ointment for back pain, but the reboot Legendary Defender demonstrates that there's more to the big stompy robots concept than meets the eye. Reboot is a dirty word when it comes to TV. Very rarely does a remade show receive its due.

Space.com 30m ago

Exclusive-GM may ditch LFP batteries for future EVs

Exclusive-GM may ditch LFP batteries for future EVs SAN FRANCISCO, June 10 : General Motors may scrap plans to use a lower-cost, iron-based battery chemistry that many automakers are using to cut electric-vehicle costs, GM's head of battery technology said. The Detroit automaker had said it planned to develop lithium-iron phosphate, or LFP, batteries for use in future EV models, and would begin making those batteries in late 2027 at a jointly owned plant in Tennessee. But GM battery chief...

Channel News Asia 40m ago

Claude Fable won’t answer basic biology questions

Anthropic just released Claude Fable 5, calling it the most powerful AI model it has ever made widely available and praising its skills in biology, among others. But the model won't answer basic biology questions - the kind you'd expect a high schooler to handle. Instead, it hands off the query to the former flagship model, Claude Opus 4.8.

The Verge 46m ago

Musk Stock Fans Say ‘The More, The Better’ in SpaceX IPO Frenzy

A SpaceX Falcon 9 rocket launched from Cape Canaveral Space Force Station in Florida.

Bloomberg Technology 46m ago

DynaCF: Mitigating Shortcut Learning in Reward Models via Dynamic Counterfactual Sensitivity

Related Stories

'Voltron: Legendary Defender' turns 10 today, and we think this mecha robot reboot was just as good as 'Power Rangers' and 'Transformers'

Exclusive-GM may ditch LFP batteries for future EVs

Claude Fable won’t answer basic biology questions

Musk Stock Fans Say ‘The More, The Better’ in SpaceX IPO Frenzy