Well-Posed KL-Regularized Control via Wasserstein and Kalman-Wasserstein KL Divergences

arXiv CS Tuesday 02 June 2026, 04:00 UTC By Viktor Stein, Adwait Datar, Nihat Ay 1 min read

Key Points

arXiv:2602.02250v2 Announce Type: replace-cross Abstract: Kullback-Leibler (KL) divergence regularization is widely used in reinforcement learning, but it becomes infinite under support mismatch and can degenerate in low-noise regimes. Using a unified information-geometric framework, we introduce KL analogs by replacing the Fisher-Rao geometry in the dynamical formulation of the KL with transport-based geometries, and derive closed-form expressions for common distribution families. Between elliptic distributions, these divergences remain finite for degenerating equal covariances and yield a geometric interpretation of regularization heuristics used in Kalman ensemble methods. We demonstrate the utility of these divergences in KL-regularized optimal control. In the fully tractable setting of linear time-invariant systems with Gaussian process noise, the classical KL reduces to a quadratic control penalty that becomes singular as process noise vanishes. Our variants remove this singularity and yield well-posed problems. In both the double integrator and cart-pole examples, the resulting controls preserve nontrivial feedback and achieve better closed-loop performance.

Wasserstein (PERSON) Kalman-Wasserstein KL (ORG) Kullback-Leibler (PERSON) Fisher-Rao (ORG) KL (LOCATION) Kalman (ORG)

Originally published by arXiv CS Read original →

Well-Posed KL-Regularized Control via Wasserstein and Kalman-Wasserstein KL Divergences

Related Stories

Senesi signs for Tottenham on free transfer from Bournemouth

Which rookies will drive in Barcelona practice as Hamilton, Antonelli sit out?

Sources: NHLPA eyes Babcock inquiry on '23 case

Rest? Play? All options open for Itoje's summer