Science
Wasserstein normalized autoencoder for anomaly detection
Key Points
Announce Type: replace-cross Abstract: A novel anomaly detection algorithm is presented. The Wasserstein normalized autoencoder (WNAE) is a normalized probabilistic model that minimizes the Wasserstein distance between the learned probability distribution--a Boltzmann distribution where the energy is the reconstruction error of the autoencoder--and the distribution of the training data. This algorithm has been developed and applied to the identification of semivisible jets--conical sprays of...
arXiv:2510.02168v2 Announce Type: replace-cross
Abstract: A novel anomaly detection algorithm is presented. The Wasserstein normalized autoencoder (WNAE) is a normalized probabilistic model that minimizes the Wasserstein distance between the learned probability distribution--a Boltzmann distribution where the energy is the reconstruction error of the autoencoder--and the distribution of the training data. This algorithm has been developed and applied to the identification of semivisible jets--conical sprays of visible standard model particles and invisible dark matter states--with the CMS experiment at the CERN LHC. Trained on jets of particles from simulated standard model processes, the WNAE is shown to learn the probability distribution of the input data in a fully unsupervised fashion, such that it effectively identifies new physics jets as anomalies. The model exhibits stable, convergent training and recovers strong classification performance for a wide range of signals against the selected background process, for which a standard autoencoder fails because of outlier reconstruction. In addition, the model improves upon standard normalized autoencoders while remaining fully agnostic to the signal. The WNAE directly tackles the problem of outlier reconstruction, a common failure mode of autoencoders in anomaly detection tasks.