Optimal Transport Based Testing in Factorial Design

Authors

Groppe M, Niemöller L, Hundrieser S, Ventzke D, Blob A, Köster S, Munk A

Journal

Arxiv

Citation

arXiv:2509.13970.

Abstract

We introduce a general framework for testing statistical hypotheses for probability measures supported on finite spaces, which is based on optimal transport (OT). These tests are inspired by the analysis of variance (ANOVA) and its nonparametric counterparts. They allow for testing linear relationships in factorial designs between discrete probability measures and are based on pairwise comparisons of the OT distance and corresponding barycenters. To this end, we derive under the null hypotheses and (local) alternatives the asymptotic distribution of empirical OT costs and the empirical OT barycenter cost functional as the optimal value of linear programs with random objective function. In particular, we extend existing techniques for probability to signed measures and show directional Hadamard differentiability and the validity of the functional delta method. We discuss computational issues, permutation and bootstrap tests, and back up our findings with simulations. We illustrate our methodology on two datasets from cellular biophysics and biometric identification.

DOI

10.48550/arXiv.2509.13970