Robust Barycenter Estimation using Semi-Unbalanced Neural Optimal Transport

A common challenge in aggregating data from multiple sources can be formalized as an \textit{Optimal Transport} (OT) barycenter problem, which seeks to compute the average of probability distributions with respect to OT discrepancies. However, the presence of outliers and noise in the data measures...

Full description

Saved in:
Bibliographic Details
Main Authors Gazdieva, Milena, Choi, Jaemoo, Kolesov, Alexander, Choi, Jaewoong, Mokrov, Petr, Korotin, Alexander
Format Journal Article
LanguageEnglish
Published 04.10.2024
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:A common challenge in aggregating data from multiple sources can be formalized as an \textit{Optimal Transport} (OT) barycenter problem, which seeks to compute the average of probability distributions with respect to OT discrepancies. However, the presence of outliers and noise in the data measures can significantly hinder the performance of traditional statistical methods for estimating OT barycenters. To address this issue, we propose a novel, scalable approach for estimating the \textit{robust} continuous barycenter, leveraging the dual formulation of the \textit{(semi-)unbalanced} OT problem. To the best of our knowledge, this paper is the first attempt to develop an algorithm for robust barycenters under the continuous distribution setup. Our method is framed as a $\min$-$\max$ optimization problem and is adaptable to \textit{general} cost function. We rigorously establish the theoretical underpinnings of the proposed method and demonstrate its robustness to outliers and class imbalance through a number of illustrative experiments.
DOI:10.48550/arxiv.2410.03974