OTA: Optimal Transport Assignment for Object Detection

Bibliographic Details
Published in	2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pp. 303-312
Main Authors	Ge, Zheng; Liu, Songtao; Li, Zeming; Yoshie, Osamu; Sun, Jian
Format	Conference Proceeding
Language	English
Published	IEEE 01.01.2021
Subjects	Codes; Computer vision; Costs; Estimation; Object detection; Training; Transportation

Summary:	Recent advances in label assignment for object detection mainly define positive/negative training samples independently for each ground-truth (gt) object. In this paper, we revisit label assignment from a global perspective and formulate the assignment procedure as an Optimal Transport (OT) problem, a well-studied topic in optimization theory. Concretely, we define the unit transportation cost between each demander (anchor) and supplier (gt) pair as the weighted sum of their classification and regression losses. Under this formulation, finding the best assignment amounts to solving for the transport plan with minimal total transportation cost, which can be done via Sinkhorn-Knopp iteration. On COCO, a single FCOS-ResNet-50 detector equipped with Optimal Transport Assignment (OTA) reaches 40.7% mAP under the 1× scheduler, outperforming all other existing assignment methods. Extensive experiments on COCO and CrowdHuman further validate the effectiveness of OTA, especially its superiority in crowd scenarios. The code is available at https://github.com/Megvii-BaseDetection/OTA.
ISSN:	2575-7075
DOI:	10.1109/CVPR46437.2021.00037
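
Illustrative sketch (not the authors' code): the assignment procedure outlined in the summary could look roughly like the following in plain Python. The unit cost is the weighted sum of classification and regression losses, and the transport plan is approximated with Sinkhorn-Knopp iteration. Several details are assumptions made for illustration and are not stated in the summary: the function and parameter names (`sinkhorn`, `assign_labels`, `alpha`, `eps`, `iters`), a fixed supply of `k` units per gt, an extra "background" supplier that absorbs the remaining anchors, and reading each anchor's label off the largest entry in its column of the plan. For the reference implementation, see the repository linked in the summary.

```python
import math


def sinkhorn(cost, supply, demand, eps=0.1, iters=50):
    """Approximate an entropy-regularized optimal transport plan.

    cost[i][j] is the unit cost of moving one label from supplier i to
    demander j. The totals of `supply` and `demand` must be equal.
    """
    m, n = len(supply), len(demand)
    # Gibbs kernel: cheaper routes get exponentially larger weights.
    kernel = [[math.exp(-cost[i][j] / eps) for j in range(n)] for i in range(m)]
    u = [1.0] * m
    v = [1.0] * n
    for _ in range(iters):
        # Rescale rows to match the supply, then columns to match the demand.
        u = [supply[i] / sum(kernel[i][j] * v[j] for j in range(n)) for i in range(m)]
        v = [demand[j] / sum(kernel[i][j] * u[i] for i in range(m)) for j in range(n)]
    return [[u[i] * kernel[i][j] * v[j] for j in range(n)] for i in range(m)]


def assign_labels(cls_loss, reg_loss, bg_loss, k=1, alpha=1.0, eps=0.1, iters=50):
    """Assign each anchor to one gt or to background (index == number of gts).

    cls_loss[i][j] and reg_loss[i][j] are the losses of anchor j against gt i;
    bg_loss[j] is the (assumed) cost of labelling anchor j as background.
    """
    num_gt, num_anchor = len(cls_loss), len(bg_loss)
    if k * num_gt > num_anchor:
        raise ValueError("total gt supply exceeds the number of anchors")
    # Unit cost for gt rows: weighted sum of classification and regression losses.
    cost = [
        [cls_loss[i][j] + alpha * reg_loss[i][j] for j in range(num_anchor)]
        for i in range(num_gt)
    ]
    cost.append(list(bg_loss))  # hypothetical background supplier row
    # Each gt supplies k positive labels; background supplies the remainder.
    supply = [float(k)] * num_gt + [float(num_anchor - k * num_gt)]
    demand = [1.0] * num_anchor  # every anchor needs exactly one label
    plan = sinkhorn(cost, supply, demand, eps=eps, iters=iters)
    # Each anchor takes the supplier that ships it the most mass.
    labels = [
        max(range(num_gt + 1), key=lambda i: plan[i][j]) for j in range(num_anchor)
    ]
    return labels, plan
```

Calling `assign_labels` with two gts and four anchors returns one label per anchor, where a label equal to the number of gts denotes background. Note that this sketch exponentiates `-cost / eps` directly, so very large costs or a very small `eps` can underflow; a log-domain variant would be more robust.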