Generalizable Crowd Counting via Diverse Context Style Learning

Bibliographic Details
Published in: IEEE Transactions on Circuits and Systems for Video Technology, Vol. 32, No. 8, pp. 5399-5410
Main Authors: Zhao, Wenda; Wang, Mingyue; Liu, Yu; Lu, Huimin; Xu, Congan; Yao, Libo
Format: Journal Article
Language: English
Published: New York: The Institute of Electrical and Electronics Engineers, Inc. (IEEE), 01.08.2022
Summary: Existing crowd counting approaches predominantly perform well under the standard training-testing protocol. However, due to large style discrepancies not only among images but also within a single image, they suffer from obvious performance degradation when applied to unseen domains. In this paper, we aim to design a generalizable crowd counting framework that is trained on a source domain but generalizes well to other domains. To this end, we propose a gated ensemble learning framework. Specifically, we first propose a diverse fine-grained style attention model that helps learn discriminative content feature representations, allowing diverse features to be exploited to improve generalization. We then introduce a channel-level binary gating ensemble model, in which a diverse feature prior, input-dependent guidance and a density grade classification constraint are implemented, to optimally select diverse content features to participate in the ensemble, taking advantage of their complementarity while avoiding redundancy. Extensive experiments show that our gated ensemble approach achieves superior generalization performance across four public datasets. Code is publicly available at https://github.com/wdzhao123/DCSL .
ISSN: 1051-8215, 1558-2205
DOI: 10.1109/TCSVT.2022.3146459
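
Illustrative sketch: the summary describes a channel-level binary gating ensemble that selects among diverse content features before fusion. The PyTorch code below is a rough, hypothetical illustration of that idea only; it is not the authors' implementation (see the linked GitHub repository for the official DCSL code), and the module names, channel sizes, and straight-through gating trick are assumptions made for this sketch.

import torch
import torch.nn as nn
from typing import List


class ChannelBinaryGate(nn.Module):
    """Predicts per-channel binary gates from a global descriptor of the feature
    itself (input-dependent guidance), using a straight-through estimator so the
    hard 0/1 selection stays differentiable. Illustrative assumption only."""

    def __init__(self, channels: int):
        super().__init__()
        self.fc = nn.Sequential(
            nn.Linear(channels, channels // 4),
            nn.ReLU(inplace=True),
            nn.Linear(channels // 4, channels),
        )

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        desc = x.mean(dim=(2, 3))                  # (B, C) global average pool
        soft = torch.sigmoid(self.fc(desc))        # soft gate probabilities
        hard = (soft > 0.5).float()                # binarized channel selection
        gate = hard + soft - soft.detach()         # straight-through: hard forward, soft backward
        return gate.unsqueeze(-1).unsqueeze(-1)    # (B, C, 1, 1), broadcastable


class GatedEnsembleHead(nn.Module):
    """Gates each branch's content features, then fuses the selected channels
    into a single predicted density map."""

    def __init__(self, channels: int, num_branches: int):
        super().__init__()
        self.gates = nn.ModuleList(
            [ChannelBinaryGate(channels) for _ in range(num_branches)]
        )
        self.fuse = nn.Conv2d(channels * num_branches, 1, kernel_size=1)

    def forward(self, branch_feats: List[torch.Tensor]) -> torch.Tensor:
        gated = [g(f) * f for g, f in zip(self.gates, branch_feats)]
        return self.fuse(torch.cat(gated, dim=1))  # (B, 1, H, W) density map


if __name__ == "__main__":
    # Three hypothetical style-diverse feature branches for a 2-image batch.
    feats = [torch.randn(2, 64, 32, 32) for _ in range(3)]
    head = GatedEnsembleHead(channels=64, num_branches=3)
    print(head(feats).shape)  # torch.Size([2, 1, 32, 32])

The design point the sketch tries to capture is that gating is binary at the channel level and conditioned on the input, so each branch contributes only the channels that complement the others rather than all of them, which is the redundancy-avoidance behavior the summary describes.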