Causality Inspired Representation Learning for Domain Generalization

Bibliographic Details
Published in	Proceedings (IEEE Computer Society Conference on Computer Vision and Pattern Recognition. Online) pp. 8036 - 8046
Main Authors	Lv, Fangrui, Liang, Jian, Li, Shuang, Zang, Bin, Liu, Chi Harold, Wang, Ziteng, Liu, Di
Format	Conference Proceeding
Language	English
Published	IEEE 01.06.2022
Subjects	Classification algorithms; Computer vision; Data models; Pattern recognition; Representation learning; Self-, semi-, meta- and unsupervised learning; Transfer/low-shot/long-tail learning; Machine learning; Recognition: detection, categorization, retrieval
ISSN	1063-6919
DOI	10.1109/CVPR52688.2022.00788

Abstract	Domain generalization (DG) is essentially an out-of-distribution problem: the aim is to generalize knowledge learned from multiple source domains to an unseen target domain. The mainstream approach leverages statistical models to capture the dependence between data and labels, with the intent of learning domain-independent representations. Nevertheless, statistical models are superficial descriptions of reality, since they are only required to model dependence rather than the intrinsic causal mechanism. When the dependence changes with the target distribution, statistical models may fail to generalize. In this regard, we introduce a general structural causal model to formalize the DG problem. Specifically, we assume that each input is constructed from a mix of causal factors (whose relationship with the label is invariant across domains) and non-causal factors (category-independent), and that only the former cause the classification judgments. Our goal is to extract the causal factors from inputs and then reconstruct the invariant causal mechanisms. However, this theoretical idea is far from practical for DG, since the required causal and non-causal factors are unobserved. We highlight that ideal causal factors should meet three basic properties: they are separated from the non-causal ones, jointly independent, and causally sufficient for classification. Based on that, we propose a Causality Inspired Representation Learning (CIRL) algorithm that enforces the representations to satisfy the above properties and then uses them to simulate the causal factors, which yields improved generalization ability. Extensive experimental results on several widely used datasets verify the effectiveness of our approach. Code is available at https://github.com/BIT-DA/CIRL.
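The abstract does not say how the three properties are enforced, so the following is only an illustrative sketch and not the CIRL implementation. It assumes one plausible reading of "separated from the non-causal ones" and "jointly independent": given representations of an input and of a perturbed copy whose non-causal factors differ, each dimension should correlate with its counterpart and with no other dimension. The function names, the NumPy setup, and the synthetic data are all hypothetical.

```python
import numpy as np


def standardize(z, eps=1e-8):
    """Scale each representation dimension to zero mean and unit variance over the batch."""
    return (z - z.mean(axis=0)) / (z.std(axis=0) + eps)


def correlation_penalty(z_a, z_b):
    """Squared Frobenius distance between the cross-correlation matrix and the identity.

    Diagonal entries near 1: a dimension is stable across the two views.
    Off-diagonal entries near 0: different dimensions are decorrelated.
    (Decorrelation is used here as a simple stand-in for independence.)
    """
    n = z_a.shape[0]
    c = standardize(z_a).T @ standardize(z_b) / n
    return float(np.sum((c - np.eye(c.shape[0])) ** 2))


# Hypothetical data: 512 samples with 8-dimensional representations.
rng = np.random.default_rng(0)
z = rng.normal(size=(512, 8))

# A slightly perturbed copy stands in for the same inputs with altered non-causal factors.
aligned = correlation_penalty(z, z + 0.01 * rng.normal(size=z.shape))

# Unrelated representations give no stability across views.
shuffled = correlation_penalty(z, rng.normal(size=z.shape))

print(aligned < shuffled)
```

In a training setting such a penalty would typically be added to a classification loss, which is where the third property (causal sufficiency for classification) would enter; that combination is likewise an assumption here.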
Author details	1. Lv, Fangrui, Beijing Institute of Technology, China (fangruilv@bit.edu.cn)
	2. Liang, Jian, Alibaba Group, China (xuelang.lj@alibaba-inc.com)
	3. Li, Shuang, Beijing Institute of Technology, China (shuangli@bit.edu.cn)
	4. Zang, Bin, Beijing Institute of Technology, China (binzang@bit.edu.cn)
	5. Liu, Chi Harold, Beijing Institute of Technology, China (liuchi02@gmail.com)
	6. Wang, Ziteng, Yizhun Medical AI Co., Ltd, China (ziteng.wang@yizhun-ai.com)
	7. Liu, Di, Alibaba Group, China (wendi.ld@alibaba-inc.com)
CODEN	IEEPAD
Discipline	Applied Sciences
EISBN	1665469463 9781665469463
EISSN	1063-6919
Genre	orig-research
Funding	National Natural Science Foundation of China (grants U21A20519 and 61902028; funder ID 10.13039/501100001809)
IsPeerReviewed	false
IsScholarly	true
PageCount	11
PublicationTitleAbbrev	CVPR
URI	https://ieeexplore.ieee.org/document/9879527