Causality Inspired Representation Learning for Domain Generalization
Domain generalization (DG) is essentially an out-of-distribution problem, aiming to generalize the knowledge learned from multiple source domains to an unseen target domain. The mainstream is to leverage statistical models to model the dependence between data and labels, intending to learn represent...
Saved in:
Published in | Proceedings (IEEE Computer Society Conference on Computer Vision and Pattern Recognition. Online) pp. 8036 - 8046 |
---|---|
Main Authors | , , , , , , |
Format | Conference Proceeding |
Language | English |
Published |
IEEE
01.06.2022
|
Subjects | |
Online Access | Get full text |
ISSN | 1063-6919 |
DOI | 10.1109/CVPR52688.2022.00788 |
Cover
Loading…
Abstract | Domain generalization (DG) is essentially an out-of-distribution problem, aiming to generalize the knowledge learned from multiple source domains to an unseen target domain. The mainstream is to leverage statistical models to model the dependence between data and labels, intending to learn representations independent of domain. Nevertheless, the statistical models are superficial descriptions of reality since they are only required to model dependence instead of the intrinsic causal mechanism. When the dependence changes with the target distribution, the statistic models may fail to generalize. In this regard, we introduce a general structural causal model to formalize the DG problem. Specifically, we assume that each input is constructed from a mix of causal factors (whose relationship with the label is invariant across domains) and non-causal factors (category-independent), and only the former cause the classification judgments. Our goal is to extract the causal factors from inputs and then reconstruct the invariant causal mechanisms. However, the theoretical idea is far from practical of DG since the required causal/non-causal factors are unobserved. We highlight that ideal causal factors should meet three basic properties: separated from the non-causal ones, jointly independent, and causally sufficient for the classification. Based on that, we propose a Causality Inspired Representation Learning (CIRL) algorithm that enforces the representations to satisfy the above properties and then uses them to simulate the causal factors, which yields improved generalization ability. Extensive experimental results on several widely used datasets verify the effectiveness of our approach. 1 1 Code is available at "https://github.com/BIT-DA/CIRL". |
---|---|
AbstractList | Domain generalization (DG) is essentially an out-of-distribution problem, aiming to generalize the knowledge learned from multiple source domains to an unseen target domain. The mainstream is to leverage statistical models to model the dependence between data and labels, intending to learn representations independent of domain. Nevertheless, the statistical models are superficial descriptions of reality since they are only required to model dependence instead of the intrinsic causal mechanism. When the dependence changes with the target distribution, the statistic models may fail to generalize. In this regard, we introduce a general structural causal model to formalize the DG problem. Specifically, we assume that each input is constructed from a mix of causal factors (whose relationship with the label is invariant across domains) and non-causal factors (category-independent), and only the former cause the classification judgments. Our goal is to extract the causal factors from inputs and then reconstruct the invariant causal mechanisms. However, the theoretical idea is far from practical of DG since the required causal/non-causal factors are unobserved. We highlight that ideal causal factors should meet three basic properties: separated from the non-causal ones, jointly independent, and causally sufficient for the classification. Based on that, we propose a Causality Inspired Representation Learning (CIRL) algorithm that enforces the representations to satisfy the above properties and then uses them to simulate the causal factors, which yields improved generalization ability. Extensive experimental results on several widely used datasets verify the effectiveness of our approach. 1 1 Code is available at "https://github.com/BIT-DA/CIRL". |
Author | Lv, Fangrui Li, Shuang Liu, Di Liu, Chi Harold Wang, Ziteng Liang, Jian Zang, Bin |
Author_xml | – sequence: 1 givenname: Fangrui surname: Lv fullname: Lv, Fangrui email: fangruilv@bit.edu.cn organization: Beijing Institute of Technology,China – sequence: 2 givenname: Jian surname: Liang fullname: Liang, Jian email: xuelang.lj@alibaba-inc.com organization: Alibaba Group,China – sequence: 3 givenname: Shuang surname: Li fullname: Li, Shuang email: shuangli@bit.edu.cn organization: Beijing Institute of Technology,China – sequence: 4 givenname: Bin surname: Zang fullname: Zang, Bin email: binzang@bit.edu.cn organization: Beijing Institute of Technology,China – sequence: 5 givenname: Chi Harold surname: Liu fullname: Liu, Chi Harold email: liuchi02@gmail.com organization: Beijing Institute of Technology,China – sequence: 6 givenname: Ziteng surname: Wang fullname: Wang, Ziteng email: ziteng.wang@yizhun-ai.com organization: Yizhun Medical AI Co., Ltd,China – sequence: 7 givenname: Di surname: Liu fullname: Liu, Di email: wendi.ld@alibaba-inc.com organization: Alibaba Group,China |
BookMark | eNotjMFOwkAQQFejiYB8gR76A8WZ2bLdOZoiSNJEQ9Qr2bJTswa2ZFsP-PUa9fQu772xuohdFKVuEWaIwHfV2_NmTsbaGQHRDKC09kyN0Zh5Ybgw-lyNEIzODSNfqWnffwCAJkTDdqQWlfvs3T4Mp2wd-2NI4rONHJP0Egc3hC5mtbgUQ3zP2i5li-7gQsxWEiX9ZF-_yrW6bN2-l-k_J-p1-fBSPeb102pd3dd5INBDblsuvG2a0jaEThdEHsF5D8ZSSRYYxYJrfMlIBhka4R3rdseGG68L1hN18_cNIrI9pnBw6bRlW_KcSv0NH3tNfA |
CODEN | IEEPAD |
ContentType | Conference Proceeding |
DBID | 6IE 6IH CBEJK RIE RIO |
DOI | 10.1109/CVPR52688.2022.00788 |
DatabaseName | IEEE Electronic Library (IEL) Conference Proceedings IEEE Proceedings Order Plan (POP) 1998-present by volume IEEE Xplore All Conference Proceedings IEEE Electronic Library (IEL) IEEE Proceedings Order Plans (POP) 1998-present |
DatabaseTitleList | |
Database_xml | – sequence: 1 dbid: RIE name: IEEE Electronic Library (IEL) url: https://proxy.k.utb.cz/login?url=https://ieeexplore.ieee.org/ sourceTypes: Publisher |
DeliveryMethod | fulltext_linktorsrc |
Discipline | Applied Sciences |
EISBN | 1665469463 9781665469463 |
EISSN | 1063-6919 |
EndPage | 8046 |
ExternalDocumentID | 9879527 |
Genre | orig-research |
GrantInformation_xml | – fundername: National Natural Science Foundation of China grantid: U21A20519,61902028 funderid: 10.13039/501100001809 |
GroupedDBID | 6IE 6IH 6IL 6IN AAWTH ABLEC ADZIZ ALMA_UNASSIGNED_HOLDINGS BEFXN BFFAM BGNUA BKEBE BPEOZ CBEJK CHZPO IEGSK IJVOP OCL RIE RIL RIO |
ID | FETCH-LOGICAL-i203t-8f94d8bb78b21a3422d10add0682728091e80abd79126190be9c93fc969bd3493 |
IEDL.DBID | RIE |
IngestDate | Wed Aug 27 02:15:10 EDT 2025 |
IsPeerReviewed | false |
IsScholarly | true |
Language | English |
LinkModel | DirectLink |
MergedId | FETCHMERGED-LOGICAL-i203t-8f94d8bb78b21a3422d10add0682728091e80abd79126190be9c93fc969bd3493 |
PageCount | 11 |
ParticipantIDs | ieee_primary_9879527 |
PublicationCentury | 2000 |
PublicationDate | 2022-June |
PublicationDateYYYYMMDD | 2022-06-01 |
PublicationDate_xml | – month: 06 year: 2022 text: 2022-June |
PublicationDecade | 2020 |
PublicationTitle | Proceedings (IEEE Computer Society Conference on Computer Vision and Pattern Recognition. Online) |
PublicationTitleAbbrev | CVPR |
PublicationYear | 2022 |
Publisher | IEEE |
Publisher_xml | – name: IEEE |
SSID | ssj0003211698 |
Score | 2.5408807 |
Snippet | Domain generalization (DG) is essentially an out-of-distribution problem, aiming to generalize the knowledge learned from multiple source domains to an unseen... |
SourceID | ieee |
SourceType | Publisher |
StartPage | 8036 |
SubjectTerms | categorization Classification algorithms Computer vision Data models Pattern recognition Representation learning retrieval; Representation learning; Self-& semi-& meta- & unsupervised learning Transfer/low-shot/long-tail learning; Machine learning; Recognition: detection |
Title | Causality Inspired Representation Learning for Domain Generalization |
URI | https://ieeexplore.ieee.org/document/9879527 |
hasFullText | 1 |
inHoldings | 1 |
isFullTextHit | |
isPrint | |
link | http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwjV3LT8IwGG_AkydUML7Tg0cHfa79ziBBEwwhYriRde0MMQ4j28W_3nWraIwHb8t22NIv3ffo74HQtY4NKClcpAyRkbBMRtpIGbE0FonUJmPOD_SnD_FkIe6XctlCNzsujHOuBp-5vr-sz_LtJi39qGwA3hmbqTZqV41bw9XazVN41cnEoAM7jhIYDJ9mcy9m4gFczMtyKm-v8sNDpU4h4w6afr28QY689MvC9NOPX7qM__26A9T7Juvh2S4NHaKWy49QJ1SXOOzdbReNhkm5ratufJf78_Xq6bzGwQb6UY6D2OozripZPNq8JuscB13qQNfsocX49nE4iYKHQrRmhBeRzkBYbYzShtGEC8YsJdU_jcSaeWcqoE6TxFgF1PdSxDhIgWcpxGAsF8CP0V6-yd0JwkKIjAupnCVGKEcToCAtBStFFdFEnKKuX5TVWyOTsQrrcfb37XO078PSoK4u0F7xXrrLKr8X5qoO7CfmcaUV |
linkProvider | IEEE |
linkToHtml | http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwjV3NT8IwFH9BPOgJFYzf9uDRYdu1W3sGCSgQQsBwI-vaGWIcRraLf73rVtEYD96W7rCmL917ff19ANyIQMmQM-OFCnOPaco9oTj3aBywiAuVUGMb-qNx0J-zhwVf1OB2y4UxxpTgM9O2j-Vdvl7HuW2V3UnrjE3DHdgt8j4nFVtr21Hxi7NMIIXjxxEs7zpPk6mVM7EQLmqFOUNrsPLDRaVMIr0GjL4-X2FHXtp5ptrxxy9lxv_O7wBa33Q9NNkmokOomfQIGq6-RG73bprQ7UT5pqy70SC1N-zF22mJhHUEpBQ5udVnVNSyqLt-jVYpcsrUjrDZgnnvftbpe85FwVtR7GeeSCTTQqlQKEoin1GqCS7-ajgQ1HpTSWIEjpQOJbGnKayMjKWfxDKQSvtM-sdQT9epOQHEGEt8xkOjsWKhIZEkkmsiNWdFTCN2Ck27KMu3Sihj6dbj7O_ha9jrz0bD5XAwfjyHfRuiCoN1AfXsPTeXRbbP1FUZ5E-0V6he |
openUrl | ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=proceeding&rft.title=Proceedings+%28IEEE+Computer+Society+Conference+on+Computer+Vision+and+Pattern+Recognition.+Online%29&rft.atitle=Causality+Inspired+Representation+Learning+for+Domain+Generalization&rft.au=Lv%2C+Fangrui&rft.au=Liang%2C+Jian&rft.au=Li%2C+Shuang&rft.au=Zang%2C+Bin&rft.date=2022-06-01&rft.pub=IEEE&rft.eissn=1063-6919&rft.spage=8036&rft.epage=8046&rft_id=info:doi/10.1109%2FCVPR52688.2022.00788&rft.externalDocID=9879527 |