Causality Inspired Representation Learning for Domain Generalization

Domain generalization (DG) is essentially an out-of-distribution problem, aiming to generalize the knowledge learned from multiple source domains to an unseen target domain. The mainstream is to leverage statistical models to model the dependence between data and labels, intending to learn represent...

Full description

Saved in:
Bibliographic Details
Published inProceedings (IEEE Computer Society Conference on Computer Vision and Pattern Recognition. Online) pp. 8036 - 8046
Main Authors Lv, Fangrui, Liang, Jian, Li, Shuang, Zang, Bin, Liu, Chi Harold, Wang, Ziteng, Liu, Di
Format Conference Proceeding
LanguageEnglish
Published IEEE 01.06.2022
Subjects
Online AccessGet full text
ISSN1063-6919
DOI10.1109/CVPR52688.2022.00788

Cover

Loading…
Abstract Domain generalization (DG) is essentially an out-of-distribution problem, aiming to generalize the knowledge learned from multiple source domains to an unseen target domain. The mainstream is to leverage statistical models to model the dependence between data and labels, intending to learn representations independent of domain. Nevertheless, the statistical models are superficial descriptions of reality since they are only required to model dependence instead of the intrinsic causal mechanism. When the dependence changes with the target distribution, the statistic models may fail to generalize. In this regard, we introduce a general structural causal model to formalize the DG problem. Specifically, we assume that each input is constructed from a mix of causal factors (whose relationship with the label is invariant across domains) and non-causal factors (category-independent), and only the former cause the classification judgments. Our goal is to extract the causal factors from inputs and then reconstruct the invariant causal mechanisms. However, the theoretical idea is far from practical of DG since the required causal/non-causal factors are unobserved. We highlight that ideal causal factors should meet three basic properties: separated from the non-causal ones, jointly independent, and causally sufficient for the classification. Based on that, we propose a Causality Inspired Representation Learning (CIRL) algorithm that enforces the representations to satisfy the above properties and then uses them to simulate the causal factors, which yields improved generalization ability. Extensive experimental results on several widely used datasets verify the effectiveness of our approach. 1 1 Code is available at "https://github.com/BIT-DA/CIRL".
AbstractList Domain generalization (DG) is essentially an out-of-distribution problem, aiming to generalize the knowledge learned from multiple source domains to an unseen target domain. The mainstream is to leverage statistical models to model the dependence between data and labels, intending to learn representations independent of domain. Nevertheless, the statistical models are superficial descriptions of reality since they are only required to model dependence instead of the intrinsic causal mechanism. When the dependence changes with the target distribution, the statistic models may fail to generalize. In this regard, we introduce a general structural causal model to formalize the DG problem. Specifically, we assume that each input is constructed from a mix of causal factors (whose relationship with the label is invariant across domains) and non-causal factors (category-independent), and only the former cause the classification judgments. Our goal is to extract the causal factors from inputs and then reconstruct the invariant causal mechanisms. However, the theoretical idea is far from practical of DG since the required causal/non-causal factors are unobserved. We highlight that ideal causal factors should meet three basic properties: separated from the non-causal ones, jointly independent, and causally sufficient for the classification. Based on that, we propose a Causality Inspired Representation Learning (CIRL) algorithm that enforces the representations to satisfy the above properties and then uses them to simulate the causal factors, which yields improved generalization ability. Extensive experimental results on several widely used datasets verify the effectiveness of our approach. 1 1 Code is available at "https://github.com/BIT-DA/CIRL".
Author Lv, Fangrui
Li, Shuang
Liu, Di
Liu, Chi Harold
Wang, Ziteng
Liang, Jian
Zang, Bin
Author_xml – sequence: 1
  givenname: Fangrui
  surname: Lv
  fullname: Lv, Fangrui
  email: fangruilv@bit.edu.cn
  organization: Beijing Institute of Technology,China
– sequence: 2
  givenname: Jian
  surname: Liang
  fullname: Liang, Jian
  email: xuelang.lj@alibaba-inc.com
  organization: Alibaba Group,China
– sequence: 3
  givenname: Shuang
  surname: Li
  fullname: Li, Shuang
  email: shuangli@bit.edu.cn
  organization: Beijing Institute of Technology,China
– sequence: 4
  givenname: Bin
  surname: Zang
  fullname: Zang, Bin
  email: binzang@bit.edu.cn
  organization: Beijing Institute of Technology,China
– sequence: 5
  givenname: Chi Harold
  surname: Liu
  fullname: Liu, Chi Harold
  email: liuchi02@gmail.com
  organization: Beijing Institute of Technology,China
– sequence: 6
  givenname: Ziteng
  surname: Wang
  fullname: Wang, Ziteng
  email: ziteng.wang@yizhun-ai.com
  organization: Yizhun Medical AI Co., Ltd,China
– sequence: 7
  givenname: Di
  surname: Liu
  fullname: Liu, Di
  email: wendi.ld@alibaba-inc.com
  organization: Alibaba Group,China
BookMark eNotjMFOwkAQQFejiYB8gR76A8WZ2bLdOZoiSNJEQ9Qr2bJTswa2ZFsP-PUa9fQu772xuohdFKVuEWaIwHfV2_NmTsbaGQHRDKC09kyN0Zh5Ybgw-lyNEIzODSNfqWnffwCAJkTDdqQWlfvs3T4Mp2wd-2NI4rONHJP0Egc3hC5mtbgUQ3zP2i5li-7gQsxWEiX9ZF-_yrW6bN2-l-k_J-p1-fBSPeb102pd3dd5INBDblsuvG2a0jaEThdEHsF5D8ZSSRYYxYJrfMlIBhka4R3rdseGG68L1hN18_cNIrI9pnBw6bRlW_KcSv0NH3tNfA
CODEN IEEPAD
ContentType Conference Proceeding
DBID 6IE
6IH
CBEJK
RIE
RIO
DOI 10.1109/CVPR52688.2022.00788
DatabaseName IEEE Electronic Library (IEL) Conference Proceedings
IEEE Proceedings Order Plan (POP) 1998-present by volume
IEEE Xplore All Conference Proceedings
IEEE Electronic Library (IEL)
IEEE Proceedings Order Plans (POP) 1998-present
DatabaseTitleList
Database_xml – sequence: 1
  dbid: RIE
  name: IEEE Electronic Library (IEL)
  url: https://proxy.k.utb.cz/login?url=https://ieeexplore.ieee.org/
  sourceTypes: Publisher
DeliveryMethod fulltext_linktorsrc
Discipline Applied Sciences
EISBN 1665469463
9781665469463
EISSN 1063-6919
EndPage 8046
ExternalDocumentID 9879527
Genre orig-research
GrantInformation_xml – fundername: National Natural Science Foundation of China
  grantid: U21A20519,61902028
  funderid: 10.13039/501100001809
GroupedDBID 6IE
6IH
6IL
6IN
AAWTH
ABLEC
ADZIZ
ALMA_UNASSIGNED_HOLDINGS
BEFXN
BFFAM
BGNUA
BKEBE
BPEOZ
CBEJK
CHZPO
IEGSK
IJVOP
OCL
RIE
RIL
RIO
ID FETCH-LOGICAL-i203t-8f94d8bb78b21a3422d10add0682728091e80abd79126190be9c93fc969bd3493
IEDL.DBID RIE
IngestDate Wed Aug 27 02:15:10 EDT 2025
IsPeerReviewed false
IsScholarly true
Language English
LinkModel DirectLink
MergedId FETCHMERGED-LOGICAL-i203t-8f94d8bb78b21a3422d10add0682728091e80abd79126190be9c93fc969bd3493
PageCount 11
ParticipantIDs ieee_primary_9879527
PublicationCentury 2000
PublicationDate 2022-June
PublicationDateYYYYMMDD 2022-06-01
PublicationDate_xml – month: 06
  year: 2022
  text: 2022-June
PublicationDecade 2020
PublicationTitle Proceedings (IEEE Computer Society Conference on Computer Vision and Pattern Recognition. Online)
PublicationTitleAbbrev CVPR
PublicationYear 2022
Publisher IEEE
Publisher_xml – name: IEEE
SSID ssj0003211698
Score 2.5408807
Snippet Domain generalization (DG) is essentially an out-of-distribution problem, aiming to generalize the knowledge learned from multiple source domains to an unseen...
SourceID ieee
SourceType Publisher
StartPage 8036
SubjectTerms categorization
Classification algorithms
Computer vision
Data models
Pattern recognition
Representation learning
retrieval; Representation learning; Self-& semi-& meta- & unsupervised learning
Transfer/low-shot/long-tail learning; Machine learning; Recognition: detection
Title Causality Inspired Representation Learning for Domain Generalization
URI https://ieeexplore.ieee.org/document/9879527
hasFullText 1
inHoldings 1
isFullTextHit
isPrint
link http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwjV3LT8IwGG_AkydUML7Tg0cHfa79ziBBEwwhYriRde0MMQ4j28W_3nWraIwHb8t22NIv3ffo74HQtY4NKClcpAyRkbBMRtpIGbE0FonUJmPOD_SnD_FkIe6XctlCNzsujHOuBp-5vr-sz_LtJi39qGwA3hmbqTZqV41bw9XazVN41cnEoAM7jhIYDJ9mcy9m4gFczMtyKm-v8sNDpU4h4w6afr28QY689MvC9NOPX7qM__26A9T7Juvh2S4NHaKWy49QJ1SXOOzdbReNhkm5ratufJf78_Xq6bzGwQb6UY6D2OozripZPNq8JuscB13qQNfsocX49nE4iYKHQrRmhBeRzkBYbYzShtGEC8YsJdU_jcSaeWcqoE6TxFgF1PdSxDhIgWcpxGAsF8CP0V6-yd0JwkKIjAupnCVGKEcToCAtBStFFdFEnKKuX5TVWyOTsQrrcfb37XO078PSoK4u0F7xXrrLKr8X5qoO7CfmcaUV
linkProvider IEEE
linkToHtml http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwjV3NT8IwFH9BPOgJFYzf9uDRYdu1W3sGCSgQQsBwI-vaGWIcRraLf73rVtEYD96W7rCmL917ff19ANyIQMmQM-OFCnOPaco9oTj3aBywiAuVUGMb-qNx0J-zhwVf1OB2y4UxxpTgM9O2j-Vdvl7HuW2V3UnrjE3DHdgt8j4nFVtr21Hxi7NMIIXjxxEs7zpPk6mVM7EQLmqFOUNrsPLDRaVMIr0GjL4-X2FHXtp5ptrxxy9lxv_O7wBa33Q9NNkmokOomfQIGq6-RG73bprQ7UT5pqy70SC1N-zF22mJhHUEpBQ5udVnVNSyqLt-jVYpcsrUjrDZgnnvftbpe85FwVtR7GeeSCTTQqlQKEoin1GqCS7-ajgQ1HpTSWIEjpQOJbGnKayMjKWfxDKQSvtM-sdQT9epOQHEGEt8xkOjsWKhIZEkkmsiNWdFTCN2Ck27KMu3Sihj6dbj7O_ha9jrz0bD5XAwfjyHfRuiCoN1AfXsPTeXRbbP1FUZ5E-0V6he
openUrl ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=proceeding&rft.title=Proceedings+%28IEEE+Computer+Society+Conference+on+Computer+Vision+and+Pattern+Recognition.+Online%29&rft.atitle=Causality+Inspired+Representation+Learning+for+Domain+Generalization&rft.au=Lv%2C+Fangrui&rft.au=Liang%2C+Jian&rft.au=Li%2C+Shuang&rft.au=Zang%2C+Bin&rft.date=2022-06-01&rft.pub=IEEE&rft.eissn=1063-6919&rft.spage=8036&rft.epage=8046&rft_id=info:doi/10.1109%2FCVPR52688.2022.00788&rft.externalDocID=9879527