Contrasting Low and High-Resolution Features for HER2 Scoring using Deep Learning

Breast cancer, the most common malignancy among women, requires precise detection and classification for effective treatment. Immunohistochemistry (IHC) biomarkers like HER2, ER, and PR are critical for identifying breast cancer subtypes. However, traditional IHC classification relies on pathologist...

Full description

Saved in:
Bibliographic Details
Main Authors Chauhan, Ekansh, Sharma, Anila, Sharma, Amit, Nishadham, Vikas, Ghughtyal, Asha, Kumar, Ankur, Gupta, Gurudutt, Mehta, Anurag, Jawahar, C. V, Vinod, P. K
Format Journal Article
LanguageEnglish
Published 27.03.2025
Subjects
Online AccessGet full text
DOI10.48550/arxiv.2503.22069

Cover

Abstract Breast cancer, the most common malignancy among women, requires precise detection and classification for effective treatment. Immunohistochemistry (IHC) biomarkers like HER2, ER, and PR are critical for identifying breast cancer subtypes. However, traditional IHC classification relies on pathologists' expertise, making it labor-intensive and subject to significant inter-observer variability. To address these challenges, this study introduces the India Pathology Breast Cancer Dataset (IPD-Breast), comprising of 1,272 IHC slides (HER2, ER, and PR) aimed at automating receptor status classification. The primary focus is on developing predictive models for HER2 3-way classification (0, Low, High) to enhance prognosis. Evaluation of multiple deep learning models revealed that an end-to-end ConvNeXt network utilizing low-resolution IHC images achieved an AUC, F1, and accuracy of 91.79%, 83.52%, and 83.56%, respectively, for 3-way classification, outperforming patch-based methods by over 5.35% in F1 score. This study highlights the potential of simple yet effective deep learning techniques to significantly improve accuracy and reproducibility in breast cancer classification, supporting their integration into clinical workflows for better patient outcomes.
AbstractList Breast cancer, the most common malignancy among women, requires precise detection and classification for effective treatment. Immunohistochemistry (IHC) biomarkers like HER2, ER, and PR are critical for identifying breast cancer subtypes. However, traditional IHC classification relies on pathologists' expertise, making it labor-intensive and subject to significant inter-observer variability. To address these challenges, this study introduces the India Pathology Breast Cancer Dataset (IPD-Breast), comprising of 1,272 IHC slides (HER2, ER, and PR) aimed at automating receptor status classification. The primary focus is on developing predictive models for HER2 3-way classification (0, Low, High) to enhance prognosis. Evaluation of multiple deep learning models revealed that an end-to-end ConvNeXt network utilizing low-resolution IHC images achieved an AUC, F1, and accuracy of 91.79%, 83.52%, and 83.56%, respectively, for 3-way classification, outperforming patch-based methods by over 5.35% in F1 score. This study highlights the potential of simple yet effective deep learning techniques to significantly improve accuracy and reproducibility in breast cancer classification, supporting their integration into clinical workflows for better patient outcomes.
Author Jawahar, C. V
Chauhan, Ekansh
Vinod, P. K
Nishadham, Vikas
Ghughtyal, Asha
Gupta, Gurudutt
Sharma, Amit
Mehta, Anurag
Kumar, Ankur
Sharma, Anila
Author_xml – sequence: 1
  givenname: Ekansh
  surname: Chauhan
  fullname: Chauhan, Ekansh
– sequence: 2
  givenname: Anila
  surname: Sharma
  fullname: Sharma, Anila
– sequence: 3
  givenname: Amit
  surname: Sharma
  fullname: Sharma, Amit
– sequence: 4
  givenname: Vikas
  surname: Nishadham
  fullname: Nishadham, Vikas
– sequence: 5
  givenname: Asha
  surname: Ghughtyal
  fullname: Ghughtyal, Asha
– sequence: 6
  givenname: Ankur
  surname: Kumar
  fullname: Kumar, Ankur
– sequence: 7
  givenname: Gurudutt
  surname: Gupta
  fullname: Gupta, Gurudutt
– sequence: 8
  givenname: Anurag
  surname: Mehta
  fullname: Mehta, Anurag
– sequence: 9
  givenname: C. V
  surname: Jawahar
  fullname: Jawahar, C. V
– sequence: 10
  givenname: P. K
  surname: Vinod
  fullname: Vinod, P. K
BackLink https://doi.org/10.48550/arXiv.2503.22069$$DView paper in arXiv
BookMark eNqFjrsOgkAURLfQwtcHWHl_AFwXMVojhIJGsCcbveAmuEvuLj7-XiH2NjOZZCZzpmykjUbGlhvub_dhyNeSXurhi5AHvhB8d5iwU2S0I2md0jVk5glSXyFV9c3L0Zqmc8poSFC6jtBCZQjSOBdQXAz1i872ekRsIUNJ-pvmbFzJxuLi5zO2SuJzlHrDedmSukt6lz1EOUAE_xsfRFw97A
ContentType Journal Article
Copyright http://creativecommons.org/publicdomain/zero/1.0
Copyright_xml – notice: http://creativecommons.org/publicdomain/zero/1.0
DBID AKY
GOX
DOI 10.48550/arxiv.2503.22069
DatabaseName arXiv Computer Science
arXiv.org
DatabaseTitleList
Database_xml – sequence: 1
  dbid: GOX
  name: arXiv.org
  url: http://arxiv.org/find
  sourceTypes: Open Access Repository
DeliveryMethod fulltext_linktorsrc
ExternalDocumentID 2503_22069
GroupedDBID AKY
GOX
ID FETCH-arxiv_primary_2503_220693
IEDL.DBID GOX
IngestDate Tue Jul 22 20:28:51 EDT 2025
IsDoiOpenAccess true
IsOpenAccess true
IsPeerReviewed false
IsScholarly false
Language English
LinkModel DirectLink
MergedId FETCHMERGED-arxiv_primary_2503_220693
OpenAccessLink https://arxiv.org/abs/2503.22069
ParticipantIDs arxiv_primary_2503_22069
PublicationCentury 2000
PublicationDate 2025-03-27
PublicationDateYYYYMMDD 2025-03-27
PublicationDate_xml – month: 03
  year: 2025
  text: 2025-03-27
  day: 27
PublicationDecade 2020
PublicationYear 2025
Score 3.8119047
SecondaryResourceType preprint
Snippet Breast cancer, the most common malignancy among women, requires precise detection and classification for effective treatment. Immunohistochemistry (IHC)...
SourceID arxiv
SourceType Open Access Repository
SubjectTerms Computer Science - Artificial Intelligence
Computer Science - Computer Vision and Pattern Recognition
Title Contrasting Low and High-Resolution Features for HER2 Scoring using Deep Learning
URI https://arxiv.org/abs/2503.22069
hasFullText 1
inHoldings 1
isFullTextHit
isPrint
link http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwdV07TwMxDLbaTiwIBKi8PbAGSi6X640IWk6Ih3hJt50uqVOxQHVtgZ9PnByCpWtiRVY8-PsS-zPAySRxympnhHKpFJ5_GZFbcsK52qXGDnMKXe9397p4VTdlWnYAf3th6ub77TPqA5v5mc_PyamUA513oSslk6vrhzJ-TgYprtb-z85jzLD0L0mMN2C9RXd4EcOxCR1634JHVoBq6jkXGOPtxxd68o5cXyH47TxGHhmKLT31RQ8isRg9SXy2oTYOuTJ9ildEM2zFUKfbcDwevVwWIjhRzaJiRMX-VcG_ZAd6ntdTH9AzIWMza7lLQxFPckyV1pOcWNAmo_Nd6K86ZW_11j6sSR5RO0iEzA6gt2iWdOjz5sIchcv7AbCDchY
linkProvider Cornell University
openUrl ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=Contrasting+Low+and+High-Resolution+Features+for+HER2+Scoring+using+Deep+Learning&rft.au=Chauhan%2C+Ekansh&rft.au=Sharma%2C+Anila&rft.au=Sharma%2C+Amit&rft.au=Nishadham%2C+Vikas&rft.date=2025-03-27&rft_id=info:doi/10.48550%2Farxiv.2503.22069&rft.externalDocID=2503_22069