Oral cancer detection and interpretation: Deep multiple instance learning versus conventional deep single instance learning

The current medical standard for setting an oral cancer (OC) diagnosis is histological examination of a tissue sample from the oral cavity. This process is time consuming and more invasive than an alternative approach of acquiring a brush sample followed by cytological analysis. Skilled cytotechnolo...

Full description

Saved in:
Bibliographic Details
Main Authors Koriakina, Nadezhda, Sladoje, Nataša, Bašić, Vladimir, Lindblad, Joakim
Format Journal Article
LanguageEnglish
Published 03.02.2022
Subjects
Online AccessGet full text

Cover

Loading…
Abstract The current medical standard for setting an oral cancer (OC) diagnosis is histological examination of a tissue sample from the oral cavity. This process is time consuming and more invasive than an alternative approach of acquiring a brush sample followed by cytological analysis. Skilled cytotechnologists are able to detect changes due to malignancy, however, to introduce this approach into clinical routine is associated with challenges such as a lack of experts and labour-intensive work. To design a trustworthy OC detection system that would assist cytotechnologists, we are interested in AI-based methods that reliably can detect cancer given only per-patient labels (minimizing annotation bias), and also provide information on which cells are most relevant for the diagnosis (enabling supervision and understanding). We, therefore, perform a comparison of a conventional single instance learning (SIL) approach and a modern multiple instance learning (MIL) method suitable for OC detection and interpretation, utilizing three different neural network architectures. To facilitate systematic evaluation of the considered approaches, we introduce a synthetic PAP-QMNIST dataset, that serves as a model of OC data, while offering access to per-instance ground truth. Our study indicates that on PAP-QMNIST, the SIL performs better, on average, than the MIL approach. Performance at the bag level on real-world cytological data is similar for both methods, yet the single instance approach performs better on average. Visual examination by cytotechnologist indicates that the methods manage to identify cells which deviate from normality, including malignant cells as well as those suspicious for dysplasia. We share the code as open source at https://github.com/MIDA-group/OralCancerMILvsSIL
AbstractList The current medical standard for setting an oral cancer (OC) diagnosis is histological examination of a tissue sample from the oral cavity. This process is time consuming and more invasive than an alternative approach of acquiring a brush sample followed by cytological analysis. Skilled cytotechnologists are able to detect changes due to malignancy, however, to introduce this approach into clinical routine is associated with challenges such as a lack of experts and labour-intensive work. To design a trustworthy OC detection system that would assist cytotechnologists, we are interested in AI-based methods that reliably can detect cancer given only per-patient labels (minimizing annotation bias), and also provide information on which cells are most relevant for the diagnosis (enabling supervision and understanding). We, therefore, perform a comparison of a conventional single instance learning (SIL) approach and a modern multiple instance learning (MIL) method suitable for OC detection and interpretation, utilizing three different neural network architectures. To facilitate systematic evaluation of the considered approaches, we introduce a synthetic PAP-QMNIST dataset, that serves as a model of OC data, while offering access to per-instance ground truth. Our study indicates that on PAP-QMNIST, the SIL performs better, on average, than the MIL approach. Performance at the bag level on real-world cytological data is similar for both methods, yet the single instance approach performs better on average. Visual examination by cytotechnologist indicates that the methods manage to identify cells which deviate from normality, including malignant cells as well as those suspicious for dysplasia. We share the code as open source at https://github.com/MIDA-group/OralCancerMILvsSIL
Author Lindblad, Joakim
Sladoje, Nataša
Bašić, Vladimir
Koriakina, Nadezhda
Author_xml – sequence: 1
  givenname: Nadezhda
  surname: Koriakina
  fullname: Koriakina, Nadezhda
– sequence: 2
  givenname: Nataša
  surname: Sladoje
  fullname: Sladoje, Nataša
– sequence: 3
  givenname: Vladimir
  surname: Bašić
  fullname: Bašić, Vladimir
– sequence: 4
  givenname: Joakim
  surname: Lindblad
  fullname: Lindblad, Joakim
BackLink https://doi.org/10.48550/arXiv.2202.01783$$DView paper in arXiv
BookMark eNptkM1OwzAQhH2AAxQegBN-gQTbcVKbGyo_RarUS-_RZr1BllInctwIxMuTFI6cRprRN9LMNbsIfSDG7qTItSlL8QDx00-5UkLlQq5NccW-9xE6jhCQIneUCJPvA4fguA-J4hApwWI98meigR9PXfJDR3M6poXiHUEMPnzwieJ4Gjn2YaKwIHOxW5hxTv8jbthlC91It3-6YofXl8Nmm-32b--bp10G1brILOoSqTKN09q5wkqQpdWyBd2iQQJVSoFkG-OM0VDNO7UVqAw2mqxqZbFi97-15_X1EP0R4le9vFCfXyh-AGZjXN8
ContentType Journal Article
Copyright http://arxiv.org/licenses/nonexclusive-distrib/1.0
Copyright_xml – notice: http://arxiv.org/licenses/nonexclusive-distrib/1.0
DBID AKY
GOX
DOI 10.48550/arxiv.2202.01783
DatabaseName arXiv Computer Science
arXiv.org
DatabaseTitleList
Database_xml – sequence: 1
  dbid: GOX
  name: arXiv.org
  url: http://arxiv.org/find
  sourceTypes: Open Access Repository
DeliveryMethod fulltext_linktorsrc
ExternalDocumentID 2202_01783
GroupedDBID AKY
GOX
ID FETCH-LOGICAL-a673-9c45ce68bd44dd391a15941fa4fc8cea2510ce9b8d884a6855490c28cb4e92f13
IEDL.DBID GOX
IngestDate Mon Jan 08 05:45:20 EST 2024
IsDoiOpenAccess true
IsOpenAccess true
IsPeerReviewed false
IsScholarly false
Language English
LinkModel DirectLink
MergedId FETCHMERGED-LOGICAL-a673-9c45ce68bd44dd391a15941fa4fc8cea2510ce9b8d884a6855490c28cb4e92f13
OpenAccessLink https://arxiv.org/abs/2202.01783
ParticipantIDs arxiv_primary_2202_01783
PublicationCentury 2000
PublicationDate 2022-02-03
PublicationDateYYYYMMDD 2022-02-03
PublicationDate_xml – month: 02
  year: 2022
  text: 2022-02-03
  day: 03
PublicationDecade 2020
PublicationYear 2022
Score 1.8367958
SecondaryResourceType preprint
Snippet The current medical standard for setting an oral cancer (OC) diagnosis is histological examination of a tissue sample from the oral cavity. This process is...
SourceID arxiv
SourceType Open Access Repository
SubjectTerms Computer Science - Computer Vision and Pattern Recognition
Computer Science - Learning
Title Oral cancer detection and interpretation: Deep multiple instance learning versus conventional deep single instance learning
URI https://arxiv.org/abs/2202.01783
hasFullText 1
inHoldings 1
isFullTextHit
isPrint
link http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwdV09T8MwED21nVgQCFD5lAfWiMZ2EocNAaVioEuRskX22UFdQpWmCIk_z9lJRZFgPfs8PMv2s_z8DuA68_XLiapGHJMqkpmlfZC24iiOhba88sXNg9vnSzp7lc9FUgyAbf_C6OZz-dH5A5v1DefeTzPOlBjCkHMv2XqaF93jZLDi6vv_9COOGUI7h8T0APZ7dsfuuuk4hIGrj-Br3lAMPbwNs64N4qea0RWeLX9J_m7Zg3MrttX4UaunbuhYX9rhjXkNxWbNdqXiNCLl-Bv_XxnHsJg-Lu5nUV_zINJpJqIcZYIuVcZKaa3IY010Q8aVlhUqdJrYyARdbpRVSurUa8zyCXKFRrqcoBUnMKrfazcGVgkKYzKJTWJoYUqToCF6JayTNkOpTmEckCpXna1F6UEsA4hn_zedwx73HwC8bllcwKhtNu6SjuXWXIW5-QacQo__
link.rule.ids 228,230,783,888
linkProvider Cornell University
openUrl ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=Oral+cancer+detection+and+interpretation%3A+Deep+multiple+instance+learning+versus+conventional+deep+single+instance+learning&rft.au=Koriakina%2C+Nadezhda&rft.au=Sladoje%2C+Nata%C5%A1a&rft.au=Ba%C5%A1i%C4%87%2C+Vladimir&rft.au=Lindblad%2C+Joakim&rft.date=2022-02-03&rft_id=info:doi/10.48550%2Farxiv.2202.01783&rft.externalDocID=2202_01783