On the Relation Between Linear Autoencoders and Non-Negative Matrix Factorization for Mutational Signature Extraction

Since its introduction, non-negative matrix factorization (NMF) has been a popular tool for extracting interpretable, low-dimensional representations of high-dimensional data. However, several recent studies have proposed replacing NMF with autoencoders. The increasing popularity of autoencoders war...

Full description

Saved in:
Bibliographic Details
Published inJournal of computational biology Vol. 32; no. 5; pp. 461 - 472
Main Authors Egendal, Ida, Brøndum, Rasmus Froberg, Pelizzola, Marta, Hobolth, Asger, Bøgsted, Martin
Format Journal Article
LanguageEnglish
Published United States Mary Ann Liebert, Inc., publishers 01.05.2025
Subjects
Online AccessGet full text
ISSN1557-8666
1557-8666
DOI10.1089/cmb.2024.0784

Cover

Abstract Since its introduction, non-negative matrix factorization (NMF) has been a popular tool for extracting interpretable, low-dimensional representations of high-dimensional data. However, several recent studies have proposed replacing NMF with autoencoders. The increasing popularity of autoencoders warrants an investigation on whether this replacement is in general valid and reasonable. Moreover, the exact relationship between non-negative autoencoders and NMF has not been thoroughly explored. Thus, a main aim of this study is to investigate in detail the relationship between autoencoders and NMF. We define a non-negative linear autoencoder, AE-NMF, which is mathematically equivalent with convex NMF, a constrained version of NMF. The performance of NMF and the non-negative linear autoencoder is compared within the context of mutational signature extraction from simulated and real-world cancer genomics data. We find that the reconstructions based on NMF are more accurate compared with AE-NMF, while the signatures extracted using both methods exhibit comparable consistency and performance when externally validated. These findings suggest that AE-NMF, the linear non-negative autoencoders investigated in this article, do not provide an improvement of NMF in the field of mutational signature extraction. Our study serves as a foundation for understanding the theoretical implication of replacing NMF with non-negative autoencoders.
AbstractList Since its introduction, non-negative matrix factorization (NMF) has been a popular tool for extracting interpretable, low-dimensional representations of high-dimensional data. However, several recent studies have proposed replacing NMF with autoencoders. The increasing popularity of autoencoders warrants an investigation on whether this replacement is in general valid and reasonable. Moreover, the exact relationship between non-negative autoencoders and NMF has not been thoroughly explored. Thus, a main aim of this study is to investigate in detail the relationship between autoencoders and NMF. We define a non-negative linear autoencoder, AE-NMF, which is mathematically equivalent with convex NMF, a constrained version of NMF. The performance of NMF and the non-negative linear autoencoder is compared within the context of mutational signature extraction from simulated and real-world cancer genomics data. We find that the reconstructions based on NMF are more accurate compared with AE-NMF, while the signatures extracted using both methods exhibit comparable consistency and performance when externally validated. These findings suggest that AE-NMF, the linear non-negative autoencoders investigated in this article, do not provide an improvement of NMF in the field of mutational signature extraction. Our study serves as a foundation for understanding the theoretical implication of replacing NMF with non-negative autoencoders.
Since its introduction, non-negative matrix factorization (NMF) has been a popular tool for extracting interpretable, low-dimensional representations of high-dimensional data. However, several recent studies have proposed replacing NMF with autoencoders. The increasing popularity of autoencoders warrants an investigation on whether this replacement is in general valid and reasonable. Moreover, the exact relationship between non-negative autoencoders and NMF has not been thoroughly explored. Thus, a main aim of this study is to investigate in detail the relationship between autoencoders and NMF. We define a non-negative linear autoencoder, AE-NMF, which is mathematically equivalent with convex NMF, a constrained version of NMF. The performance of NMF and the non-negative linear autoencoder is compared within the context of mutational signature extraction from simulated and real-world cancer genomics data. We find that the reconstructions based on NMF are more accurate compared with AE-NMF, while the signatures extracted using both methods exhibit comparable consistency and performance when externally validated. These findings suggest that AE-NMF, the linear non-negative autoencoders investigated in this article, do not provide an improvement of NMF in the field of mutational signature extraction. Our study serves as a foundation for understanding the theoretical implication of replacing NMF with non-negative autoencoders.Since its introduction, non-negative matrix factorization (NMF) has been a popular tool for extracting interpretable, low-dimensional representations of high-dimensional data. However, several recent studies have proposed replacing NMF with autoencoders. The increasing popularity of autoencoders warrants an investigation on whether this replacement is in general valid and reasonable. Moreover, the exact relationship between non-negative autoencoders and NMF has not been thoroughly explored. Thus, a main aim of this study is to investigate in detail the relationship between autoencoders and NMF. We define a non-negative linear autoencoder, AE-NMF, which is mathematically equivalent with convex NMF, a constrained version of NMF. The performance of NMF and the non-negative linear autoencoder is compared within the context of mutational signature extraction from simulated and real-world cancer genomics data. We find that the reconstructions based on NMF are more accurate compared with AE-NMF, while the signatures extracted using both methods exhibit comparable consistency and performance when externally validated. These findings suggest that AE-NMF, the linear non-negative autoencoders investigated in this article, do not provide an improvement of NMF in the field of mutational signature extraction. Our study serves as a foundation for understanding the theoretical implication of replacing NMF with non-negative autoencoders.
Author Brøndum, Rasmus Froberg
Hobolth, Asger
Egendal, Ida
Pelizzola, Marta
Bøgsted, Martin
Author_xml – sequence: 1
  givenname: Ida
  orcidid: 0000-0002-6189-6053
  surname: Egendal
  fullname: Egendal, Ida
– sequence: 2
  givenname: Rasmus Froberg
  surname: Brøndum
  fullname: Brøndum, Rasmus Froberg
– sequence: 3
  givenname: Marta
  surname: Pelizzola
  fullname: Pelizzola, Marta
– sequence: 4
  givenname: Asger
  surname: Hobolth
  fullname: Hobolth, Asger
– sequence: 5
  givenname: Martin
  surname: Bøgsted
  fullname: Bøgsted, Martin
BackLink https://www.ncbi.nlm.nih.gov/pubmed/40113251$$D View this record in MEDLINE/PubMed
BookMark eNqF0TtPwzAQB3ALFdEHjKzII0uKHddxMpaqBaQ-JB5zZCeXEpTaxXag8OlJaEFsTD6ffnfD_fuoo40GhM4pGVISJ1fZRg1DEo6GRMSjI9SjnIsgjqKo86fuor5zL4RQFhFxgrojQikLOe2heqWxfwZ8D5X0pdH4Gvw7gMbzUoO0eFx7AzozOViHpc7x0uhgCesGvwFeSG_LHZ7JzBtbfu43FMbiRe2_P7LCD-VaS19bwNOdt41s2qfouJCVg7PDO0BPs-nj5DaYr27uJuN5kIUJ94GMKJcyJwmjhWJkJDmnWcTyUIg8UVyByptachEqVkCSk5ASRplgBVExYRkboMv93q01rzU4n25Kl0FVSQ2mdimjIok5FVw09OJAa7WBPN3aciPtR_pzqgYEe5BZ45yF4pdQkrZRpE0UaRtF2kbReLb3rZFaVyUosP6fqS92Uo3P
Cites_doi 10.1016/j.celrep.2012.12.008
10.1109/TNNLS.2015.2479223
10.1109/TPAMI.2008.277
10.1093/bioinformatics/btae320
10.1038/44565
10.1038/s41388-020-1343-z
10.1038/s41586-020-2434-2
10.1126/science.abl9283
10.1109/LGRS.2018.2823425
10.1038/s41586-020-1943-3
10.1093/mutage/gev073
10.1002/aic.690370209
10.1016/j.neunet.2012.05.003
10.1002/9780470316801.ch2
10.1186/s13073-018-0539-0
10.1016/j.xgen.2022.100179
10.1093/nar/gky1015
10.1093/annonc/mdy054
ContentType Journal Article
Copyright 2025, Mary Ann Liebert, Inc., publishers
Copyright_xml – notice: 2025, Mary Ann Liebert, Inc., publishers
DBID AAYXX
CITATION
CGR
CUY
CVF
ECM
EIF
NPM
7X8
DOI 10.1089/cmb.2024.0784
DatabaseName CrossRef
Medline
MEDLINE
MEDLINE (Ovid)
MEDLINE
MEDLINE
PubMed
MEDLINE - Academic
DatabaseTitle CrossRef
MEDLINE
Medline Complete
MEDLINE with Full Text
PubMed
MEDLINE (Ovid)
MEDLINE - Academic
DatabaseTitleList MEDLINE

MEDLINE - Academic
Database_xml – sequence: 1
  dbid: NPM
  name: PubMed
  url: https://proxy.k.utb.cz/login?url=http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=PubMed
  sourceTypes: Index Database
– sequence: 2
  dbid: EIF
  name: MEDLINE
  url: https://proxy.k.utb.cz/login?url=https://www.webofscience.com/wos/medline/basic-search
  sourceTypes: Index Database
DeliveryMethod fulltext_linktorsrc
Discipline Biology
Mathematics
EISSN 1557-8666
EndPage 472
ExternalDocumentID 40113251
10_1089_cmb_2024_0784
Genre Journal Article
GroupedDBID ---
0R~
29K
4.4
53G
5GY
ABBKN
ACGFO
ADBBV
AENEX
AFOSN
ALMA_UNASSIGNED_HOLDINGS
BAWUL
BNQNF
CS3
D-I
DIK
DU5
EBS
F5P
IAO
IHR
IM4
MV1
NQHIM
O9-
P2P
RML
RNS
TN5
TR2
UE5
AAYXX
CITATION
34G
39C
ABEFU
AI.
CAG
CGR
COF
CUY
CVF
ECM
EIF
EJD
IER
IGS
ITC
NPM
R.V
RIG
RMSOB
VH1
7X8
SCNPE
ID FETCH-LOGICAL-c295t-a615aad0931fb304a551c63d277d9b5bebdd27a572b3fe9d021031373f0b803c3
ISSN 1557-8666
IngestDate Fri Sep 05 14:34:30 EDT 2025
Tue May 13 01:30:45 EDT 2025
Sun Jul 06 05:03:39 EDT 2025
Sat May 10 06:40:19 EDT 2025
IsPeerReviewed true
IsScholarly true
Issue 5
Keywords convex non-negative matrix factorization
non-negative matrix factorization
mutational signatures
non-negative autoencoders
Language English
License https://www.liebertpub.com/nv/resources-tools/text-and-data-mining-policy/121
LinkModel OpenURL
MergedId FETCHMERGED-LOGICAL-c295t-a615aad0931fb304a551c63d277d9b5bebdd27a572b3fe9d021031373f0b803c3
Notes ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 23
ORCID 0000-0002-6189-6053
PMID 40113251
PQID 3179851757
PQPubID 23479
PageCount 12
ParticipantIDs proquest_miscellaneous_3179851757
pubmed_primary_40113251
crossref_primary_10_1089_cmb_2024_0784
maryannliebert_primary_10_1089_cmb_2024_0784
PublicationCentury 2000
PublicationDate 2025-05-01
PublicationDateYYYYMMDD 2025-05-01
PublicationDate_xml – month: 05
  year: 2025
  text: 2025-05-01
  day: 01
PublicationDecade 2020
PublicationPlace United States
PublicationPlace_xml – name: United States
PublicationTitle Journal of computational biology
PublicationTitleAlternate J Comput Biol
PublicationYear 2025
Publisher Mary Ann Liebert, Inc., publishers
Publisher_xml – name: Mary Ann Liebert, Inc., publishers
References B20
B21
B11
B12
B13
B14
B15
B16
B19
B1
B2
B3
B4
B5
B6
B7
B8
B9
Squires S (B18) 2019
References_xml – ident: B1
  doi: 10.1016/j.celrep.2012.12.008
– ident: B7
  doi: 10.1109/TNNLS.2015.2479223
– ident: B5
  doi: 10.1109/TPAMI.2008.277
– ident: B15
  doi: 10.1093/bioinformatics/btae320
– ident: B12
  doi: 10.1038/44565
– ident: B16
  doi: 10.1038/s41388-020-1343-z
– year: 2019
  ident: B18
  publication-title: arXiv Preprint arXiv
– ident: B21
  doi: 10.1038/s41586-020-2434-2
– ident: B4
  doi: 10.1126/science.abl9283
– ident: B6
  doi: 10.1109/LGRS.2018.2823425
– ident: B2
  doi: 10.1038/s41586-020-1943-3
– ident: B14
  doi: 10.1093/mutage/gev073
– ident: B11
  doi: 10.1002/aic.690370209
– ident: B13
  doi: 10.1016/j.neunet.2012.05.003
– ident: B9
  doi: 10.1002/9780470316801.ch2
– ident: B3
  doi: 10.1186/s13073-018-0539-0
– ident: B8
  doi: 10.1016/j.xgen.2022.100179
– ident: B19
  doi: 10.1093/nar/gky1015
– ident: B20
  doi: 10.1093/annonc/mdy054
SSID ssj0013607
Score 2.4370987
Snippet Since its introduction, non-negative matrix factorization (NMF) has been a popular tool for extracting interpretable, low-dimensional representations of...
SourceID proquest
pubmed
crossref
maryannliebert
SourceType Aggregation Database
Index Database
Publisher
StartPage 461
SubjectTerms Algorithms
Autoencoder
Computational Biology - methods
Genomics - methods
Humans
Mutation
Neoplasms - genetics
Original Articles
Title On the Relation Between Linear Autoencoders and Non-Negative Matrix Factorization for Mutational Signature Extraction
URI https://www.liebertpub.com/doi/abs/10.1089/cmb.2024.0784
https://www.ncbi.nlm.nih.gov/pubmed/40113251
https://www.proquest.com/docview/3179851757
Volume 32
hasFullText 1
inHoldings 1
isFullTextHit
isPrint
link http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwnV1Zb9NAEF6FIiSQQFCucGmREC8hqe3d9fGYolYpKOkDrdQ3a9e7jiq1NqptqeSH8fuYPWwnkIrCi-VYWV_zeWdmZ-YbhD5EuQ92m-bYVwocFELlOJHwPYok5HHis5xRXeA8X4SzU_rljJ0NBj_XspaaWkyy1da6kv-RKhwDueoq2X-QbHdSOAD7IF_YgoRheysZH9scxTahbbTvkq7AwdT8PNOmLjVPpc5VNkGCRVmMF2ppub7nmpz_enRoGu64akyTdDhv6naF8Nv50jJ_jg6u6ytbA3GDOZuZ9hDtQMft1Nnr8AjStBYYHa0vAegw_X5cSNdsmVeXTQW2dNnxbplJ--J8tQIP3JUW1d34WSnKC7suNK2WLs3YrWAErM8XbCddBpoyDB0l9pZjbqbuV0L7WLiZdqkldHcanNpmQH8oBy_W3KrZpZjAXdAJGEe014Jt5P835dilLJpgfZykMDzVw1M9_A66G0SRTQ84-tpHr0JTpt89g-N2heF7G1ffsIUe6lJFXhTgeuiU-ps9HmP5nDxGj5yM8dTi7wkaqGIX3bNNTH_sogfzjvm3eoqa4wLDT9xiEjtMYotJvI5JDJjE65jEFpN4A5MYMIl7TOIOk7jH5DN0enhw8nk2dq09xlmQsHrMwZDmXHoJ8XNBPMrBcM9CIuFtykQwoYSEfc6iQJBcJVKvTBCfRCT3ROyRjDxHO0VZqJcIg78uw0zlLKAZZXGW0Jz7US48KsHXicMh-ti-5PS7ZXBJtwpziD5tiuBvf3_fCiiFKVnH2XihyqZKiSYBZGCXR0P0wkquOxUFfUrAp3h127t6je7338wbtFNfNeot2MG1eGdA9wsnFbXZ
linkProvider Flying Publisher
openUrl ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=On+the+Relation+Between+Linear+Autoencoders+and+Non-Negative+Matrix+Factorization+for+Mutational+Signature+Extraction&rft.jtitle=Journal+of+computational+biology&rft.au=Egendal%2C+Ida&rft.au=Br%C3%B8ndum%2C+Rasmus+Froberg&rft.au=Pelizzola%2C+Marta&rft.au=Hobolth%2C+Asger&rft.date=2025-05-01&rft.issn=1557-8666&rft.eissn=1557-8666&rft.volume=32&rft.issue=5&rft.spage=461&rft.epage=472&rft_id=info:doi/10.1089%2Fcmb.2024.0784&rft.externalDBID=n%2Fa&rft.externalDocID=10_1089_cmb_2024_0784
thumbnail_l http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/lc.gif&issn=1557-8666&client=summon
thumbnail_m http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/mc.gif&issn=1557-8666&client=summon
thumbnail_s http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/sc.gif&issn=1557-8666&client=summon