On the Relation Between Linear Autoencoders and Non-Negative Matrix Factorization for Mutational Signature Extraction
Since its introduction, non-negative matrix factorization (NMF) has been a popular tool for extracting interpretable, low-dimensional representations of high-dimensional data. However, several recent studies have proposed replacing NMF with autoencoders. The increasing popularity of autoencoders war...
Saved in:
Published in | Journal of computational biology Vol. 32; no. 5; pp. 461 - 472 |
---|---|
Main Authors | , , , , |
Format | Journal Article |
Language | English |
Published |
United States
Mary Ann Liebert, Inc., publishers
01.05.2025
|
Subjects | |
Online Access | Get full text |
ISSN | 1557-8666 1557-8666 |
DOI | 10.1089/cmb.2024.0784 |
Cover
Abstract | Since its introduction, non-negative matrix factorization (NMF) has been a popular tool for extracting interpretable, low-dimensional representations of high-dimensional data. However, several recent studies have proposed replacing NMF with autoencoders. The increasing popularity of autoencoders warrants an investigation on whether this replacement is in general valid and reasonable. Moreover, the exact relationship between non-negative autoencoders and NMF has not been thoroughly explored. Thus, a main aim of this study is to investigate in detail the relationship between autoencoders and NMF. We define a non-negative linear autoencoder, AE-NMF, which is mathematically equivalent with convex NMF, a constrained version of NMF. The performance of NMF and the non-negative linear autoencoder is compared within the context of mutational signature extraction from simulated and real-world cancer genomics data. We find that the reconstructions based on NMF are more accurate compared with AE-NMF, while the signatures extracted using both methods exhibit comparable consistency and performance when externally validated. These findings suggest that AE-NMF, the linear non-negative autoencoders investigated in this article, do not provide an improvement of NMF in the field of mutational signature extraction. Our study serves as a foundation for understanding the theoretical implication of replacing NMF with non-negative autoencoders. |
---|---|
AbstractList | Since its introduction, non-negative matrix factorization (NMF) has been a popular tool for extracting interpretable, low-dimensional representations of high-dimensional data. However, several recent studies have proposed replacing NMF with autoencoders. The increasing popularity of autoencoders warrants an investigation on whether this replacement is in general valid and reasonable. Moreover, the exact relationship between non-negative autoencoders and NMF has not been thoroughly explored. Thus, a main aim of this study is to investigate in detail the relationship between autoencoders and NMF. We define a non-negative linear autoencoder, AE-NMF, which is mathematically equivalent with convex NMF, a constrained version of NMF. The performance of NMF and the non-negative linear autoencoder is compared within the context of mutational signature extraction from simulated and real-world cancer genomics data. We find that the reconstructions based on NMF are more accurate compared with AE-NMF, while the signatures extracted using both methods exhibit comparable consistency and performance when externally validated. These findings suggest that AE-NMF, the linear non-negative autoencoders investigated in this article, do not provide an improvement of NMF in the field of mutational signature extraction. Our study serves as a foundation for understanding the theoretical implication of replacing NMF with non-negative autoencoders. Since its introduction, non-negative matrix factorization (NMF) has been a popular tool for extracting interpretable, low-dimensional representations of high-dimensional data. However, several recent studies have proposed replacing NMF with autoencoders. The increasing popularity of autoencoders warrants an investigation on whether this replacement is in general valid and reasonable. Moreover, the exact relationship between non-negative autoencoders and NMF has not been thoroughly explored. Thus, a main aim of this study is to investigate in detail the relationship between autoencoders and NMF. We define a non-negative linear autoencoder, AE-NMF, which is mathematically equivalent with convex NMF, a constrained version of NMF. The performance of NMF and the non-negative linear autoencoder is compared within the context of mutational signature extraction from simulated and real-world cancer genomics data. We find that the reconstructions based on NMF are more accurate compared with AE-NMF, while the signatures extracted using both methods exhibit comparable consistency and performance when externally validated. These findings suggest that AE-NMF, the linear non-negative autoencoders investigated in this article, do not provide an improvement of NMF in the field of mutational signature extraction. Our study serves as a foundation for understanding the theoretical implication of replacing NMF with non-negative autoencoders.Since its introduction, non-negative matrix factorization (NMF) has been a popular tool for extracting interpretable, low-dimensional representations of high-dimensional data. However, several recent studies have proposed replacing NMF with autoencoders. The increasing popularity of autoencoders warrants an investigation on whether this replacement is in general valid and reasonable. Moreover, the exact relationship between non-negative autoencoders and NMF has not been thoroughly explored. Thus, a main aim of this study is to investigate in detail the relationship between autoencoders and NMF. We define a non-negative linear autoencoder, AE-NMF, which is mathematically equivalent with convex NMF, a constrained version of NMF. The performance of NMF and the non-negative linear autoencoder is compared within the context of mutational signature extraction from simulated and real-world cancer genomics data. We find that the reconstructions based on NMF are more accurate compared with AE-NMF, while the signatures extracted using both methods exhibit comparable consistency and performance when externally validated. These findings suggest that AE-NMF, the linear non-negative autoencoders investigated in this article, do not provide an improvement of NMF in the field of mutational signature extraction. Our study serves as a foundation for understanding the theoretical implication of replacing NMF with non-negative autoencoders. |
Author | Brøndum, Rasmus Froberg Hobolth, Asger Egendal, Ida Pelizzola, Marta Bøgsted, Martin |
Author_xml | – sequence: 1 givenname: Ida orcidid: 0000-0002-6189-6053 surname: Egendal fullname: Egendal, Ida – sequence: 2 givenname: Rasmus Froberg surname: Brøndum fullname: Brøndum, Rasmus Froberg – sequence: 3 givenname: Marta surname: Pelizzola fullname: Pelizzola, Marta – sequence: 4 givenname: Asger surname: Hobolth fullname: Hobolth, Asger – sequence: 5 givenname: Martin surname: Bøgsted fullname: Bøgsted, Martin |
BackLink | https://www.ncbi.nlm.nih.gov/pubmed/40113251$$D View this record in MEDLINE/PubMed |
BookMark | eNqF0TtPwzAQB3ALFdEHjKzII0uKHddxMpaqBaQ-JB5zZCeXEpTaxXag8OlJaEFsTD6ffnfD_fuoo40GhM4pGVISJ1fZRg1DEo6GRMSjI9SjnIsgjqKo86fuor5zL4RQFhFxgrojQikLOe2heqWxfwZ8D5X0pdH4Gvw7gMbzUoO0eFx7AzozOViHpc7x0uhgCesGvwFeSG_LHZ7JzBtbfu43FMbiRe2_P7LCD-VaS19bwNOdt41s2qfouJCVg7PDO0BPs-nj5DaYr27uJuN5kIUJ94GMKJcyJwmjhWJkJDmnWcTyUIg8UVyByptachEqVkCSk5ASRplgBVExYRkboMv93q01rzU4n25Kl0FVSQ2mdimjIok5FVw09OJAa7WBPN3aciPtR_pzqgYEe5BZ45yF4pdQkrZRpE0UaRtF2kbReLb3rZFaVyUosP6fqS92Uo3P |
Cites_doi | 10.1016/j.celrep.2012.12.008 10.1109/TNNLS.2015.2479223 10.1109/TPAMI.2008.277 10.1093/bioinformatics/btae320 10.1038/44565 10.1038/s41388-020-1343-z 10.1038/s41586-020-2434-2 10.1126/science.abl9283 10.1109/LGRS.2018.2823425 10.1038/s41586-020-1943-3 10.1093/mutage/gev073 10.1002/aic.690370209 10.1016/j.neunet.2012.05.003 10.1002/9780470316801.ch2 10.1186/s13073-018-0539-0 10.1016/j.xgen.2022.100179 10.1093/nar/gky1015 10.1093/annonc/mdy054 |
ContentType | Journal Article |
Copyright | 2025, Mary Ann Liebert, Inc., publishers |
Copyright_xml | – notice: 2025, Mary Ann Liebert, Inc., publishers |
DBID | AAYXX CITATION CGR CUY CVF ECM EIF NPM 7X8 |
DOI | 10.1089/cmb.2024.0784 |
DatabaseName | CrossRef Medline MEDLINE MEDLINE (Ovid) MEDLINE MEDLINE PubMed MEDLINE - Academic |
DatabaseTitle | CrossRef MEDLINE Medline Complete MEDLINE with Full Text PubMed MEDLINE (Ovid) MEDLINE - Academic |
DatabaseTitleList | MEDLINE MEDLINE - Academic |
Database_xml | – sequence: 1 dbid: NPM name: PubMed url: https://proxy.k.utb.cz/login?url=http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=PubMed sourceTypes: Index Database – sequence: 2 dbid: EIF name: MEDLINE url: https://proxy.k.utb.cz/login?url=https://www.webofscience.com/wos/medline/basic-search sourceTypes: Index Database |
DeliveryMethod | fulltext_linktorsrc |
Discipline | Biology Mathematics |
EISSN | 1557-8666 |
EndPage | 472 |
ExternalDocumentID | 40113251 10_1089_cmb_2024_0784 |
Genre | Journal Article |
GroupedDBID | --- 0R~ 29K 4.4 53G 5GY ABBKN ACGFO ADBBV AENEX AFOSN ALMA_UNASSIGNED_HOLDINGS BAWUL BNQNF CS3 D-I DIK DU5 EBS F5P IAO IHR IM4 MV1 NQHIM O9- P2P RML RNS TN5 TR2 UE5 AAYXX CITATION 34G 39C ABEFU AI. CAG CGR COF CUY CVF ECM EIF EJD IER IGS ITC NPM R.V RIG RMSOB VH1 7X8 SCNPE |
ID | FETCH-LOGICAL-c295t-a615aad0931fb304a551c63d277d9b5bebdd27a572b3fe9d021031373f0b803c3 |
ISSN | 1557-8666 |
IngestDate | Fri Sep 05 14:34:30 EDT 2025 Tue May 13 01:30:45 EDT 2025 Sun Jul 06 05:03:39 EDT 2025 Sat May 10 06:40:19 EDT 2025 |
IsPeerReviewed | true |
IsScholarly | true |
Issue | 5 |
Keywords | convex non-negative matrix factorization non-negative matrix factorization mutational signatures non-negative autoencoders |
Language | English |
License | https://www.liebertpub.com/nv/resources-tools/text-and-data-mining-policy/121 |
LinkModel | OpenURL |
MergedId | FETCHMERGED-LOGICAL-c295t-a615aad0931fb304a551c63d277d9b5bebdd27a572b3fe9d021031373f0b803c3 |
Notes | ObjectType-Article-1 SourceType-Scholarly Journals-1 ObjectType-Feature-2 content type line 23 |
ORCID | 0000-0002-6189-6053 |
PMID | 40113251 |
PQID | 3179851757 |
PQPubID | 23479 |
PageCount | 12 |
ParticipantIDs | proquest_miscellaneous_3179851757 pubmed_primary_40113251 crossref_primary_10_1089_cmb_2024_0784 maryannliebert_primary_10_1089_cmb_2024_0784 |
PublicationCentury | 2000 |
PublicationDate | 2025-05-01 |
PublicationDateYYYYMMDD | 2025-05-01 |
PublicationDate_xml | – month: 05 year: 2025 text: 2025-05-01 day: 01 |
PublicationDecade | 2020 |
PublicationPlace | United States |
PublicationPlace_xml | – name: United States |
PublicationTitle | Journal of computational biology |
PublicationTitleAlternate | J Comput Biol |
PublicationYear | 2025 |
Publisher | Mary Ann Liebert, Inc., publishers |
Publisher_xml | – name: Mary Ann Liebert, Inc., publishers |
References | B20 B21 B11 B12 B13 B14 B15 B16 B19 B1 B2 B3 B4 B5 B6 B7 B8 B9 Squires S (B18) 2019 |
References_xml | – ident: B1 doi: 10.1016/j.celrep.2012.12.008 – ident: B7 doi: 10.1109/TNNLS.2015.2479223 – ident: B5 doi: 10.1109/TPAMI.2008.277 – ident: B15 doi: 10.1093/bioinformatics/btae320 – ident: B12 doi: 10.1038/44565 – ident: B16 doi: 10.1038/s41388-020-1343-z – year: 2019 ident: B18 publication-title: arXiv Preprint arXiv – ident: B21 doi: 10.1038/s41586-020-2434-2 – ident: B4 doi: 10.1126/science.abl9283 – ident: B6 doi: 10.1109/LGRS.2018.2823425 – ident: B2 doi: 10.1038/s41586-020-1943-3 – ident: B14 doi: 10.1093/mutage/gev073 – ident: B11 doi: 10.1002/aic.690370209 – ident: B13 doi: 10.1016/j.neunet.2012.05.003 – ident: B9 doi: 10.1002/9780470316801.ch2 – ident: B3 doi: 10.1186/s13073-018-0539-0 – ident: B8 doi: 10.1016/j.xgen.2022.100179 – ident: B19 doi: 10.1093/nar/gky1015 – ident: B20 doi: 10.1093/annonc/mdy054 |
SSID | ssj0013607 |
Score | 2.4370987 |
Snippet | Since its introduction, non-negative matrix factorization (NMF) has been a popular tool for extracting interpretable, low-dimensional representations of... |
SourceID | proquest pubmed crossref maryannliebert |
SourceType | Aggregation Database Index Database Publisher |
StartPage | 461 |
SubjectTerms | Algorithms Autoencoder Computational Biology - methods Genomics - methods Humans Mutation Neoplasms - genetics Original Articles |
Title | On the Relation Between Linear Autoencoders and Non-Negative Matrix Factorization for Mutational Signature Extraction |
URI | https://www.liebertpub.com/doi/abs/10.1089/cmb.2024.0784 https://www.ncbi.nlm.nih.gov/pubmed/40113251 https://www.proquest.com/docview/3179851757 |
Volume | 32 |
hasFullText | 1 |
inHoldings | 1 |
isFullTextHit | |
isPrint | |
link | http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwnV1Zb9NAEF6FIiSQQFCucGmREC8hqe3d9fGYolYpKOkDrdQ3a9e7jiq1NqptqeSH8fuYPWwnkIrCi-VYWV_zeWdmZ-YbhD5EuQ92m-bYVwocFELlOJHwPYok5HHis5xRXeA8X4SzU_rljJ0NBj_XspaaWkyy1da6kv-RKhwDueoq2X-QbHdSOAD7IF_YgoRheysZH9scxTahbbTvkq7AwdT8PNOmLjVPpc5VNkGCRVmMF2ppub7nmpz_enRoGu64akyTdDhv6naF8Nv50jJ_jg6u6ytbA3GDOZuZ9hDtQMft1Nnr8AjStBYYHa0vAegw_X5cSNdsmVeXTQW2dNnxbplJ--J8tQIP3JUW1d34WSnKC7suNK2WLs3YrWAErM8XbCddBpoyDB0l9pZjbqbuV0L7WLiZdqkldHcanNpmQH8oBy_W3KrZpZjAXdAJGEe014Jt5P835dilLJpgfZykMDzVw1M9_A66G0SRTQ84-tpHr0JTpt89g-N2heF7G1ffsIUe6lJFXhTgeuiU-ps9HmP5nDxGj5yM8dTi7wkaqGIX3bNNTH_sogfzjvm3eoqa4wLDT9xiEjtMYotJvI5JDJjE65jEFpN4A5MYMIl7TOIOk7jH5DN0enhw8nk2dq09xlmQsHrMwZDmXHoJ8XNBPMrBcM9CIuFtykQwoYSEfc6iQJBcJVKvTBCfRCT3ROyRjDxHO0VZqJcIg78uw0zlLKAZZXGW0Jz7US48KsHXicMh-ti-5PS7ZXBJtwpziD5tiuBvf3_fCiiFKVnH2XihyqZKiSYBZGCXR0P0wkquOxUFfUrAp3h127t6je7338wbtFNfNeot2MG1eGdA9wsnFbXZ |
linkProvider | Flying Publisher |
openUrl | ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=On+the+Relation+Between+Linear+Autoencoders+and+Non-Negative+Matrix+Factorization+for+Mutational+Signature+Extraction&rft.jtitle=Journal+of+computational+biology&rft.au=Egendal%2C+Ida&rft.au=Br%C3%B8ndum%2C+Rasmus+Froberg&rft.au=Pelizzola%2C+Marta&rft.au=Hobolth%2C+Asger&rft.date=2025-05-01&rft.issn=1557-8666&rft.eissn=1557-8666&rft.volume=32&rft.issue=5&rft.spage=461&rft.epage=472&rft_id=info:doi/10.1089%2Fcmb.2024.0784&rft.externalDBID=n%2Fa&rft.externalDocID=10_1089_cmb_2024_0784 |
thumbnail_l | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/lc.gif&issn=1557-8666&client=summon |
thumbnail_m | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/mc.gif&issn=1557-8666&client=summon |
thumbnail_s | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/sc.gif&issn=1557-8666&client=summon |