Blind audio source separation by NTF and its perceptual quality evaluation
In this paper, the audio blind source separation (BSS) using three dimensional nonnegative tensor factorization (3D-NTF), is realized. The audio source separation is modeled as an optimization problem and the β-divergence cost function is iteratively optimized by alternating multiplicative update ru...
Saved in:
Published in | 2008 IEEE 16th Signal Processing, Communication and Applications Conference pp. 1 - 4 |
---|---|
Main Authors | , |
Format | Conference Proceeding |
Language | English |
Published |
IEEE
01.04.2008
|
Subjects | |
Online Access | Get full text |
ISBN | 1424419980 9781424419982 |
ISSN | 2165-0608 |
DOI | 10.1109/SIU.2008.4632692 |
Cover
Loading…
Abstract | In this paper, the audio blind source separation (BSS) using three dimensional nonnegative tensor factorization (3D-NTF), is realized. The audio source separation is modeled as an optimization problem and the β-divergence cost function is iteratively optimized by alternating multiplicative update rules. The traditional measures which are used to evaluate the decomposition performance are known to be not informative about perceptual quality of the audio signals. Therefore performance of the designed system is evaluated not only with the well known Amari index, but also with perceptual audio quality criterions which are defined in the recommendation report, ITU-R BS.1387 of International Telecommunication Union (ITU). In this study, it has been shown that source decomposition performance of the NTF modelling on audio data mixed under different conditions, is superior to the nonnegative matrix factorization (NMF). Furthermore, it has been observed that some of the decomposed sources are acceptable according to Amari index while thay are not with respect to the perceptual quality criteria thus it can be concluded that the perceptual criteria is more suitable to objective quality evaluation of audio. |
---|---|
AbstractList | In this paper, the audio blind source separation (BSS) using three dimensional nonnegative tensor factorization (3D-NTF), is realized. The audio source separation is modeled as an optimization problem and the β-divergence cost function is iteratively optimized by alternating multiplicative update rules. The traditional measures which are used to evaluate the decomposition performance are known to be not informative about perceptual quality of the audio signals. Therefore performance of the designed system is evaluated not only with the well known Amari index, but also with perceptual audio quality criterions which are defined in the recommendation report, ITU-R BS.1387 of International Telecommunication Union (ITU). In this study, it has been shown that source decomposition performance of the NTF modelling on audio data mixed under different conditions, is superior to the nonnegative matrix factorization (NMF). Furthermore, it has been observed that some of the decomposed sources are acceptable according to Amari index while thay are not with respect to the perceptual quality criteria thus it can be concluded that the perceptual criteria is more suitable to objective quality evaluation of audio. |
Author | Keyder, M. Altug Gunsel, Bilge |
Author_xml | – sequence: 1 givenname: M. Altug surname: Keyder fullname: Keyder, M. Altug email: akeyder@gmail.com organization: İstanbul Teknik Üniversitesi, Çoğulortam İşaret İşleme ve Örüntü Tanima Laboratory Elektronik ve Haberleşme Mühendisliği Bölümü, Turkey – sequence: 2 givenname: Bilge surname: Gunsel fullname: Gunsel, Bilge email: gunselb@itu.edu.tr organization: İstanbul Teknik Üniversitesi, Çoğulortam İşaret İşleme ve Örüntü Tanima Laboratory Elektronik ve Haberleşme Mühendisliği Bölümü, Turkey |
BookMark | eNp9TsFqwkAUfKJCTZt7oZf3A8a3m3XdvVYq6sGL6Vm29RVW0iTNJkL-vkuxV-cwwzAzMAlMqrpigGdBmRBkF8fdeyaJTKZ0LrWVI0jtyggllRI2YgzJvzE0gZkUejknTWYKSdytrMqXUjxAGsKFiIQ2MrdyBvvX0ldndP3Z1xjqvv1kDNy41nW-rvBjwEOxQRcrvgvYcMybrncl_kTy3YB8dWX_V36C6ZcrA6c3fYSXzVux3s49M5-a1n-7djjd_uf301_jRETQ |
ContentType | Conference Proceeding |
DBID | 6IE 6IL CBEJK RIE RIL |
DOI | 10.1109/SIU.2008.4632692 |
DatabaseName | IEEE Electronic Library (IEL) Conference Proceedings IEEE Xplore POP ALL IEEE Xplore All Conference Proceedings IEEE Xplore IEEE Proceedings Order Plans (POP All) 1998-Present |
DatabaseTitleList | |
Database_xml | – sequence: 1 dbid: RIE name: IEEE Electronic Library (IEL) url: https://proxy.k.utb.cz/login?url=https://ieeexplore.ieee.org/ sourceTypes: Publisher |
DeliveryMethod | fulltext_linktorsrc |
Discipline | Engineering |
EISBN | 9781424419999 1424419999 |
EndPage | 4 |
ExternalDocumentID | 4632692 |
Genre | orig-research |
GroupedDBID | 6IE 6IF 6IH 6IK 6IL 6IN AAJGR AAWTH ABLEC ADZIZ ALMA_UNASSIGNED_HOLDINGS BEFXN BFFAM BGNUA BKEBE BPEOZ CBEJK CHZPO IEGSK IPLJI M43 OCL RIE RIL RNS |
ID | FETCH-ieee_primary_46326923 |
IEDL.DBID | RIE |
ISBN | 1424419980 9781424419982 |
ISSN | 2165-0608 |
IngestDate | Wed Aug 27 02:44:24 EDT 2025 |
IsPeerReviewed | false |
IsScholarly | false |
LCCN | 2007943521 |
Language | English |
LinkModel | DirectLink |
MergedId | FETCHMERGED-ieee_primary_46326923 |
ParticipantIDs | ieee_primary_4632692 |
PublicationCentury | 2000 |
PublicationDate | 2008-April |
PublicationDateYYYYMMDD | 2008-04-01 |
PublicationDate_xml | – month: 04 year: 2008 text: 2008-April |
PublicationDecade | 2000 |
PublicationTitle | 2008 IEEE 16th Signal Processing, Communication and Applications Conference |
PublicationTitleAbbrev | SIU |
PublicationYear | 2008 |
Publisher | IEEE |
Publisher_xml | – name: IEEE |
SSID | ssj0001682392 ssj0000453644 |
Score | 2.7898393 |
Snippet | In this paper, the audio blind source separation (BSS) using three dimensional nonnegative tensor factorization (3D-NTF), is realized. The audio source... |
SourceID | ieee |
SourceType | Publisher |
StartPage | 1 |
SubjectTerms | Artificial neural networks Blind source separation Communications technology Distortion measurement Indexes Signal processing Source separation |
Title | Blind audio source separation by NTF and its perceptual quality evaluation |
URI | https://ieeexplore.ieee.org/document/4632692 |
hasFullText | 1 |
inHoldings | 1 |
isFullTextHit | |
isPrint | |
link | http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwjV3PS8MwFH7MnfTij03UqeTg0XS1SWN7VRxzsCG4wW4jaVIYSju29jD_evOj7VR28JaUkr5S6Pfy8n3fA7jTiMhSqkJMYsoxlWGCeRBxLB_TmEk9IraLwnjChjM6mofzFtw3WhillCWfKc8M7Vm-zJPSlMr6lOlkI9Y_3AO9cXNaraaeolMTwiqosvUVFgXE9kQOHliIfeZHta7LyMoau6dqHtRHmH7cf3-dOZJl9bxfjVcs7gyOYVxH7OgmH15ZCC_5-mPm-N9XOoHuTuGH3hrsOoWWys7g6Ic5YQdGTzoDlYiXcpkjV-NHG-WswvMMiS2aTAeI61uWxQatHEGm5J_I6TS3aGck3oXe4GX6PMQmssXKGVwsqqDIObSzPFMXgDhRKjAb2kSnMNKnnMeCpnEapiKhofAvobNvhav9l3tw6GgXhgBzDe1iXaobje2FuLUf9RuHkKCL |
linkProvider | IEEE |
linkToHtml | http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwjV3PT4MwFH5Z5kG9-GMz6vzRg0dhCKXSq0ayzY2YyJLdSKElWVxgcXCYf72lZUzNDt4KIeU1JP0er9_3PYA7iYgkxcI1HIqZgbmbGMz2mMEfU0q4HDmqi8IkIIMpHs3cWQvuGy2MEEKRz4RZDdVZPs-TsiqV9TGRyQaVG-6exH1MtVqrqajI5MQhNVipCgvxbEd1RbYfiGtYxPI2yq5KWNYYPtXX9uYQ06L99-FU0yzrN_5qvaKQxz-CySZmTTj5MMsiNpOvP3aO_13UMXS3Gj_01qDXCbREdgqHP-wJOzB6kjkoR6zk8xzpKj9aCW0WnmcoXqMg9BGTj8yLFVpqikzJFkgrNddoayXehZ7_Ej4PjCqyaKktLqI6KOcM2lmeiXNAzBHCrn5pE5nEcAszRmOc0tRN4wS7sXUBnV0zXO6-fQv7g3AyjsbD4LUHB5qEUdFhrqBdfJbiWiJ9Ed-oD_wNKwWj2w |
openUrl | ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Abook&rft.genre=proceeding&rft.title=2008+IEEE+16th+Signal+Processing%2C+Communication+and+Applications+Conference&rft.atitle=Blind+audio+source+separation+by+NTF+and+its+perceptual+quality+evaluation&rft.au=Keyder%2C+M.+Altug&rft.au=Gunsel%2C+Bilge&rft.date=2008-04-01&rft.pub=IEEE&rft.isbn=9781424419982&rft.issn=2165-0608&rft.spage=1&rft.epage=4&rft_id=info:doi/10.1109%2FSIU.2008.4632692&rft.externalDocID=4632692 |
thumbnail_l | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/lc.gif&issn=2165-0608&client=summon |
thumbnail_m | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/mc.gif&issn=2165-0608&client=summon |
thumbnail_s | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/sc.gif&issn=2165-0608&client=summon |