Blind audio source separation by NTF and its perceptual quality evaluation

In this paper, the audio blind source separation (BSS) using three dimensional nonnegative tensor factorization (3D-NTF), is realized. The audio source separation is modeled as an optimization problem and the β-divergence cost function is iteratively optimized by alternating multiplicative update ru...

Full description

Saved in:

Bibliographic Details
Published in	2008 IEEE 16th Signal Processing, Communication and Applications Conference pp. 1 - 4
Main Authors	Keyder, M. Altug, Gunsel, Bilge
Format	Conference Proceeding
Language	English
Published	IEEE 01.04.2008
Subjects	Artificial neural networks Blind source separation Communications technology Distortion measurement Indexes Signal processing Source separation
Online Access	Get full text
ISBN	1424419980 9781424419982
ISSN	2165-0608
DOI	10.1109/SIU.2008.4632692

Cover

Loading…

Abstract	In this paper, the audio blind source separation (BSS) using three dimensional nonnegative tensor factorization (3D-NTF), is realized. The audio source separation is modeled as an optimization problem and the β-divergence cost function is iteratively optimized by alternating multiplicative update rules. The traditional measures which are used to evaluate the decomposition performance are known to be not informative about perceptual quality of the audio signals. Therefore performance of the designed system is evaluated not only with the well known Amari index, but also with perceptual audio quality criterions which are defined in the recommendation report, ITU-R BS.1387 of International Telecommunication Union (ITU). In this study, it has been shown that source decomposition performance of the NTF modelling on audio data mixed under different conditions, is superior to the nonnegative matrix factorization (NMF). Furthermore, it has been observed that some of the decomposed sources are acceptable according to Amari index while thay are not with respect to the perceptual quality criteria thus it can be concluded that the perceptual criteria is more suitable to objective quality evaluation of audio.
AbstractList	In this paper, the audio blind source separation (BSS) using three dimensional nonnegative tensor factorization (3D-NTF), is realized. The audio source separation is modeled as an optimization problem and the β-divergence cost function is iteratively optimized by alternating multiplicative update rules. The traditional measures which are used to evaluate the decomposition performance are known to be not informative about perceptual quality of the audio signals. Therefore performance of the designed system is evaluated not only with the well known Amari index, but also with perceptual audio quality criterions which are defined in the recommendation report, ITU-R BS.1387 of International Telecommunication Union (ITU). In this study, it has been shown that source decomposition performance of the NTF modelling on audio data mixed under different conditions, is superior to the nonnegative matrix factorization (NMF). Furthermore, it has been observed that some of the decomposed sources are acceptable according to Amari index while thay are not with respect to the perceptual quality criteria thus it can be concluded that the perceptual criteria is more suitable to objective quality evaluation of audio.
Author	Keyder, M. Altug Gunsel, Bilge
Author_xml	– sequence: 1 givenname: M. Altug surname: Keyder fullname: Keyder, M. Altug email: akeyder@gmail.com organization: İstanbul Teknik Üniversitesi, Çoğulortam İşaret İşleme ve Örüntü Tanima Laboratory Elektronik ve Haberleşme Mühendisliği Bölümü, Turkey – sequence: 2 givenname: Bilge surname: Gunsel fullname: Gunsel, Bilge email: gunselb@itu.edu.tr organization: İstanbul Teknik Üniversitesi, Çoğulortam İşaret İşleme ve Örüntü Tanima Laboratory Elektronik ve Haberleşme Mühendisliği Bölümü, Turkey
BookMark	eNp9TsFqwkAUfKJCTZt7oZf3A8a3m3XdvVYq6sGL6Vm29RVW0iTNJkL-vkuxV-cwwzAzMAlMqrpigGdBmRBkF8fdeyaJTKZ0LrWVI0jtyggllRI2YgzJvzE0gZkUejknTWYKSdytrMqXUjxAGsKFiIQ2MrdyBvvX0ldndP3Z1xjqvv1kDNy41nW-rvBjwEOxQRcrvgvYcMybrncl_kTy3YB8dWX_V36C6ZcrA6c3fYSXzVux3s49M5-a1n-7djjd_uf301_jRETQ
ContentType	Conference Proceeding
DBID	6IE 6IL CBEJK RIE RIL
DOI	10.1109/SIU.2008.4632692
DatabaseName	IEEE Electronic Library (IEL) Conference Proceedings IEEE Xplore POP ALL IEEE Xplore All Conference Proceedings IEEE Xplore IEEE Proceedings Order Plans (POP All) 1998-Present
DatabaseTitleList
Database_xml	– sequence: 1 dbid: RIE name: IEEE Electronic Library (IEL) url: https://proxy.k.utb.cz/login?url=https://ieeexplore.ieee.org/ sourceTypes: Publisher
DeliveryMethod	fulltext_linktorsrc
Discipline	Engineering
EISBN	9781424419999 1424419999
EndPage	4
ExternalDocumentID	4632692
Genre	orig-research
GroupedDBID	6IE 6IF 6IH 6IK 6IL 6IN AAJGR AAWTH ABLEC ADZIZ ALMA_UNASSIGNED_HOLDINGS BEFXN BFFAM BGNUA BKEBE BPEOZ CBEJK CHZPO IEGSK IPLJI M43 OCL RIE RIL RNS
ID	FETCH-ieee_primary_46326923
IEDL.DBID	RIE
ISBN	1424419980 9781424419982
ISSN	2165-0608
IngestDate	Wed Aug 27 02:44:24 EDT 2025
IsPeerReviewed	false
IsScholarly	false
LCCN	2007943521
Language	English
LinkModel	DirectLink
MergedId	FETCHMERGED-ieee_primary_46326923
ParticipantIDs	ieee_primary_4632692
PublicationCentury	2000
PublicationDate	2008-April
PublicationDateYYYYMMDD	2008-04-01
PublicationDate_xml	– month: 04 year: 2008 text: 2008-April
PublicationDecade	2000
PublicationTitle	2008 IEEE 16th Signal Processing, Communication and Applications Conference
PublicationTitleAbbrev	SIU
PublicationYear	2008
Publisher	IEEE
Publisher_xml	– name: IEEE
SSID	ssj0001682392 ssj0000453644
Score	2.7898393
Snippet	In this paper, the audio blind source separation (BSS) using three dimensional nonnegative tensor factorization (3D-NTF), is realized. The audio source...
SourceID	ieee
SourceType	Publisher
StartPage	1
SubjectTerms	Artificial neural networks Blind source separation Communications technology Distortion measurement Indexes Signal processing Source separation
Title	Blind audio source separation by NTF and its perceptual quality evaluation
URI	https://ieeexplore.ieee.org/document/4632692
hasFullText	1
inHoldings	1
isFullTextHit
isPrint
link	http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwjV3PS8MwFH7MnfTij03UqeTg0XS1SWN7VRxzsCG4wW4jaVIYSju29jD_evOj7VR28JaUkr5S6Pfy8n3fA7jTiMhSqkJMYsoxlWGCeRBxLB_TmEk9IraLwnjChjM6mofzFtw3WhillCWfKc8M7Vm-zJPSlMr6lOlkI9Y_3AO9cXNaraaeolMTwiqosvUVFgXE9kQOHliIfeZHta7LyMoau6dqHtRHmH7cf3-dOZJl9bxfjVcs7gyOYVxH7OgmH15ZCC_5-mPm-N9XOoHuTuGH3hrsOoWWys7g6Ic5YQdGTzoDlYiXcpkjV-NHG-WswvMMiS2aTAeI61uWxQatHEGm5J_I6TS3aGck3oXe4GX6PMQmssXKGVwsqqDIObSzPFMXgDhRKjAb2kSnMNKnnMeCpnEapiKhofAvobNvhav9l3tw6GgXhgBzDe1iXaobje2FuLUf9RuHkKCL
linkProvider	IEEE
linkToHtml	http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwjV3PT4MwFH5Z5kG9-GMz6vzRg0dhCKXSq0ayzY2YyJLdSKElWVxgcXCYf72lZUzNDt4KIeU1JP0er9_3PYA7iYgkxcI1HIqZgbmbGMz2mMEfU0q4HDmqi8IkIIMpHs3cWQvuGy2MEEKRz4RZDdVZPs-TsiqV9TGRyQaVG-6exH1MtVqrqajI5MQhNVipCgvxbEd1RbYfiGtYxPI2yq5KWNYYPtXX9uYQ06L99-FU0yzrN_5qvaKQxz-CySZmTTj5MMsiNpOvP3aO_13UMXS3Gj_01qDXCbREdgqHP-wJOzB6kjkoR6zk8xzpKj9aCW0WnmcoXqMg9BGTj8yLFVpqikzJFkgrNddoayXehZ7_Ej4PjCqyaKktLqI6KOcM2lmeiXNAzBHCrn5pE5nEcAszRmOc0tRN4wS7sXUBnV0zXO6-fQv7g3AyjsbD4LUHB5qEUdFhrqBdfJbiWiJ9Ed-oD_wNKwWj2w
openUrl	ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Abook&rft.genre=proceeding&rft.title=2008+IEEE+16th+Signal+Processing%2C+Communication+and+Applications+Conference&rft.atitle=Blind+audio+source+separation+by+NTF+and+its+perceptual+quality+evaluation&rft.au=Keyder%2C+M.+Altug&rft.au=Gunsel%2C+Bilge&rft.date=2008-04-01&rft.pub=IEEE&rft.isbn=9781424419982&rft.issn=2165-0608&rft.spage=1&rft.epage=4&rft_id=info:doi/10.1109%2FSIU.2008.4632692&rft.externalDocID=4632692
thumbnail_l	http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/lc.gif&issn=2165-0608&client=summon
thumbnail_m	http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/mc.gif&issn=2165-0608&client=summon
thumbnail_s	http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/sc.gif&issn=2165-0608&client=summon