Blind audio source separation by NTF and its perceptual quality evaluation

In this paper, the audio blind source separation (BSS) using three dimensional nonnegative tensor factorization (3D-NTF), is realized. The audio source separation is modeled as an optimization problem and the β-divergence cost function is iteratively optimized by alternating multiplicative update ru...

Full description

Saved in:
Bibliographic Details
Published in2008 IEEE 16th Signal Processing, Communication and Applications Conference pp. 1 - 4
Main Authors Keyder, M. Altug, Gunsel, Bilge
Format Conference Proceeding
LanguageEnglish
Published IEEE 01.04.2008
Subjects
Online AccessGet full text
ISBN1424419980
9781424419982
ISSN2165-0608
DOI10.1109/SIU.2008.4632692

Cover

Loading…
Abstract In this paper, the audio blind source separation (BSS) using three dimensional nonnegative tensor factorization (3D-NTF), is realized. The audio source separation is modeled as an optimization problem and the β-divergence cost function is iteratively optimized by alternating multiplicative update rules. The traditional measures which are used to evaluate the decomposition performance are known to be not informative about perceptual quality of the audio signals. Therefore performance of the designed system is evaluated not only with the well known Amari index, but also with perceptual audio quality criterions which are defined in the recommendation report, ITU-R BS.1387 of International Telecommunication Union (ITU). In this study, it has been shown that source decomposition performance of the NTF modelling on audio data mixed under different conditions, is superior to the nonnegative matrix factorization (NMF). Furthermore, it has been observed that some of the decomposed sources are acceptable according to Amari index while thay are not with respect to the perceptual quality criteria thus it can be concluded that the perceptual criteria is more suitable to objective quality evaluation of audio.
AbstractList In this paper, the audio blind source separation (BSS) using three dimensional nonnegative tensor factorization (3D-NTF), is realized. The audio source separation is modeled as an optimization problem and the β-divergence cost function is iteratively optimized by alternating multiplicative update rules. The traditional measures which are used to evaluate the decomposition performance are known to be not informative about perceptual quality of the audio signals. Therefore performance of the designed system is evaluated not only with the well known Amari index, but also with perceptual audio quality criterions which are defined in the recommendation report, ITU-R BS.1387 of International Telecommunication Union (ITU). In this study, it has been shown that source decomposition performance of the NTF modelling on audio data mixed under different conditions, is superior to the nonnegative matrix factorization (NMF). Furthermore, it has been observed that some of the decomposed sources are acceptable according to Amari index while thay are not with respect to the perceptual quality criteria thus it can be concluded that the perceptual criteria is more suitable to objective quality evaluation of audio.
Author Keyder, M. Altug
Gunsel, Bilge
Author_xml – sequence: 1
  givenname: M. Altug
  surname: Keyder
  fullname: Keyder, M. Altug
  email: akeyder@gmail.com
  organization: İstanbul Teknik Üniversitesi, Çoğulortam İşaret İşleme ve Örüntü Tanima Laboratory Elektronik ve Haberleşme Mühendisliği Bölümü, Turkey
– sequence: 2
  givenname: Bilge
  surname: Gunsel
  fullname: Gunsel, Bilge
  email: gunselb@itu.edu.tr
  organization: İstanbul Teknik Üniversitesi, Çoğulortam İşaret İşleme ve Örüntü Tanima Laboratory Elektronik ve Haberleşme Mühendisliği Bölümü, Turkey
BookMark eNp9TsFqwkAUfKJCTZt7oZf3A8a3m3XdvVYq6sGL6Vm29RVW0iTNJkL-vkuxV-cwwzAzMAlMqrpigGdBmRBkF8fdeyaJTKZ0LrWVI0jtyggllRI2YgzJvzE0gZkUejknTWYKSdytrMqXUjxAGsKFiIQ2MrdyBvvX0ldndP3Z1xjqvv1kDNy41nW-rvBjwEOxQRcrvgvYcMybrncl_kTy3YB8dWX_V36C6ZcrA6c3fYSXzVux3s49M5-a1n-7djjd_uf301_jRETQ
ContentType Conference Proceeding
DBID 6IE
6IL
CBEJK
RIE
RIL
DOI 10.1109/SIU.2008.4632692
DatabaseName IEEE Electronic Library (IEL) Conference Proceedings
IEEE Xplore POP ALL
IEEE Xplore All Conference Proceedings
IEEE Xplore
IEEE Proceedings Order Plans (POP All) 1998-Present
DatabaseTitleList
Database_xml – sequence: 1
  dbid: RIE
  name: IEEE Electronic Library (IEL)
  url: https://proxy.k.utb.cz/login?url=https://ieeexplore.ieee.org/
  sourceTypes: Publisher
DeliveryMethod fulltext_linktorsrc
Discipline Engineering
EISBN 9781424419999
1424419999
EndPage 4
ExternalDocumentID 4632692
Genre orig-research
GroupedDBID 6IE
6IF
6IH
6IK
6IL
6IN
AAJGR
AAWTH
ABLEC
ADZIZ
ALMA_UNASSIGNED_HOLDINGS
BEFXN
BFFAM
BGNUA
BKEBE
BPEOZ
CBEJK
CHZPO
IEGSK
IPLJI
M43
OCL
RIE
RIL
RNS
ID FETCH-ieee_primary_46326923
IEDL.DBID RIE
ISBN 1424419980
9781424419982
ISSN 2165-0608
IngestDate Wed Aug 27 02:44:24 EDT 2025
IsPeerReviewed false
IsScholarly false
LCCN 2007943521
Language English
LinkModel DirectLink
MergedId FETCHMERGED-ieee_primary_46326923
ParticipantIDs ieee_primary_4632692
PublicationCentury 2000
PublicationDate 2008-April
PublicationDateYYYYMMDD 2008-04-01
PublicationDate_xml – month: 04
  year: 2008
  text: 2008-April
PublicationDecade 2000
PublicationTitle 2008 IEEE 16th Signal Processing, Communication and Applications Conference
PublicationTitleAbbrev SIU
PublicationYear 2008
Publisher IEEE
Publisher_xml – name: IEEE
SSID ssj0001682392
ssj0000453644
Score 2.7898393
Snippet In this paper, the audio blind source separation (BSS) using three dimensional nonnegative tensor factorization (3D-NTF), is realized. The audio source...
SourceID ieee
SourceType Publisher
StartPage 1
SubjectTerms Artificial neural networks
Blind source separation
Communications technology
Distortion measurement
Indexes
Signal processing
Source separation
Title Blind audio source separation by NTF and its perceptual quality evaluation
URI https://ieeexplore.ieee.org/document/4632692
hasFullText 1
inHoldings 1
isFullTextHit
isPrint
link http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwjV3PS8MwFH7MnfTij03UqeTg0XS1SWN7VRxzsCG4wW4jaVIYSju29jD_evOj7VR28JaUkr5S6Pfy8n3fA7jTiMhSqkJMYsoxlWGCeRBxLB_TmEk9IraLwnjChjM6mofzFtw3WhillCWfKc8M7Vm-zJPSlMr6lOlkI9Y_3AO9cXNaraaeolMTwiqosvUVFgXE9kQOHliIfeZHta7LyMoau6dqHtRHmH7cf3-dOZJl9bxfjVcs7gyOYVxH7OgmH15ZCC_5-mPm-N9XOoHuTuGH3hrsOoWWys7g6Ic5YQdGTzoDlYiXcpkjV-NHG-WswvMMiS2aTAeI61uWxQatHEGm5J_I6TS3aGck3oXe4GX6PMQmssXKGVwsqqDIObSzPFMXgDhRKjAb2kSnMNKnnMeCpnEapiKhofAvobNvhav9l3tw6GgXhgBzDe1iXaobje2FuLUf9RuHkKCL
linkProvider IEEE
linkToHtml http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwjV3PT4MwFH5Z5kG9-GMz6vzRg0dhCKXSq0ayzY2YyJLdSKElWVxgcXCYf72lZUzNDt4KIeU1JP0er9_3PYA7iYgkxcI1HIqZgbmbGMz2mMEfU0q4HDmqi8IkIIMpHs3cWQvuGy2MEEKRz4RZDdVZPs-TsiqV9TGRyQaVG-6exH1MtVqrqajI5MQhNVipCgvxbEd1RbYfiGtYxPI2yq5KWNYYPtXX9uYQ06L99-FU0yzrN_5qvaKQxz-CySZmTTj5MMsiNpOvP3aO_13UMXS3Gj_01qDXCbREdgqHP-wJOzB6kjkoR6zk8xzpKj9aCW0WnmcoXqMg9BGTj8yLFVpqikzJFkgrNddoayXehZ7_Ej4PjCqyaKktLqI6KOcM2lmeiXNAzBHCrn5pE5nEcAszRmOc0tRN4wS7sXUBnV0zXO6-fQv7g3AyjsbD4LUHB5qEUdFhrqBdfJbiWiJ9Ed-oD_wNKwWj2w
openUrl ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Abook&rft.genre=proceeding&rft.title=2008+IEEE+16th+Signal+Processing%2C+Communication+and+Applications+Conference&rft.atitle=Blind+audio+source+separation+by+NTF+and+its+perceptual+quality+evaluation&rft.au=Keyder%2C+M.+Altug&rft.au=Gunsel%2C+Bilge&rft.date=2008-04-01&rft.pub=IEEE&rft.isbn=9781424419982&rft.issn=2165-0608&rft.spage=1&rft.epage=4&rft_id=info:doi/10.1109%2FSIU.2008.4632692&rft.externalDocID=4632692
thumbnail_l http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/lc.gif&issn=2165-0608&client=summon
thumbnail_m http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/mc.gif&issn=2165-0608&client=summon
thumbnail_s http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/sc.gif&issn=2165-0608&client=summon