Sound Recognition in Mixtures

In this paper, we describe a method for recognizing sound sources in a mixture. While many audio-based content analysis methods focus on detecting or classifying target sounds in a discriminative manner, we approach this as a regression problem, in which we estimate the relative proportions of sound...

Full description

Saved in:

Bibliographic Details
Published in	Latent Variable Analysis and Signal Separation pp. 405 - 413
Main Authors	Nam, Juhan, Mysore, Gautham J., Smaragdis, Paris
Format	Book Chapter
Language	English
Published	Berlin, Heidelberg Springer Berlin Heidelberg 2012
Series	Lecture Notes in Computer Science
Subjects	Relative Proportion Single Source Sound Source Transition Matrice Transition Matrix
Online Access	Get full text

Cover

Loading…

Abstract	In this paper, we describe a method for recognizing sound sources in a mixture. While many audio-based content analysis methods focus on detecting or classifying target sounds in a discriminative manner, we approach this as a regression problem, in which we estimate the relative proportions of sound sources in the given mixture. Using source separation ideas based on probabilistic latent component analysis, we directly estimate these proportions from the mixture without actually separating the sources. We also introduce a method for learning a transition matrix to temporally constrain the problem. We demonstrate the proposed method on a mixture of five classes of sounds and show that it is quite effective in correctly estimating the relative proportions of the sounds in the mixture.
AbstractList	In this paper, we describe a method for recognizing sound sources in a mixture. While many audio-based content analysis methods focus on detecting or classifying target sounds in a discriminative manner, we approach this as a regression problem, in which we estimate the relative proportions of sound sources in the given mixture. Using source separation ideas based on probabilistic latent component analysis, we directly estimate these proportions from the mixture without actually separating the sources. We also introduce a method for learning a transition matrix to temporally constrain the problem. We demonstrate the proposed method on a mixture of five classes of sounds and show that it is quite effective in correctly estimating the relative proportions of the sounds in the mixture.
Author	Nam, Juhan Smaragdis, Paris Mysore, Gautham J.
Author_xml	– sequence: 1 givenname: Juhan surname: Nam fullname: Nam, Juhan organization: Center for Computer Research in Music and Acoustics, Stanford University, USA – sequence: 2 givenname: Gautham J. surname: Mysore fullname: Mysore, Gautham J. organization: Advanced Technology Labs, Adobe Systems Inc., USA – sequence: 3 givenname: Paris surname: Smaragdis fullname: Smaragdis, Paris organization: University of Illinois at Urbana-Champaign, USA
BookMark	eNpVkFtLAzEQhaNWcK37DxT2D0RnMtls8ijFG1QEL89hL7NlVRJpWvDnm1ZfPC8D38A5nHMqZiEGFuIc4RIBmivXWEnSaCWVrWuUxtdwIMqMKcM9M4eiQIMoibQ7-vcDNxMFECjpGk0nokzpHbKMMWh1IS5e4jYM1TP3cRWmzRRDNYXqcfrebNeczsTx2H4mLv_uXLzd3rwu7uXy6e5hcb2UCZFAsnJd35o2ZzJ0tRmcGpXpeWCyNdgOdY394ECBYtZkFTS2VXZEHNjpUdNcqF_f9LWeworXvovxI3kEv9vA50KefLb3-75-twH9AKC5Sco
ContentType	Book Chapter
Copyright	Springer-Verlag Berlin Heidelberg 2012
Copyright_xml	– notice: Springer-Verlag Berlin Heidelberg 2012
DOI	10.1007/978-3-642-28551-6_50
DatabaseTitleList
DeliveryMethod	fulltext_linktorsrc
Discipline	Engineering Computer Science
EISBN	9783642285516 3642285511
EISSN	1611-3349
Editor	Yeredor, Arie Theis, Fabian Zibulevsky, Michael Cichocki, Andrzej
Editor_xml	– sequence: 1 givenname: Fabian surname: Theis fullname: Theis, Fabian email: fabian.theis@helmholtz-muenchen.de – sequence: 2 givenname: Andrzej surname: Cichocki fullname: Cichocki, Andrzej email: a.cichocki@riken.jp – sequence: 3 givenname: Arie surname: Yeredor fullname: Yeredor, Arie email: arie@eng.tau.ac.il – sequence: 4 givenname: Michael surname: Zibulevsky fullname: Zibulevsky, Michael email: mzib@cs.technion.ac.il
EndPage	413
GroupedDBID	-DT -GH -~X 1SB 29L 2HA 2HV 5QI 875 AASHB ABMNI ACGFS ADCXD AEFIE ALMA_UNASSIGNED_HOLDINGS EJD F5P FEDTE HVGLF LAS LDH P2P RIG RNI RSU SVGTG VI1 ~02
ID	FETCH-LOGICAL-s1130-e29bca6a642e0b56d92f26cede38508b1451cd90202ee4382078a28f11de94f43
ISBN	9783642285509 3642285503
ISSN	0302-9743
IngestDate	Wed Nov 06 06:25:34 EST 2024
IsPeerReviewed	true
IsScholarly	true
Language	English
LinkModel	OpenURL
MergedId	FETCHMERGED-LOGICAL-s1130-e29bca6a642e0b56d92f26cede38508b1451cd90202ee4382078a28f11de94f43
PageCount	9
ParticipantIDs	springer_books_10_1007_978_3_642_28551_6_50
PublicationCentury	2000
PublicationDate	2012
PublicationDateYYYYMMDD	2012-01-01
PublicationDate_xml	– year: 2012 text: 2012
PublicationDecade	2010
PublicationPlace	Berlin, Heidelberg
PublicationPlace_xml	– name: Berlin, Heidelberg
PublicationSeriesTitle	Lecture Notes in Computer Science
PublicationSubtitle	10th International Conference, LVA/ICA 2012, Tel Aviv, Israel, March 12-15, 2012. Proceedings
PublicationTitle	Latent Variable Analysis and Signal Separation
PublicationYear	2012
Publisher	Springer Berlin Heidelberg
Publisher_xml	– name: Springer Berlin Heidelberg
RelatedPersons	Kleinberg, Jon M. Mattern, Friedemann Nierstrasz, Oscar Steffen, Bernhard Kittler, Josef Vardi, Moshe Y. Weikum, Gerhard Sudan, Madhu Naor, Moni Mitchell, John C. Terzopoulos, Demetri Pandu Rangan, C. Kanade, Takeo Hutchison, David Tygar, Doug
RelatedPersons_xml	– sequence: 1 givenname: David surname: Hutchison fullname: Hutchison, David organization: Lancaster University, Lancaster, UK – sequence: 2 givenname: Takeo surname: Kanade fullname: Kanade, Takeo organization: Carnegie Mellon University, Pittsburgh, USA – sequence: 3 givenname: Josef surname: Kittler fullname: Kittler, Josef organization: University of Surrey, Guildford, UK – sequence: 4 givenname: Jon M. surname: Kleinberg fullname: Kleinberg, Jon M. organization: Cornell University, Ithaca, USA – sequence: 5 givenname: Friedemann surname: Mattern fullname: Mattern, Friedemann organization: ETH Zurich, Zurich, Switzerland – sequence: 6 givenname: John C. surname: Mitchell fullname: Mitchell, John C. organization: Stanford University, Stanford, USA – sequence: 7 givenname: Moni surname: Naor fullname: Naor, Moni organization: Weizmann Institute of Science, Rehovot, Israel – sequence: 8 givenname: Oscar surname: Nierstrasz fullname: Nierstrasz, Oscar organization: University of Bern, Bern, Switzerland – sequence: 9 givenname: C. surname: Pandu Rangan fullname: Pandu Rangan, C. organization: Indian Institute of Technology, Madras, India – sequence: 10 givenname: Bernhard surname: Steffen fullname: Steffen, Bernhard organization: University of Dortmund, Dortmund, Germany – sequence: 11 givenname: Madhu surname: Sudan fullname: Sudan, Madhu organization: Massachusetts Institute of Technology, USA – sequence: 12 givenname: Demetri surname: Terzopoulos fullname: Terzopoulos, Demetri organization: University of California, Los Angeles, USA – sequence: 13 givenname: Doug surname: Tygar fullname: Tygar, Doug organization: University of California, Berkeley, USA – sequence: 14 givenname: Moshe Y. surname: Vardi fullname: Vardi, Moshe Y. organization: Rice University, Houston, USA – sequence: 15 givenname: Gerhard surname: Weikum fullname: Weikum, Gerhard organization: Max-Planck Institute of Computer Science, Saarbrücken, Germany
SSID	ssj0000666184 ssj0002792
Score	1.7159984
Snippet	In this paper, we describe a method for recognizing sound sources in a mixture. While many audio-based content analysis methods focus on detecting or...
SourceID	springer
SourceType	Publisher
StartPage	405
SubjectTerms	Relative Proportion Single Source Sound Source Transition Matrice Transition Matrix
Title	Sound Recognition in Mixtures
URI	http://link.springer.com/10.1007/978-3-642-28551-6_50
hasFullText	1
inHoldings	1
isFullTextHit
isPrint
link	http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwnV1JT4QwFG5cLurBPe7h4G2CGWiL9ODBGJdMnLm4xBsBWpSDYyKYqL_e97oMdYmJXgghBNr3yuNt31dC9mGJpCWYxFDmvAxZzuGbY-IwLCSXMaeyLxWCk4ej5OKGDe74XQch0OiStjgo33_ElfxHq3AN9Ioo2T9odvJQuADnoF84gobh-MX5_ZxmtejlFuv4txDsavjThF5EN2PW9-hmXilD7d3V2kcOFv3QLYvhW2O7bc_zl_Yhf-wNDrrMCzzgXtaWz_-5bvxFdoWbMqHraZqQTNvksH7FsoS5EeWgmqNLW6oYPbW6A6zndpNwxsXPPug2Dj_74LKPvV_IuTRQBJnGIBgSnnmjYIshmjHmTRnzmyCpIjUkptaksj73_s7MIFe_GX6_1yNBxBG8LQqTDNM504cCbN_s8eng8naSf8O4LUKMrv1rI5GiqTiZUSEOyI2aGqambhYeBvOnV36rqmtn5XqJLCCAJUBkCQh4mUyp8QpZdAIPrMBXyLzHRblKdrUuA0-XQT0OnC7XyM3Z6fXJRWj3zgibCNySUMWiKPMkh4GpfsETKeIqTkolFU3BJy9wg-ZSCggWYqWwGAyuYh6nVRRJJVjF6DqZGT-N1QYJZIXVuYrHEkmei1SknDEBYWvFElrSeJP03Gwz_BqazFFhg2wymsEQMi2bDGWz9ae7t8lct-p2yEz7_KJ2wQtsiz2r0A_uzE9x
link.rule.ids	782,783,787,796,27937
linkProvider	Library Specific Holdings
openUrl	ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Abook&rft.genre=bookitem&rft.title=Latent+Variable+Analysis+and+Signal+Separation&rft.au=Nam%2C+Juhan&rft.au=Mysore%2C+Gautham+J.&rft.au=Smaragdis%2C+Paris&rft.atitle=Sound+Recognition+in+Mixtures&rft.series=Lecture+Notes+in+Computer+Science&rft.date=2012-01-01&rft.pub=Springer+Berlin+Heidelberg&rft.isbn=9783642285509&rft.issn=0302-9743&rft.eissn=1611-3349&rft.spage=405&rft.epage=413&rft_id=info:doi/10.1007%2F978-3-642-28551-6_50
thumbnail_l	http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/lc.gif&issn=0302-9743&client=summon
thumbnail_m	http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/mc.gif&issn=0302-9743&client=summon
thumbnail_s	http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/sc.gif&issn=0302-9743&client=summon