Sound event detection learning

An apparatus includes a processor configured to receive audio data samples and provide the audio data samples to a first neural network to generate a first output corresponding to a first set of sound categories. The processor is further configured to provide the audio data samples to a second neura...

Full description

Saved in:

Bibliographic Details
Main Authors	GUO YONGHONG, VISSER ERIK, SAGI FIRAS, XU ERIC
Format	Patent
Language	Chinese English
Published	08.07.2022
Subjects	ACOUSTICS MUSICAL INSTRUMENTS PHYSICS SPEECH ANALYSIS OR SYNTHESIS SPEECH OR AUDIO CODING OR DECODING SPEECH OR VOICE PROCESSING SPEECH RECOGNITION
Online Access	Get full text

Cover

Loading…

Abstract	An apparatus includes a processor configured to receive audio data samples and provide the audio data samples to a first neural network to generate a first output corresponding to a first set of sound categories. The processor is further configured to provide the audio data samples to a second neural network to generate a second output corresponding to a second set of sound categories. The second class count of the second set of sound classes is greater than the first class count of the first set of sound classes. The processor is further configured to provide the first output to the neural adapter to generate a third output corresponding to the second set of sound categories. The processor is further configured to provide the second output and the third output to the merge adapter to generate sound event identification data based on the audio data samples. 一种设备，包括处理器，该处理器被配置为接收音频数据样本并将音频数据样本提供给第一神经网络以生成对应于第一组声音类别的第一输出。处理器还被配置为将音频数据样本提供给第二神经网络以生成对应于第二组声音类别的第二输出。第二组声音类别的第二类别计数大于第一组声音类别的第一类别计数。处理器还被配置为将第一输出提供给神
AbstractList	An apparatus includes a processor configured to receive audio data samples and provide the audio data samples to a first neural network to generate a first output corresponding to a first set of sound categories. The processor is further configured to provide the audio data samples to a second neural network to generate a second output corresponding to a second set of sound categories. The second class count of the second set of sound classes is greater than the first class count of the first set of sound classes. The processor is further configured to provide the first output to the neural adapter to generate a third output corresponding to the second set of sound categories. The processor is further configured to provide the second output and the third output to the merge adapter to generate sound event identification data based on the audio data samples. 一种设备，包括处理器，该处理器被配置为接收音频数据样本并将音频数据样本提供给第一神经网络以生成对应于第一组声音类别的第一输出。处理器还被配置为将音频数据样本提供给第二神经网络以生成对应于第二组声音类别的第二输出。第二组声音类别的第二类别计数大于第一组声音类别的第一类别计数。处理器还被配置为将第一输出提供给神
Author	SAGI FIRAS XU ERIC VISSER ERIK GUO YONGHONG
Author_xml	– fullname: GUO YONGHONG – fullname: VISSER ERIK – fullname: SAGI FIRAS – fullname: XU ERIC
BookMark	eNrjYmDJy89L5WSQC84vzUtRSC1LzStRSEktSU0uyczPU8hJTSzKy8xL52FgTUvMKU7lhdLcDIpuriHOHrqpBfnxqcUFicmpeakl8c5-hoYm5sYGpmZmjsbEqAEA2-EmXw
ContentType	Patent
DBID	EVB
DatabaseName	esp@cenet
DatabaseTitleList
Database_xml	– sequence: 1 dbid: EVB name: esp@cenet url: http://worldwide.espacenet.com/singleLineSearch?locale=en_EP sourceTypes: Open Access Repository
DeliveryMethod	fulltext_linktorsrc
Discipline	Medicine Chemistry Sciences Physics
DocumentTitleAlternate	声音事件检测学习
ExternalDocumentID	CN114730566A
GroupedDBID	EVB
ID	FETCH-epo_espacenet_CN114730566A3
IEDL.DBID	EVB
IngestDate	Fri Jul 19 13:10:53 EDT 2024
IsOpenAccess	true
IsPeerReviewed	false
IsScholarly	false
Language	Chinese English
LinkModel	DirectLink
MergedId	FETCHMERGED-epo_espacenet_CN114730566A3
Notes	Application Number: CN202080078739
OpenAccessLink	https://worldwide.espacenet.com/publicationDetails/biblio?FT=D&date=20220708&DB=EPODOC&CC=CN&NR=114730566A
ParticipantIDs	epo_espacenet_CN114730566A
PublicationCentury	2000
PublicationDate	20220708
PublicationDateYYYYMMDD	2022-07-08
PublicationDate_xml	– month: 07 year: 2022 text: 20220708 day: 08
PublicationDecade	2020
PublicationYear	2022
RelatedCompanies	QUALCOMM INCORPORATED
RelatedCompanies_xml	– name: QUALCOMM INCORPORATED
Score	3.5403454
Snippet	An apparatus includes a processor configured to receive audio data samples and provide the audio data samples to a first neural network to generate a first...
SourceID	epo
SourceType	Open Access Repository
SubjectTerms	ACOUSTICS MUSICAL INSTRUMENTS PHYSICS SPEECH ANALYSIS OR SYNTHESIS SPEECH OR AUDIO CODING OR DECODING SPEECH OR VOICE PROCESSING SPEECH RECOGNITION
Title	Sound event detection learning
URI	https://worldwide.espacenet.com/publicationDetails/biblio?FT=D&date=20220708&DB=EPODOC&locale=&CC=CN&NR=114730566A
hasFullText	1
inHoldings	1
isFullTextHit
isPrint
link	http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwfR1dS8Mw8Jjz802rolNHBelbcOvX2ociLm0ZwrqhU_Y2miad-jCHjQj-ei-hc77o6wUuyXGX-8h9AFx1wjJA5uWktIVNXM5dkvshI7xgAWes5zEdyh5m_uDRvZt60wa8rmphdJ_QT90cESWqQHmX-r1eroNYsc6trK7ZC4LebtJJFFu1d2zbyMGBFfejZDyKR9SiNKKZld1HaPb3lLXs327ApjKjVZ_95KmvqlKWv1VKug9bY8S2kAfQ-Ho2YJeuJq8ZsDOsP7wN2NYZmkWFwFoKq0NoP6hZSKbuvWRyIXU21cKsB0DMj-AyTSZ0QHDL2c_9ZjRbn845hib6_eIETLRluoXjh4LlDJ22bu6UHU_NBnf9Qng2O4XW33ha_y2ewZ6ilc46Dc6hKd8_xAXqVsnamijfNcF8ow
link.rule.ids	230,309,786,891,25594,76903
linkProvider	European Patent Office
linkToHtml	http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwY2BQMbBMswAm3hTdNKNUI12TlBQT3UQzyyTdlOQki5SkJHPTJPBQtq-fmUeoiVeEaQQTQxZsLwz4nNBy8OGIwByVDMzvJeDyugAxiOUCXltZrJ-UCRTKt3cLsXVRg_aOjYyAKdhCzcXJ1jXA38XfWc3Z2dbZT80vyBbY7DcHtZbNHJkZWM2BXUJwVynMCbQrpQC5SnETZGALAJqWVyLEwFSVIczA6Qy7eU2YgcMXOuEtzMAOXqGZXAwUhObCYhEGuWDQXUgK4LOXFFJSS8CrqfIUoBdApIsyKLq5hjh76AKtjIf7L97ZD-E6YzEGFmC_P1WCQQHYljFMNjazTE1KTAJ22gwTjdMMTEF3g5uYJaeaGiVJMkjhNkcKn6Q8A6dHiK9PvI-nn7c0Axco3MArUC1kGFhKikpTZYH1bEmSHDiAAC5ff40
openUrl	ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Apatent&rft.title=Sound+event+detection+learning&rft.inventor=GUO+YONGHONG&rft.inventor=VISSER+ERIK&rft.inventor=SAGI+FIRAS&rft.inventor=XU+ERIC&rft.date=2022-07-08&rft.externalDBID=A&rft.externalDocID=CN114730566A