Sound event detection learning

An apparatus includes a processor configured to receive audio data samples and provide the audio data samples to a first neural network to generate a first output corresponding to a first set of sound categories. The processor is further configured to provide the audio data samples to a second neura...

Full description

Saved in:
Bibliographic Details
Main Authors GUO YONGHONG, VISSER ERIK, SAGI FIRAS, XU ERIC
Format Patent
LanguageChinese
English
Published 08.07.2022
Subjects
Online AccessGet full text

Cover

Loading…
Abstract An apparatus includes a processor configured to receive audio data samples and provide the audio data samples to a first neural network to generate a first output corresponding to a first set of sound categories. The processor is further configured to provide the audio data samples to a second neural network to generate a second output corresponding to a second set of sound categories. The second class count of the second set of sound classes is greater than the first class count of the first set of sound classes. The processor is further configured to provide the first output to the neural adapter to generate a third output corresponding to the second set of sound categories. The processor is further configured to provide the second output and the third output to the merge adapter to generate sound event identification data based on the audio data samples. 一种设备,包括处理器,该处理器被配置为接收音频数据样本并将音频数据样本提供给第一神经网络以生成对应于第一组声音类别的第一输出。处理器还被配置为将音频数据样本提供给第二神经网络以生成对应于第二组声音类别的第二输出。第二组声音类别的第二类别计数大于第一组声音类别的第一类别计数。处理器还被配置为将第一输出提供给神
AbstractList An apparatus includes a processor configured to receive audio data samples and provide the audio data samples to a first neural network to generate a first output corresponding to a first set of sound categories. The processor is further configured to provide the audio data samples to a second neural network to generate a second output corresponding to a second set of sound categories. The second class count of the second set of sound classes is greater than the first class count of the first set of sound classes. The processor is further configured to provide the first output to the neural adapter to generate a third output corresponding to the second set of sound categories. The processor is further configured to provide the second output and the third output to the merge adapter to generate sound event identification data based on the audio data samples. 一种设备,包括处理器,该处理器被配置为接收音频数据样本并将音频数据样本提供给第一神经网络以生成对应于第一组声音类别的第一输出。处理器还被配置为将音频数据样本提供给第二神经网络以生成对应于第二组声音类别的第二输出。第二组声音类别的第二类别计数大于第一组声音类别的第一类别计数。处理器还被配置为将第一输出提供给神
Author SAGI FIRAS
XU ERIC
VISSER ERIK
GUO YONGHONG
Author_xml – fullname: GUO YONGHONG
– fullname: VISSER ERIK
– fullname: SAGI FIRAS
– fullname: XU ERIC
BookMark eNrjYmDJy89L5WSQC84vzUtRSC1LzStRSEktSU0uyczPU8hJTSzKy8xL52FgTUvMKU7lhdLcDIpuriHOHrqpBfnxqcUFicmpeakl8c5-hoYm5sYGpmZmjsbEqAEA2-EmXw
ContentType Patent
DBID EVB
DatabaseName esp@cenet
DatabaseTitleList
Database_xml – sequence: 1
  dbid: EVB
  name: esp@cenet
  url: http://worldwide.espacenet.com/singleLineSearch?locale=en_EP
  sourceTypes: Open Access Repository
DeliveryMethod fulltext_linktorsrc
Discipline Medicine
Chemistry
Sciences
Physics
DocumentTitleAlternate 声音事件检测学习
ExternalDocumentID CN114730566A
GroupedDBID EVB
ID FETCH-epo_espacenet_CN114730566A3
IEDL.DBID EVB
IngestDate Fri Jul 19 13:10:53 EDT 2024
IsOpenAccess true
IsPeerReviewed false
IsScholarly false
Language Chinese
English
LinkModel DirectLink
MergedId FETCHMERGED-epo_espacenet_CN114730566A3
Notes Application Number: CN202080078739
OpenAccessLink https://worldwide.espacenet.com/publicationDetails/biblio?FT=D&date=20220708&DB=EPODOC&CC=CN&NR=114730566A
ParticipantIDs epo_espacenet_CN114730566A
PublicationCentury 2000
PublicationDate 20220708
PublicationDateYYYYMMDD 2022-07-08
PublicationDate_xml – month: 07
  year: 2022
  text: 20220708
  day: 08
PublicationDecade 2020
PublicationYear 2022
RelatedCompanies QUALCOMM INCORPORATED
RelatedCompanies_xml – name: QUALCOMM INCORPORATED
Score 3.5403454
Snippet An apparatus includes a processor configured to receive audio data samples and provide the audio data samples to a first neural network to generate a first...
SourceID epo
SourceType Open Access Repository
SubjectTerms ACOUSTICS
MUSICAL INSTRUMENTS
PHYSICS
SPEECH ANALYSIS OR SYNTHESIS
SPEECH OR AUDIO CODING OR DECODING
SPEECH OR VOICE PROCESSING
SPEECH RECOGNITION
Title Sound event detection learning
URI https://worldwide.espacenet.com/publicationDetails/biblio?FT=D&date=20220708&DB=EPODOC&locale=&CC=CN&NR=114730566A
hasFullText 1
inHoldings 1
isFullTextHit
isPrint
link http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwfR1dS8Mw8Jjz802rolNHBelbcOvX2ociLm0ZwrqhU_Y2miad-jCHjQj-ei-hc77o6wUuyXGX-8h9AFx1wjJA5uWktIVNXM5dkvshI7xgAWes5zEdyh5m_uDRvZt60wa8rmphdJ_QT90cESWqQHmX-r1eroNYsc6trK7ZC4LebtJJFFu1d2zbyMGBFfejZDyKR9SiNKKZld1HaPb3lLXs327ApjKjVZ_95KmvqlKWv1VKug9bY8S2kAfQ-Ho2YJeuJq8ZsDOsP7wN2NYZmkWFwFoKq0NoP6hZSKbuvWRyIXU21cKsB0DMj-AyTSZ0QHDL2c_9ZjRbn845hib6_eIETLRluoXjh4LlDJ22bu6UHU_NBnf9Qng2O4XW33ha_y2ewZ6ilc46Dc6hKd8_xAXqVsnamijfNcF8ow
link.rule.ids 230,309,786,891,25594,76903
linkProvider European Patent Office
linkToHtml http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwY2BQMbBMswAm3hTdNKNUI12TlBQT3UQzyyTdlOQki5SkJHPTJPBQtq-fmUeoiVeEaQQTQxZsLwz4nNBy8OGIwByVDMzvJeDyugAxiOUCXltZrJ-UCRTKt3cLsXVRg_aOjYyAKdhCzcXJ1jXA38XfWc3Z2dbZT80vyBbY7DcHtZbNHJkZWM2BXUJwVynMCbQrpQC5SnETZGALAJqWVyLEwFSVIczA6Qy7eU2YgcMXOuEtzMAOXqGZXAwUhObCYhEGuWDQXUgK4LOXFFJSS8CrqfIUoBdApIsyKLq5hjh76AKtjIf7L97ZD-E6YzEGFmC_P1WCQQHYljFMNjazTE1KTAJ22gwTjdMMTEF3g5uYJaeaGiVJMkjhNkcKn6Q8A6dHiK9PvI-nn7c0Axco3MArUC1kGFhKikpTZYH1bEmSHDiAAC5ff40
openUrl ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Apatent&rft.title=Sound+event+detection+learning&rft.inventor=GUO+YONGHONG&rft.inventor=VISSER+ERIK&rft.inventor=SAGI+FIRAS&rft.inventor=XU+ERIC&rft.date=2022-07-08&rft.externalDBID=A&rft.externalDocID=CN114730566A