Sound event detection learning
An apparatus includes a processor configured to receive audio data samples and provide the audio data samples to a first neural network to generate a first output corresponding to a first set of sound categories. The processor is further configured to provide the audio data samples to a second neura...
Saved in:
Main Authors | , , , |
---|---|
Format | Patent |
Language | Chinese English |
Published |
08.07.2022
|
Subjects | |
Online Access | Get full text |
Cover
Loading…
Abstract | An apparatus includes a processor configured to receive audio data samples and provide the audio data samples to a first neural network to generate a first output corresponding to a first set of sound categories. The processor is further configured to provide the audio data samples to a second neural network to generate a second output corresponding to a second set of sound categories. The second class count of the second set of sound classes is greater than the first class count of the first set of sound classes. The processor is further configured to provide the first output to the neural adapter to generate a third output corresponding to the second set of sound categories. The processor is further configured to provide the second output and the third output to the merge adapter to generate sound event identification data based on the audio data samples.
一种设备,包括处理器,该处理器被配置为接收音频数据样本并将音频数据样本提供给第一神经网络以生成对应于第一组声音类别的第一输出。处理器还被配置为将音频数据样本提供给第二神经网络以生成对应于第二组声音类别的第二输出。第二组声音类别的第二类别计数大于第一组声音类别的第一类别计数。处理器还被配置为将第一输出提供给神 |
---|---|
AbstractList | An apparatus includes a processor configured to receive audio data samples and provide the audio data samples to a first neural network to generate a first output corresponding to a first set of sound categories. The processor is further configured to provide the audio data samples to a second neural network to generate a second output corresponding to a second set of sound categories. The second class count of the second set of sound classes is greater than the first class count of the first set of sound classes. The processor is further configured to provide the first output to the neural adapter to generate a third output corresponding to the second set of sound categories. The processor is further configured to provide the second output and the third output to the merge adapter to generate sound event identification data based on the audio data samples.
一种设备,包括处理器,该处理器被配置为接收音频数据样本并将音频数据样本提供给第一神经网络以生成对应于第一组声音类别的第一输出。处理器还被配置为将音频数据样本提供给第二神经网络以生成对应于第二组声音类别的第二输出。第二组声音类别的第二类别计数大于第一组声音类别的第一类别计数。处理器还被配置为将第一输出提供给神 |
Author | SAGI FIRAS XU ERIC VISSER ERIK GUO YONGHONG |
Author_xml | – fullname: GUO YONGHONG – fullname: VISSER ERIK – fullname: SAGI FIRAS – fullname: XU ERIC |
BookMark | eNrjYmDJy89L5WSQC84vzUtRSC1LzStRSEktSU0uyczPU8hJTSzKy8xL52FgTUvMKU7lhdLcDIpuriHOHrqpBfnxqcUFicmpeakl8c5-hoYm5sYGpmZmjsbEqAEA2-EmXw |
ContentType | Patent |
DBID | EVB |
DatabaseName | esp@cenet |
DatabaseTitleList | |
Database_xml | – sequence: 1 dbid: EVB name: esp@cenet url: http://worldwide.espacenet.com/singleLineSearch?locale=en_EP sourceTypes: Open Access Repository |
DeliveryMethod | fulltext_linktorsrc |
Discipline | Medicine Chemistry Sciences Physics |
DocumentTitleAlternate | 声音事件检测学习 |
ExternalDocumentID | CN114730566A |
GroupedDBID | EVB |
ID | FETCH-epo_espacenet_CN114730566A3 |
IEDL.DBID | EVB |
IngestDate | Fri Jul 19 13:10:53 EDT 2024 |
IsOpenAccess | true |
IsPeerReviewed | false |
IsScholarly | false |
Language | Chinese English |
LinkModel | DirectLink |
MergedId | FETCHMERGED-epo_espacenet_CN114730566A3 |
Notes | Application Number: CN202080078739 |
OpenAccessLink | https://worldwide.espacenet.com/publicationDetails/biblio?FT=D&date=20220708&DB=EPODOC&CC=CN&NR=114730566A |
ParticipantIDs | epo_espacenet_CN114730566A |
PublicationCentury | 2000 |
PublicationDate | 20220708 |
PublicationDateYYYYMMDD | 2022-07-08 |
PublicationDate_xml | – month: 07 year: 2022 text: 20220708 day: 08 |
PublicationDecade | 2020 |
PublicationYear | 2022 |
RelatedCompanies | QUALCOMM INCORPORATED |
RelatedCompanies_xml | – name: QUALCOMM INCORPORATED |
Score | 3.5403454 |
Snippet | An apparatus includes a processor configured to receive audio data samples and provide the audio data samples to a first neural network to generate a first... |
SourceID | epo |
SourceType | Open Access Repository |
SubjectTerms | ACOUSTICS MUSICAL INSTRUMENTS PHYSICS SPEECH ANALYSIS OR SYNTHESIS SPEECH OR AUDIO CODING OR DECODING SPEECH OR VOICE PROCESSING SPEECH RECOGNITION |
Title | Sound event detection learning |
URI | https://worldwide.espacenet.com/publicationDetails/biblio?FT=D&date=20220708&DB=EPODOC&locale=&CC=CN&NR=114730566A |
hasFullText | 1 |
inHoldings | 1 |
isFullTextHit | |
isPrint | |
link | http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwfR1dS8Mw8Jjz802rolNHBelbcOvX2ociLm0ZwrqhU_Y2miad-jCHjQj-ei-hc77o6wUuyXGX-8h9AFx1wjJA5uWktIVNXM5dkvshI7xgAWes5zEdyh5m_uDRvZt60wa8rmphdJ_QT90cESWqQHmX-r1eroNYsc6trK7ZC4LebtJJFFu1d2zbyMGBFfejZDyKR9SiNKKZld1HaPb3lLXs327ApjKjVZ_95KmvqlKWv1VKug9bY8S2kAfQ-Ho2YJeuJq8ZsDOsP7wN2NYZmkWFwFoKq0NoP6hZSKbuvWRyIXU21cKsB0DMj-AyTSZ0QHDL2c_9ZjRbn845hib6_eIETLRluoXjh4LlDJ22bu6UHU_NBnf9Qng2O4XW33ha_y2ewZ6ilc46Dc6hKd8_xAXqVsnamijfNcF8ow |
link.rule.ids | 230,309,786,891,25594,76903 |
linkProvider | European Patent Office |
linkToHtml | http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwY2BQMbBMswAm3hTdNKNUI12TlBQT3UQzyyTdlOQki5SkJHPTJPBQtq-fmUeoiVeEaQQTQxZsLwz4nNBy8OGIwByVDMzvJeDyugAxiOUCXltZrJ-UCRTKt3cLsXVRg_aOjYyAKdhCzcXJ1jXA38XfWc3Z2dbZT80vyBbY7DcHtZbNHJkZWM2BXUJwVynMCbQrpQC5SnETZGALAJqWVyLEwFSVIczA6Qy7eU2YgcMXOuEtzMAOXqGZXAwUhObCYhEGuWDQXUgK4LOXFFJSS8CrqfIUoBdApIsyKLq5hjh76AKtjIf7L97ZD-E6YzEGFmC_P1WCQQHYljFMNjazTE1KTAJ22gwTjdMMTEF3g5uYJaeaGiVJMkjhNkcKn6Q8A6dHiK9PvI-nn7c0Axco3MArUC1kGFhKikpTZYH1bEmSHDiAAC5ff40 |
openUrl | ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Apatent&rft.title=Sound+event+detection+learning&rft.inventor=GUO+YONGHONG&rft.inventor=VISSER+ERIK&rft.inventor=SAGI+FIRAS&rft.inventor=XU+ERIC&rft.date=2022-07-08&rft.externalDBID=A&rft.externalDocID=CN114730566A |