Sound event detection learning

An apparatus includes a processor configured to receive audio data samples and provide the audio data samples to a first neural network to generate a first output corresponding to a first set of sound categories. The processor is further configured to provide the audio data samples to a second neura...

Full description

Saved in:

Bibliographic Details
Main Authors	GUO YONGHONG, VISSER ERIK, SAGI FIRAS, XU ERIC
Format	Patent
Language	Chinese English
Published	08.07.2022
Subjects	ACOUSTICS MUSICAL INSTRUMENTS PHYSICS SPEECH ANALYSIS OR SYNTHESIS SPEECH OR AUDIO CODING OR DECODING SPEECH OR VOICE PROCESSING SPEECH RECOGNITION
Online Access	Get full text

Cover

Loading…

More Information
Summary:	An apparatus includes a processor configured to receive audio data samples and provide the audio data samples to a first neural network to generate a first output corresponding to a first set of sound categories. The processor is further configured to provide the audio data samples to a second neural network to generate a second output corresponding to a second set of sound categories. The second class count of the second set of sound classes is greater than the first class count of the first set of sound classes. The processor is further configured to provide the first output to the neural adapter to generate a third output corresponding to the second set of sound categories. The processor is further configured to provide the second output and the third output to the merge adapter to generate sound event identification data based on the audio data samples. 一种设备，包括处理器，该处理器被配置为接收音频数据样本并将音频数据样本提供给第一神经网络以生成对应于第一组声音类别的第一输出。处理器还被配置为将音频数据样本提供给第二神经网络以生成对应于第二组声音类别的第二输出。第二组声音类别的第二类别计数大于第一组声音类别的第一类别计数。处理器还被配置为将第一输出提供给神
Bibliography:	Application Number: CN202080078739