Single-channel voice separation method and device for multiple speakers

The invention relates to a single-channel voice separation method and device for multiple speakers. The method comprises the following steps: acquiring a frequency spectrum amplitude and a frequency spectrum phase of mixed voice; and inputting the frequency spectrum amplitude of the mixed voice into...

Full description

Saved in:
Bibliographic Details
Main Authors SHI HUIYU, OUYANG PENG, YIN SHOUYI
Format Patent
LanguageChinese
English
Published 05.02.2021
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:The invention relates to a single-channel voice separation method and device for multiple speakers. The method comprises the following steps: acquiring a frequency spectrum amplitude and a frequency spectrum phase of mixed voice; and inputting the frequency spectrum amplitude of the mixed voice into a trained generative adversarial network model, and obtaining a plurality of estimated amplitude masks of the generative adversarial network model; obtaining a plurality of target frequency spectrum amplitudes according to the plurality of estimated amplitude masks and the frequency spectrum amplitude of the mixed voice; and reconstructing a plurality of target frequency spectrum amplitudes and frequency spectrum phases one by one to generate a plurality of target voices. According to the method, the target voice separation result corresponding to each speaker can be obtained, the number of speakers in the mixed voice can be quickly judged, the separation accuracy is improved, the distortion rate of the voice is r
Bibliography:Application Number: CN202011057899