Speech translation and speech recognition method based on dynamic sequence compression

Bibliographic Details
Main Authors YANG MURUN, DU QUAN
Format Patent
Language Chinese, English
Published 02.06.2023
Summary: The invention relates to a speech translation and speech recognition method based on dynamic sequence compression and belongs to the technical field of natural language processing. It addresses two problems in the prior art: existing speech translation and speech recognition methods cannot compress speech data effectively, so they consume excessive computing resources; and because they cannot compress speech data dynamically and progressively, the data is over-compressed and information is lost. The speech translation method comprises the following steps: acquiring the source-language speech data to be translated; performing length prediction, dynamic compression, feature fusion and encoding on the feature sequence of the speech data with an acoustic encoder to obtain an acoustic-encoder hidden vector; performing text-modality conversion on the acoustic-encoder hidden vector with a text encoder, followed by feature extraction and encoding, to obtain a t...
Bibliography: Application Number: CN202211732994
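
The length prediction and step-by-step compression named in the summary can be illustrated with a small sketch. This is not the patented method: the record does not say how the length is predicted or how frames are merged, so the code below assumes a fixed keep ratio in place of a learned length predictor and plain average pooling over contiguous segments in place of the acoustic encoder's compression. Feature fusion and encoding are omitted. The names predict_length, compress, progressive_compress and keep_ratios are hypothetical.

```python
import math


def predict_length(num_frames, keep_ratio):
    """Hypothetical stand-in for a learned length predictor.

    Assumption: the target length is a fixed fraction of the input length.
    """
    return max(1, math.ceil(num_frames * keep_ratio))


def compress(frames, target_len):
    """Average-pool a non-empty list of frames into target_len segments.

    Assumption: average pooling over contiguous segments stands in for
    the compression step, which the record does not specify.
    """
    total = len(frames)
    target_len = min(target_len, total)  # never ask for more segments than frames
    dim = len(frames[0])
    pooled = []
    for i in range(target_len):
        # Integer arithmetic splits the sequence into near-equal segments.
        start = i * total // target_len
        end = (i + 1) * total // target_len
        segment = frames[start:end]
        pooled.append(
            [sum(frame[d] for frame in segment) / len(segment) for d in range(dim)]
        )
    return pooled


def progressive_compress(frames, keep_ratios):
    """Compress stage by stage instead of in one aggressive step."""
    for ratio in keep_ratios:
        frames = compress(frames, predict_length(len(frames), ratio))
    return frames


# Eight 2-dimensional frames, halved twice: 8 -> 4 -> 2 frames.
frames = [[float(t), float(2 * t)] for t in range(8)]
compressed = progressive_compress(frames, keep_ratios=[0.5, 0.5])
print(len(compressed), compressed)
```

Compressing in several mild stages rather than one large one mirrors the summary's concern that excessive one-shot compression loses information; in a real system each stage would sit between encoder layers so later layers can adapt to the shorter sequence.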