Low-resource neural machine translation method based on multi-strategy prototype generation

The invention relates to a low-resource neural machine translation method based on multi-strategy prototype generation, and belongs to the technical field of natural language processing. The method comprises the steps that firstly, a prototype sequence is retrieved by combining keyword matching and...

Full description

Saved in:
Bibliographic Details
Main Authors ZHU ENCHANG, YU ZHENGTAO, YU ZHIQIANG
Format Patent
LanguageChinese
English
Published 28.06.2022
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:The invention relates to a low-resource neural machine translation method based on multi-strategy prototype generation, and belongs to the technical field of natural language processing. The method comprises the steps that firstly, a prototype sequence is retrieved by combining keyword matching and distributed representation matching, and if matching is not obtained, an available pseudo prototype sequence is generated through a pseudo prototype generation method; secondly, in order to effectively utilize the prototype sequence, a traditional encoder-decoder framework is improved. The encoding end receives prototype sequence input by using an additional encoder; a decoding end uses an improved loss function to reduce the influence of a low-quality prototype sequence on a model while controlling information flow by using a gating mechanism. According to the method provided by the invention, the number and quality of retrieval results can be effectively improved based on a small amount of parallel corpora, and t
Bibliography:Application Number: CN202210293213