Zero-Shot Dual Machine Translation
Format | Journal Article |
---|---|
Language | English |
Published | 25.05.2018 |
Summary: Neural Machine Translation (NMT) systems rely on large amounts of parallel data, which is a major challenge for low-resource languages. Building on recent work on unsupervised and semi-supervised methods, we present an approach that combines zero-shot and dual learning. The latter relies on reinforcement learning to exploit the duality of the machine translation task, and requires only monolingual data for the target language pair. Experiments show that a zero-shot dual system, trained on English-French and English-Spanish, outperforms a standard NMT system by large margins in zero-shot translation on Spanish-French (both directions). The zero-shot dual method comes within 2.2 BLEU points of a comparable supervised setting. Our method also yields improvements in the setting where a small amount of parallel data for the zero-shot language pair is available. When we add Russian, extending our experiments to jointly model 6 zero-shot translation directions, all directions improve by 4 to 15 BLEU points, again reaching performance near that of the supervised setting.
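The dual-learning component described in the summary rewards a forward translation by how well a backward model can reconstruct the source, combined with a target-side language-model score, and trains with a REINFORCE-style update. A minimal sketch of that reward combination is shown below; the function names, the equal weighting, and the baseline subtraction are illustrative assumptions, not the paper's actual implementation.

```python
# Illustrative sketch of a dual-learning reward (in the style of dual
# learning for MT). All values here are placeholder log-probabilities;
# in a real system they would come from trained NMT and LM models.

def dual_reward(lm_logprob, recon_logprob, alpha=0.5):
    """Combine a target-side language-model reward with a reconstruction
    reward from the backward translation model.

    lm_logprob    -- log-probability of the sampled translation under a
                     target-language LM (fluency signal)
    recon_logprob -- log-probability of reconstructing the original source
                     from that translation via the backward model
    alpha         -- interpolation weight (0.5 is an arbitrary choice here)
    """
    return alpha * lm_logprob + (1.0 - alpha) * recon_logprob


def advantage(reward, baseline):
    """REINFORCE-style advantage: subtracting a baseline (e.g. a running
    mean of rewards) reduces the variance of the policy gradient."""
    return reward - baseline
```

For example, a translation with LM log-probability -2.0 and reconstruction log-probability -4.0 gets a combined reward of -3.0 under equal weighting; only monolingual data is needed on either side, which is what makes the approach applicable to zero-shot pairs.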
DOI | 10.48550/arxiv.1805.10338 |