A multi-scenario text generation method based on meta reinforcement learning

•We propose a multi-scene text generation framework based on meta learning.•We use discriminator to provide guidance for the generative model.•Good performance on the open source English text datasets. Multi-scenario text generation is an essential task in natural language generation because of the...

Full description

Saved in:
Bibliographic Details
Published inPattern recognition letters Vol. 165; pp. 47 - 54
Main Authors Zhao, Tingting, Li, Guixi, Song, Yajing, Wang, Yuan, Chen, Yarui, Yang, Jucheng
Format Journal Article
LanguageEnglish
Published Elsevier B.V 01.01.2023
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:•We propose a multi-scene text generation framework based on meta learning.•We use discriminator to provide guidance for the generative model.•Good performance on the open source English text datasets. Multi-scenario text generation is an essential task in natural language generation because of the multi-scene interlaced property of real-world problems. Traditional methods typically train the multi-scenario text generation models based on maximum likelihood estimation, which may suffer from the problem of exposure bias. Reinforcement learning (RL) based text generation methods could mitigate the exposure bias problem to some extent. However, the RL-based text generation methods are limited to the single-scenario tasks, which cannot be straightforwardly generalized to new scenario tasks. To address this problem, in this paper, we propose a multi-scenario text generation method based on meta RL (MetaRL-TG), which implements the method of model-agnostic meta-learning (MAML) in the framework of RL-based text generation. The proposed MetaRL-TG method first learns the initial parameters from multiple training tasks, then fine-tunes them in the target task. Thus, the proposed method is expected to efficiently achieve high-quality generated text in the new scenario. Finally, the effectiveness and generalization capability of the proposed method are demonstrated for eight scenarios through English test datasets.
ISSN:0167-8655
1872-7344
DOI:10.1016/j.patrec.2022.11.031