A Case-Based Approach to Data-to-Text Generation

Traditional Data-to-Text Generation (D2T) systems utilise carefully crafted domain specific rules and templates to generate high quality accurate texts. More recent approaches use neural systems to learn domain rules from the training data to produce very fluent and diverse texts. However, there is...

Full description

Saved in:

Bibliographic Details
Published in	Case-Based Reasoning Research and Development Vol. 12877; pp. 232 - 247
Main Authors	Upadhyay, Ashish, Massie, Stewart, Singh, Ritwik Kumar, Gupta, Garima, Ojha, Muneendra
Format	Book Chapter
Language	English
Published	Switzerland Springer International Publishing AG 2021 Springer International Publishing
Series	Lecture Notes in Computer Science
Subjects	Data-to-Text Feature weighting Textual CBR
Online Access	Get full text

Cover

Loading…

More Information
Summary:	Traditional Data-to-Text Generation (D2T) systems utilise carefully crafted domain specific rules and templates to generate high quality accurate texts. More recent approaches use neural systems to learn domain rules from the training data to produce very fluent and diverse texts. However, there is a trade-off with rule-based systems producing accurate text but that may lack variation, while learning-based systems produce more diverse texts but often with poorer accuracy. In this paper, we propose a Case-Based approach for D2T that mitigates the impact of this trade-off by dynamically selecting templates from the training corpora. In our approach we develop a novel case-alignment based, feature weighing method that is used to build an effective similarity measure. Extensive experimentation is performed on a sports domain dataset. Through Extractive Evaluation metrics, we demonstrate the benefit of the CBR system over a rule-based baseline and a neural benchmark.
ISBN:	3030869563 9783030869564
ISSN:	0302-9743 1611-3349
DOI:	10.1007/978-3-030-86957-1_16