Terminological paraphrase extraction from scientific literature based on predicate argument tuples
Published in | Journal of information science Vol. 38; no. 6; pp. 593 - 611 |
---|---|
Main Authors | , |
Format | Journal Article |
Language | English |
Published | London, England; SAGE Publications; 01.12.2012; Sage Publications Bowker-Saur Ltd |
Subjects | |
Summary: | Terminological paraphrases (TPs) are sentences or phrases that express the concepts of terminologies in a different form. Here we propose an effective way to identify and extract TPs from large-scale scientific literature databases. We propose a novel method for effectively retrieving sentences that contain a given terminological concept based on semantic units called predicate-argument tuples. This method enables effective textual similarity computation and minimizes errors based on six TP ranking models. For evaluation, we constructed an evaluation collection for the TP recognition task by extracting TPs from a target literature database using the proposed method. Through two experiments, we learned that scientific literature contains many TPs that could not previously be identified. The experimental results also showed the potential and extensibility of the proposed method for extracting TPs. |
---|---|
ISSN: | 0165-5515 1741-6485 |
DOI: | 10.1177/0165551512459920 |
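
Illustration (not from the article): the summary describes comparing sentences through predicate-argument tuples. The sketch below is only a rough picture of that idea under stated assumptions. It assumes tuples have already been extracted by some parser and are given as hypothetical (predicate, subject, object) triples, and it scores candidates with plain Jaccard overlap, which is a stand-in and not one of the six TP ranking models mentioned in the summary. All names (`PATuple`, `tuple_overlap`, `rank_candidates`) are invented for this example.

```python
from typing import Dict, Iterable, List, Tuple

# Hypothetical tuple layout: (predicate, subject, object).
PATuple = Tuple[str, str, str]


def normalize(t: PATuple) -> Tuple[str, ...]:
    """Lower-case and trim each slot so trivial surface variation does not matter."""
    return tuple(part.strip().lower() for part in t)


def tuple_overlap(a: Iterable[PATuple], b: Iterable[PATuple]) -> float:
    """Jaccard overlap between two tuple sets (a stand-in similarity measure)."""
    set_a = {normalize(t) for t in a}
    set_b = {normalize(t) for t in b}
    if not set_a and not set_b:
        return 0.0
    return len(set_a & set_b) / len(set_a | set_b)


def rank_candidates(
    concept: Iterable[PATuple],
    candidates: Dict[str, List[PATuple]],
) -> List[Tuple[float, str]]:
    """Score each candidate sentence against the concept, best first."""
    concept = list(concept)
    scored = [
        (tuple_overlap(concept, tuples), sentence)
        for sentence, tuples in candidates.items()
    ]
    # Sort by descending score, then alphabetically for a stable order.
    return sorted(scored, key=lambda pair: (-pair[0], pair[1]))


# Hand-written toy data; a real system would obtain tuples from a parser.
concept_tuples = [("translate", "machine", "text")]
candidate_sentences = {
    "A machine translates text.": [("Translate", "machine", "text")],
    "A machine translates text using a dictionary.": [
        ("translate", "machine", "text"),
        ("use", "machine", "dictionary"),
    ],
    "The editor revises text.": [("revise", "editor", "text")],
}

ranking = rank_candidates(concept_tuples, candidate_sentences)
for score, sentence in ranking:
    print(f"{score:.2f}  {sentence}")
```

Matching on whole tuples rather than on individual words is what keeps the third sentence at zero even though it shares the word "text" with the concept.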