Lexicon-based context-sensitive reference comments crawler

This paper proposes a novel system that aids in the writing of research papers by gathering and analysing other researchers’ comments for a given reference paper to provide some features, advantages or disadvantages of the referenced research. A lexicon-based reference comments crawler (LRCC) classi...

Full description

Saved in:
Bibliographic Details
Published inJournal of information science Vol. 41; no. 3; pp. 342 - 353
Main Author Jeon, Hocheol
Format Journal Article
LanguageEnglish
Published London, England SAGE Publications 01.06.2015
Bowker-Saur Ltd
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:This paper proposes a novel system that aids in the writing of research papers by gathering and analysing other researchers’ comments for a given reference paper to provide some features, advantages or disadvantages of the referenced research. A lexicon-based reference comments crawler (LRCC) classifies the comments about a reference paper and the surrounding sentences using part-of-speech lexicons and a dynamic text window into four categories (normal, advantage, disadvantage and complex). The extraction of comments and surrounding sentences from research papers is effectively and efficiently carried out using the reference identifier and some simple extraction rules. In this paper, we considered the various types of reference identifiers, because a reference identifier is a key solution for the sentence extraction in the LRCC system. Several experiments were performed using published research papers to evaluate the LRCC’s precision and recall. The results showed that the LRCC can extract and classify comments with a high degree of precision and recall, as well as present them to the user in an effective and efficient manner.
ISSN:0165-5515
1741-6485
DOI:10.1177/0165551515575921