Citation function, polarity and influence classification

Bibliographic Details
Published in	Natural Language Engineering Vol. 23; no. 4; pp. 561-588
Main Authors	HERNÁNDEZ-ALVAREZ, MYRIAM; GOMEZ SORIANO, JOSÉ M.; MARTÍNEZ-BARCO, PATRICIO
Format	Journal Article
Language	English
Published	Cambridge, UK: Cambridge University Press, 01.07.2017
Subjects	Algorithms; Annotations; Citation analysis; Classification; Classification schemes; Cocitation; Coders; Coding; Computational linguistics; Counting; Human error; Incentives; Information dissemination; Information science; Keywords; Labels; Library associations; Marking; Medical libraries; Methods; Natural language processing; Nobel prizes; Polarity; Production methods; Qualitative research; Science; Search engines

Summary:	Current methods for assessing the impact of authors and scientific media employ tools such as H-Index, Co-Citation and PageRank. These tools are primarily based on citation counting, which considers all citations to be equal. Methods of this type can produce perverse incentives to publish controversial or incomplete papers, as mixed or negative reviews often generate larger citation counts and better indexes, regardless of whether the citations were critical or exerted minimal influence on the citing document. Passing citations, which are employed to establish background and have no real impact on the citing paper, are common in scientific literature. However, these citations have equal weight in impact evaluations. Notable researchers have emphasized the need to correct this situation by developing estimation methods that consider the different roles of quotations in citing papers. To accomplish this type of evaluation, a citation context analysis should be applied to determine the nature of the citations. We propose that citations should be categorized using four dimensions – FUNCTION, POLARITY, ASPECTS and INFLUENCE – as these dimensions provide adequate information that can be employed toward the generation of a qualitative method to measure the impact of a given publication in a citing paper. In this paper, we use the words influence and impact interchangeably. We present a method for obtaining this information using our proposed classification scheme and a manually annotated corpus, which is marked with meaningful keywords and labels to help identify the characteristics or properties that constitute what we call ASPECTS. We develop a classification scheme that takes into account the purpose definitions shared by previous works.
Our contribution is to abstract purpose classes from several other schemes and divide a complex structure into more manageable parts, to attain a simple system that combines low-granularity dimensions but nevertheless produces a fine-grained classification. For annotators, the classification process is simple because, in a first step, the coders distinguish only four primary classes, and in a second pass, they add the information contained in the ASPECTS keywords and labels to obtain the more specific functions. In this way, we obtain a high-granularity labeling that gives enough information about the citations to characterize and classify them, and we achieve this detailed coding with a straightforward process in which human error can be minimized.
ISSN:	1351-3249, 1469-8110
DOI:	10.1017/S1351324916000346