A citation-based approach to automatic topical indexing of scientific literature

Topical indexing of documents with keyphrases is a common method used for revealing the subject of scientific and research documents to both human readers and information retrieval tools, such as search engines. However, scientific documents that are manually indexed with keyphrases are still in the...

Full description

Saved in:
Bibliographic Details
Published inJournal of information science Vol. 36; no. 6; pp. 798 - 811
Main Authors Mahdi, Abdulhussain E., Joorabchi, Arash
Format Journal Article
LanguageEnglish
Published London, England SAGE Publications 01.12.2010
Sage Publications
Bowker-Saur Ltd
Subjects
Online AccessGet full text
ISSN0165-5515
1741-6485
DOI10.1177/0165551510388080

Cover

More Information
Summary:Topical indexing of documents with keyphrases is a common method used for revealing the subject of scientific and research documents to both human readers and information retrieval tools, such as search engines. However, scientific documents that are manually indexed with keyphrases are still in the minority. This article describes a new unsupervised method for automatic keyphrase extraction from scientific documents which yields a performance on a par with human indexers. The method is based on identifying references cited in the document to be indexed and, using the keyphrases assigned to those references, for generating a set of high-likelihood keyphrases for the document. We have evaluated the performance of the proposed method by using it to automatically index a third-party testset of research documents. Reported experimental results show that the performance of our method, measured in terms of consistency with human indexers, is competitive with that achieved by state-of-the-art supervised methods.
Bibliography:SourceType-Scholarly Journals-1
ObjectType-Feature-1
content type line 14
ObjectType-Article-2
content type line 23
ObjectType-Article-1
ObjectType-Feature-2
ISSN:0165-5515
1741-6485
DOI:10.1177/0165551510388080