Automatic Keyphrase Extraction from Chinese News Documents

This paper presents a framework for automatically supplying keyphrases for a Chinese news document. It works as follows: extracts Chinese character strings from a source article as an initial set of keyphrase candidates based on frequency and length of the strings, then, filters out unimportant cand...

Full description

Saved in:

Bibliographic Details
Published in	Fuzzy Systems and Knowledge Discovery pp. 648 - 657
Main Authors	Wang, Houfeng, Li, Sujian, Yu, Shiwen
Format	Book Chapter
Language	English
Published	Berlin, Heidelberg Springer Berlin Heidelberg 2005
Series	Lecture Notes in Computer Science
Subjects	Chinese Character Chinese Word Segmentation Elimination Rule Keyphrase Extraction Source Article
Online Access	Get full text

Cover

Loading…

More Information
Summary:	This paper presents a framework for automatically supplying keyphrases for a Chinese news document. It works as follows: extracts Chinese character strings from a source article as an initial set of keyphrase candidates based on frequency and length of the strings, then, filters out unimportant candidates from the initial set by using elimination-rules and transforms vague ones into their canonical forms according to controlled synonymous terms list and abbreviation list, and finally, selects the best items from the set of the remaining candidates by score measure. The approach is tested on People Daily corpus and the experiment results are satisfactory.
Bibliography:	Supported by National Natural Science Foundation of China under Grant No.60473138.
ISBN:	3540283315 9783540283317
ISSN:	0302-9743 1611-3349
DOI:	10.1007/11540007_80