A practical framework for formalizing and extracting Chinese collocations
Published in | 2011 7th International Conference on Natural Language Processing and Knowledge Engineering, pp. 390-396 |
---|---|
Format | Conference Proceeding |
Language | English |
Published | IEEE, 01.11.2011 |
Summary: | In this paper we argue for a word-sense-based formalization of collocation and propose a seed-based approach to collocation extraction for specific purposes. The approach uses the RFR_SUM model to iteratively classify polysemous word senses in the corpus. Collocation strength is also obtained by RFR. To capture the syntactic relations inside collocations, this paper presents a frame-based collocation extraction method, which uses word-related frames to automatically obtain collocations with structural information from a large-scale corpus, with an average accuracy rate of 89.69%. |
---|---|
DOI: | 10.1109/NLPKE.2011.6138230 |