Learning Language from a Large (Unannotated) Corpus

A novel approach to the fully automated, unsupervised extraction of dependency grammars and associated syntax-to-semantic-relationship mappings from large text corpora is described. The suggested approach builds on the authors' prior work with the Link Grammar, RelEx and OpenCog systems, as wel...

Full description

Saved in:
Bibliographic Details
Main Authors Vepstas, Linas, Goertzel, Ben
Format Journal Article
LanguageEnglish
Published 14.01.2014
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:A novel approach to the fully automated, unsupervised extraction of dependency grammars and associated syntax-to-semantic-relationship mappings from large text corpora is described. The suggested approach builds on the authors' prior work with the Link Grammar, RelEx and OpenCog systems, as well as on a number of prior papers and approaches from the statistical language learning literature. If successful, this approach would enable the mining of all the information needed to power a natural language comprehension and generation system, directly from a large, unannotated corpus.
DOI:10.48550/arxiv.1401.3372