Separate Training for Conditional Random Fields Using Co-occurrence Rate Factorization

The standard training method of Conditional Random Fields (CRFs) is very slow for large-scale applications. As an alternative, piecewise training divides the full graph into pieces, trains them independently, and combines the learned weights at test time. In this paper, we present \emph{separate} tr...

Full description

Saved in:
Bibliographic Details
Main Authors Zhu, Zhemin, Hiemstra, Djoerd, Apers, Peter, Wombacher, Andreas
Format Journal Article
LanguageEnglish
Published 09.08.2010
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:The standard training method of Conditional Random Fields (CRFs) is very slow for large-scale applications. As an alternative, piecewise training divides the full graph into pieces, trains them independently, and combines the learned weights at test time. In this paper, we present \emph{separate} training for undirected models based on the novel Co-occurrence Rate Factorization (CR-F). Separate training is a local training method. In contrast to MEMMs, separate training is unaffected by the label bias problem. Experiments show that separate training (i) is unaffected by the label bias problem; (ii) reduces the training time from weeks to seconds; and (iii) obtains competitive results to the standard and piecewise training on linear-chain CRFs.
Bibliography:TR-CTIT-12-29
DOI:10.48550/arxiv.1008.1566