Numerical Atrribute Extraction from Clinical Texts

This paper describes about information extraction system, which is an extension of the system developed by team Hitachi for "Disease/Disorder Template filling" task organized by ShARe/CLEF eHealth Evolution Lab 2014. In this extension module we focus on extraction of numerical attributes a...

Full description

Saved in:

Bibliographic Details
Published in	arXiv.org
Main Authors	Sarath, P R, Mandhan, Sunil, Niwa, Yoshiki
Format	Paper Journal Article
Language	English
Published	Ithaca Cornell University Library, arXiv.org 31.01.2016
Subjects	Algorithms Computer Science - Artificial Intelligence Information retrieval Mathematical models Modules Support vector machines
Online Access	Get full text

Cover

Loading…

More Information
Summary:	This paper describes about information extraction system, which is an extension of the system developed by team Hitachi for "Disease/Disorder Template filling" task organized by ShARe/CLEF eHealth Evolution Lab 2014. In this extension module we focus on extraction of numerical attributes and values from discharge summary records and associating correct relation between attributes and values. We solve the problem in two steps. First step is extraction of numerical attributes and values, which is developed as a Named Entity Recognition (NER) model using Stanford NLP libraries. Second step is correctly associating the attributes to values, which is developed as a relation extraction module in Apache cTAKES framework. We integrated Stanford NER model as cTAKES pipeline component and used in relation extraction module. Conditional Random Field (CRF) algorithm is used for NER and Support Vector Machines (SVM) for relation extraction. For attribute value relation extraction, we observe 95% accuracy using NER alone and combined accuracy of 87% with NER and SVM.
Bibliography:	Submission 42, CLEF 2015
ISSN:	2331-8422
DOI:	10.48550/arxiv.1602.00269