How to improve information extraction from German medical records

Vast amounts of medical information are still recorded as unstructured text. The knowledge contained in this textual data has a great potential to improve clinical routine care, to support clinical research, and to advance personalization of medicine. To access this knowledge, the underlying data ha...

Full description

Saved in:

Bibliographic Details
Published in	Information technology (Munich, Germany) Vol. 59; no. 4; pp. 171 - 179
Main Authors	Starlinger, Johannes, Kittner, Madeleine, Blankenstein, Oliver, Leser, Ulf
Format	Journal Article
Language	English
Published	De Gruyter Oldenbourg 28.08.2017
Subjects	Applied computing →Document management and text processing →Document preparation →Annotation Applied computing →Life and medical sciences →Health care information systems information extraction Information systems →Information Retrieval →Retrieval tasks and goals →Information extraction Medical text mining semantic information integration
Online Access	Get full text
ISSN	1611-2776 2196-7032
DOI	10.1515/itit-2016-0027

Cover

More Information
Summary:	Vast amounts of medical information are still recorded as unstructured text. The knowledge contained in this textual data has a great potential to improve clinical routine care, to support clinical research, and to advance personalization of medicine. To access this knowledge, the underlying data has to be semantically integrated – an essential prerequisite to which is information extraction from clinical documents. A body of work, and a good selection of openly available tools for information extraction and semantic integration in the medical domain exist, yet almost exclusively for English language documents. For German texts the situation is rather different: research work is sparse, tools are proprietary or unpublished, and rarely any freely available textual resources exist. In this survey, we (1) describe the challenges of information extraction from German medical documents and the hurdles posed to research in this area, (2) especially address the problems of missing German language resources and privacy implications, and (3) identify the steps necessary to overcome these hurdles and fuel research in semantic integration of textual clinical data.
ISSN:	1611-2776 2196-7032
DOI:	10.1515/itit-2016-0027