Text Mining Predictive Methods for Analyzing Unstructured Information

The growth of the web can be seen as an expanding public digital library collection. Online digital information extends far beyond the web and its publicly available information. Huge amounts of information are private and are of interest to local communities, such as the records of customers of a b...

Full description

Saved in:
Bibliographic Details
Main Authors Damerau, Fred, Indurkhya, Nitin, Weiss, Sholom M, Zhang, Tong
Format eBook Book
LanguageEnglish
Published New York, NY Springer-Verlag 2004
Springer
Springer New York
Edition1. Aufl.
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:The growth of the web can be seen as an expanding public digital library collection. Online digital information extends far beyond the web and its publicly available information. Huge amounts of information are private and are of interest to local communities, such as the records of customers of a business. This information is overwhelmingly text and has its record-keeping purpose, but an automated analysis might be desirable to find patterns in the stored records. Analogous to this data mining is text mining, which also finds patterns and trends in information samples but which does so with far less structured--though with greater immediate utility for users--ingredients. This book focuses on the concepts and methods needed to expand horizons beyond structured, numeric data to automated mining of text samples. It introduces the new world of text mining and examines proven methods for various critical text-mining tasks, such as automated document indexing and information retrieval and search. New research areas are explored, such as information extraction and document summarization, that rely on evolving text-mining techniques. TOC:Overview of text mining.- From textual information to numerical vectors.- Using text for prediction.- Information retrieval and text mining.- Finding structure in a document collection.- Looking for information in documents.- Case studies.- Emerging directions.- Appendix: software notes.- References.- Author and subject indexes.
Bibliography:Includes bibliographical references (p. [217]-228) and indexes
ISBN:0387954333
9780387954332
DOI:10.1007/978-0-387-34555-0