MapReduce-Based Bayesian Automatic Text Classifier Used in Digital Library

Bayesian theorem is an effective method for text classification. But it will consume too much time and resource when used in large-scale database such as digital library. How to process data efficiently becomes a vital problem for the further development of digital library. Because Mapreduce model h...

Full description

Saved in:

Bibliographic Details
Published in	Computational Intelligence and Intelligent Systems pp. 121 - 126
Main Authors	Niu, Zhen, Yin, Zelong, Cui, Huayang
Format	Book Chapter
Language	English
Published	Berlin, Heidelberg Springer Berlin Heidelberg
Series	Communications in Computer and Information Science
Subjects	Bayesian Text Classifier Cloud Computing Digital Library Machine Learning Mapreduce
Online Access	Get full text

Cover

Loading…

More Information
Summary:	Bayesian theorem is an effective method for text classification. But it will consume too much time and resource when used in large-scale database such as digital library. How to process data efficiently becomes a vital problem for the further development of digital library. Because Mapreduce model has strong capacity of processing mass data, Mapreduce-based Bayesian classifier can reduce the time that is caused by machine learning and therefore enhance the overall efficiency. The new method can classify documents into different groups according to its subject. In this way, it is helpful for management and storage of information. Thus Mapreduce-based Bayesian text classifier can be used in digital library successfully which will provide better service for people.
ISBN:	3642342884 9783642342882
ISSN:	1865-0929 1865-0937
DOI:	10.1007/978-3-642-34289-9_14