Web Mining of Relations from XML and Construct Database Schema

Bibliographic Details
Published in: 2006 International Conference on Computational Intelligence for Modelling Control and Automation and International Conference on Intelligent Agents Web Technologies and International Commerce (CIMCA'06), p. 211
Main Authors: Xu Zhou, Xuezeng Pan, Yu Ren
Format: Conference Proceeding
Language: English
Published: IEEE, 01.11.2006
Summary: An increasing amount of commercial data is presented in XML format for exchange or publication on the Web, and XML is emerging as a new standard for representing and exchanging information over the Internet. How to retrieve valuable information from XML documents on the Web is a new challenge for data mining research. Unlike data in a relational database, XML data is stored in files with an internal logical tree structure, which makes direct querying less efficient. It is therefore still necessary to transform the data into a database (warehouse) before mining it. In this paper, we present a scheme that analyzes the relations among elements in XML documents on the Web and constructs a relational database schema from that analysis. The process also yields a useful by-product, a glossary, which can facilitate the design and construction of a data mining warehouse.
ISBN: 0769527310, 9780769527314
DOI: 10.1109/CIMCA.2006.233