A Mechanism Generating Semi-Structured RDF Metadata from Web Documents

Bibliographic Details
Published in: 2011 IEEE Asia-Pacific Services Computing Conference, pp. 102-109
Main Authors: Hsiang-Yuan Hsueh, Chun-Nan Chen, Kun-Fu Huang
Format: Conference Proceeding
Language: English
Published: IEEE, 01.12.2011
Summary: The Semantic Web still cannot be realized on the Internet, because a large number of the unstructured web documents available there contain natural-language text that can be read only by human beings. For content providers and developers, it is almost impossible to generate metadata for Web content manually. This paper proposes a mechanism that generates a content-based RDF Semantic Web schema from a set of web documents, to serve as the semantic metadata of that set. By analyzing the structural information and content of the web documents, the mechanism conceptualizes them as resource objects with inter-relationships in an RDF diagram. With the semantic metadata of document sets on the Web being systematically translated instead of manually edited, semantic operations on the whole Web, such as semantic query or semantic search, are expected to become possible in the near future.
ISBN: 1467302066, 9781467302067
DOI: 10.1109/APSCC.2011.73
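
The summary describes treating web documents as resource objects with inter-relationships expressed in RDF. The following is a minimal illustrative sketch of that general idea, not the mechanism proposed in the paper, whose details this record does not give. It assumes that a page's title and its hyperlinks are the only structural information used, and that the Dublin Core properties dc:title and dcterms:references are acceptable predicates; the class and function names (LinkAndTitleParser, document_to_ntriples) and the example.org URLs are hypothetical.

```python
from html.parser import HTMLParser
from urllib.parse import urljoin

# Assumed vocabulary: Dublin Core terms, chosen only for illustration.
DC_TITLE = "http://purl.org/dc/elements/1.1/title"
DCT_REFERENCES = "http://purl.org/dc/terms/references"


class LinkAndTitleParser(HTMLParser):
    """Collects the <title> text and every <a href> target of one page."""

    def __init__(self):
        super().__init__()
        self.title = ""
        self.links = []
        self._in_title = False

    def handle_starttag(self, tag, attrs):
        if tag == "title":
            self._in_title = True
        elif tag == "a":
            href = dict(attrs).get("href")
            if href:
                self.links.append(href)

    def handle_endtag(self, tag):
        if tag == "title":
            self._in_title = False

    def handle_data(self, data):
        if self._in_title:
            self.title += data


def escape_literal(text):
    """Escape backslashes, quotes and newlines for an N-Triples literal."""
    return text.replace("\\", "\\\\").replace('"', '\\"').replace("\n", "\\n")


def document_to_ntriples(url, html):
    """Describe one document as a resource: its title plus its outgoing links."""
    parser = LinkAndTitleParser()
    parser.feed(html)
    parser.close()

    triples = []
    title = parser.title.strip()
    if title:
        triples.append(f'<{url}> <{DC_TITLE}> "{escape_literal(title)}" .')
    for href in parser.links:
        target = urljoin(url, href)  # resolve relative links against the page URL
        triples.append(f"<{url}> <{DCT_REFERENCES}> <{target}> .")
    return triples


# Usage: a tiny in-memory page stands in for a fetched web document.
sample = (
    "<html><head><title>Home</title></head>"
    '<body><a href="/about.html">About</a></body></html>'
)
for line in document_to_ntriples("http://example.org/index.html", sample):
    print(line)
```

Each output line is one N-Triples statement, so the results for a whole document set can be concatenated into a single graph and loaded into any RDF store for querying.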