Complex Matching of RDF Datatype Properties


Bibliographic Details
Published in Database and Expert Systems Applications, pp. 195-208
Main Authors Pereira Nunes, Bernardo; Mera, Alexander; Casanova, Marco Antônio; Fetahu, Besnik; P. Paes Leme, Luiz André; Dietze, Stefan
Format Book Chapter
Language English
Published Berlin, Heidelberg Springer Berlin Heidelberg 2013
Series Lecture Notes in Computer Science
Abstract Property mapping is a fundamental component of ontology matching, and yet there is little support that goes beyond the identification of single property matches. Real data often requires some degree of composition, trivially exemplified by the mapping of “first name” and “last name” to “full name” on one end, to complex matchings, such as parsing and pairing symbol/digit strings to SSN numbers, at the other end of the spectrum. In this paper, we propose a two-phase instance-based technique for complex datatype property matching. Phase 1 computes the Estimate Mutual Information matrix of the property values to (1) find simple, 1:1 matches, and (2) compute a list of possible complex matches. Phase 2 applies Genetic Programming to the much reduced search space of candidate matches to find complex matches. We conclude with experimental results that illustrate how the technique works. Furthermore, we show that the proposed technique greatly improves results over those obtained if the Estimate Mutual Information matrix or the Genetic Programming techniques were to be used independently.
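Phase 1 of the abstract can be illustrated with a minimal sketch. The record does not give the formula, so the code below assumes one common formulation of estimated mutual information over a co-occurrence matrix, where each cell counts the distinct values shared by a source property and a target property. The function names (`shared_value_counts`, `emi_matrix`, `simple_matches`) and the sample data are hypothetical and are not taken from the paper.

```python
import math


def shared_value_counts(source, target):
    """Count distinct values shared by each (source, target) property pair."""
    return {
        (s, t): len(set(s_values) & set(t_values))
        for s, s_values in source.items()
        for t, t_values in target.items()
    }


def emi_matrix(counts):
    """Assumed formulation: EMI(s, t) = (m/M) * log(M * m / (row_s * col_t))."""
    total = sum(counts.values())
    row, col = {}, {}
    for (s, t), m in counts.items():
        row[s] = row.get(s, 0) + m
        col[t] = col.get(t, 0) + m
    emi = {}
    for (s, t), m in counts.items():
        if m == 0:
            emi[(s, t)] = 0.0  # no shared values, no evidence of a match
        else:
            emi[(s, t)] = (m / total) * math.log(total * m / (row[s] * col[t]))
    return emi


def simple_matches(emi):
    """Pick, for each source property, the target with the highest positive score."""
    best = {}
    for (s, t), score in emi.items():
        if score > 0 and score > best.get(s, (None, 0.0))[1]:
            best[s] = (t, score)
    return {s: t for s, (t, _) in best.items()}


# Hypothetical instance data for two datasets describing the same people.
source = {
    "firstName": ["Ana", "Luiz"],
    "lastName": ["Silva", "Costa"],
    "city": ["Rio", "Hannover"],
    "email": ["ana@example.org", "luiz@example.org"],
}
target = {
    "fullName": ["Ana Silva", "Luiz Costa"],
    "location": ["Rio", "Hannover"],
    "mail": ["ana@example.org", "luiz@example.org"],
}

emi = emi_matrix(shared_value_counts(source, target))
matches = simple_matches(emi)
```

In this toy data, `city` and `email` find 1:1 counterparts, while `firstName` and `lastName` share no whole value with `fullName` and score zero. Properties left unmatched in this way are the kind of candidates that the abstract's Phase 2 would explore for complex matches such as concatenation.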
Author_xml – sequence: 1
  givenname: Bernardo
  surname: Pereira Nunes
  fullname: Pereira Nunes, Bernardo
  email: bnunes@inf.puc-rio.br
  organization: L3S Research Center, Leibniz University Hannover, Hannover, Germany
– sequence: 2
  givenname: Alexander
  surname: Mera
  fullname: Mera, Alexander
  email: acaraballo@inf.puc-rio.br
  organization: Department of Informatics, PUC-Rio - Rio de Janeiro, Brazil
– sequence: 3
  givenname: Marco Antônio
  surname: Casanova
  fullname: Casanova, Marco Antônio
  email: casanova@inf.puc-rio.br
  organization: Department of Informatics, PUC-Rio - Rio de Janeiro, Brazil
– sequence: 4
  givenname: Besnik
  surname: Fetahu
  fullname: Fetahu, Besnik
  email: fetahu@l3s.de
  organization: L3S Research Center, Leibniz University Hannover, Hannover, Germany
– sequence: 5
  givenname: Luiz André
  surname: P. Paes Leme
  fullname: P. Paes Leme, Luiz André
  email: lapaesleme@ic.uff.br
  organization: Computer Science Institute, Fluminense Federal University, Niterói, Brazil
– sequence: 6
  givenname: Stefan
  surname: Dietze
  fullname: Dietze, Stefan
  email: dietze@l3s.de
  organization: L3S Research Center, Leibniz University Hannover, Hannover, Germany
ContentType Book Chapter
Copyright Springer-Verlag Berlin Heidelberg 2013
DOI 10.1007/978-3-642-40285-2_18
Discipline Computer Science
EISBN 9783642402852
3642402852
EISSN 1611-3349
Editor_xml – sequence: 1
  givenname: Hendrik
  surname: Decker
  fullname: Decker, Hendrik
  email: hendrik@iti.es
– sequence: 2
  givenname: Lenka
  surname: Lhotská
  fullname: Lhotská, Lenka
  email: lhotska@fel.cvut.cz
– sequence: 3
  givenname: Sebastian
  surname: Link
  fullname: Link, Sebastian
  email: s.link@auckland.ac.nz
– sequence: 4
  givenname: Josef
  surname: Basl
  fullname: Basl, Josef
  email: basl@vse.cz
– sequence: 5
  givenname: A Min
  surname: Tjoa
  fullname: Tjoa, A Min
  email: amin@ifs.tuwien.ac.at
EndPage 208
ISBN 9783642402845
3642402844
ISSN 0302-9743
IsDoiOpenAccess false
IsOpenAccess true
IsPeerReviewed true
IsScholarly true
OpenAccessLink https://www.repo.uni-hannover.de/bitstream/123456789/1358/1/Pereira%20Nunes%20et%20al%202013%2c%20Postprint%2c%20Complex%20matching%20of%20RDF%20datatype%20properties.pdf
PageCount 14
PublicationSubtitle 24th International Conference, DEXA 2013, Prague, Czech Republic, August 26-29, 2013. Proceedings, Part I
RelatedPersons_xml – sequence: 1
  givenname: David
  surname: Hutchison
  fullname: Hutchison, David
  organization: Lancaster University, Lancaster, UK
– sequence: 2
  givenname: Takeo
  surname: Kanade
  fullname: Kanade, Takeo
  organization: Carnegie Mellon University, Pittsburgh, USA
– sequence: 3
  givenname: Josef
  surname: Kittler
  fullname: Kittler, Josef
  organization: University of Surrey, Guildford, UK
– sequence: 4
  givenname: Jon M.
  surname: Kleinberg
  fullname: Kleinberg, Jon M.
  organization: Cornell University, Ithaca, USA
– sequence: 5
  givenname: Friedemann
  surname: Mattern
  fullname: Mattern, Friedemann
  organization: ETH Zurich, Zurich, Switzerland
– sequence: 6
  givenname: John C.
  surname: Mitchell
  fullname: Mitchell, John C.
  organization: Stanford University, Stanford, USA
– sequence: 7
  givenname: Moni
  surname: Naor
  fullname: Naor, Moni
  organization: Weizmann Institute of Science, Rehovot, Israel
– sequence: 8
  givenname: Oscar
  surname: Nierstrasz
  fullname: Nierstrasz, Oscar
  organization: University of Bern, Bern, Switzerland
– sequence: 9
  givenname: C.
  surname: Pandu Rangan
  fullname: Pandu Rangan, C.
  organization: Indian Institute of Technology, Madras, India
– sequence: 10
  givenname: Bernhard
  surname: Steffen
  fullname: Steffen, Bernhard
  organization: University of Dortmund, Dortmund, Germany
– sequence: 11
  givenname: Madhu
  surname: Sudan
  fullname: Sudan, Madhu
  organization: Massachusetts Institute of Technology, USA
– sequence: 12
  givenname: Demetri
  surname: Terzopoulos
  fullname: Terzopoulos, Demetri
  organization: University of California, Los Angeles, USA
– sequence: 13
  givenname: Doug
  surname: Tygar
  fullname: Tygar, Doug
  organization: University of California, Berkeley, USA
– sequence: 14
  givenname: Moshe Y.
  surname: Vardi
  fullname: Vardi, Moshe Y.
  organization: Rice University, Houston, USA
– sequence: 15
  givenname: Gerhard
  surname: Weikum
  fullname: Weikum, Gerhard
  organization: Max-Planck Institute of Computer Science, Saarbrücken, Germany
StartPage 195
SubjectTerms Genetic Programming
Mutual Information
Ontology Matching
Schema Matching
Title Complex Matching of RDF Datatype Properties
URI http://link.springer.com/10.1007/978-3-642-40285-2_18