Complex Matching of RDF Datatype Properties
Property mapping is a fundamental component of ontology matching, and yet there is little support that goes beyond the identification of single property matches. Real data often requires some degree of composition, trivially exemplified by the mapping of “first name” and “last name” to “full name” o...
Published in | Database and Expert Systems Applications pp. 195 - 208 |
---|---|
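Main Authors | Pereira Nunes, Bernardo; Mera, Alexander; Casanova, Marco Antônio; Fetahu, Besnik; P. Paes Leme, Luiz André; Dietze, Stefan |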
Format | Book Chapter |
Language | English |
Published | Berlin, Heidelberg: Springer Berlin Heidelberg, 2013 |
Series | Lecture Notes in Computer Science |
Subjects | Genetic Programming; Mutual Information; Ontology Matching; Schema Matching |
Online Access | Get full text |
Abstract | Property mapping is a fundamental component of ontology matching, yet there is little support that goes beyond the identification of single property matches. Real data often requires some degree of composition, ranging from trivial cases, such as mapping “first name” and “last name” to “full name”, to complex matchings, such as parsing and pairing symbol/digit strings to SSNs. In this paper, we propose a two-phase instance-based technique for complex datatype property matching. Phase 1 computes the Estimate Mutual Information matrix of the property values to (1) find simple, 1:1 matches, and (2) compute a list of possible complex matches. Phase 2 applies Genetic Programming to the much-reduced search space of candidate matches to find complex matches. We conclude with experimental results that illustrate how the technique works. Furthermore, we show that the proposed technique greatly improves on the results obtained when the Estimate Mutual Information matrix or Genetic Programming is used independently. |
---|---|
Author | Casanova, Marco Antônio; Pereira Nunes, Bernardo; Mera, Alexander; P. Paes Leme, Luiz André; Fetahu, Besnik; Dietze, Stefan |
Author_xml | – sequence: 1 givenname: Bernardo surname: Pereira Nunes fullname: Pereira Nunes, Bernardo email: bnunes@inf.puc-rio.br organization: L3S Research Center, Leibniz University Hannover, Hannover, Germany – sequence: 2 givenname: Alexander surname: Mera fullname: Mera, Alexander email: acaraballo@inf.puc-rio.br organization: Department of Informatics, PUC-Rio - Rio de Janeiro, Brazil – sequence: 3 givenname: Marco Antônio surname: Casanova fullname: Casanova, Marco Antônio email: casanova@inf.puc-rio.br organization: Department of Informatics, PUC-Rio - Rio de Janeiro, Brazil – sequence: 4 givenname: Besnik surname: Fetahu fullname: Fetahu, Besnik email: fetahu@l3s.de organization: L3S Research Center, Leibniz University Hannover, Hannover, Germany – sequence: 5 givenname: Luiz André surname: P. Paes Leme fullname: P. Paes Leme, Luiz André email: lapaesleme@ic.uff.br organization: Computer Science Institute, Fluminense Federal University, Niterói, Brazil – sequence: 6 givenname: Stefan surname: Dietze fullname: Dietze, Stefan email: dietze@l3s.de organization: L3S Research Center, Leibniz University Hannover, Hannover, Germany |
ContentType | Book Chapter |
Copyright | Springer-Verlag Berlin Heidelberg 2013 |
Copyright_xml | – notice: Springer-Verlag Berlin Heidelberg 2013 |
DOI | 10.1007/978-3-642-40285-2_18 |
DatabaseTitleList | |
DeliveryMethod | fulltext_linktorsrc |
Discipline | Computer Science |
EISBN | 9783642402852; 3642402852 |
EISSN | 1611-3349 |
Editor | Lhotská, Lenka; Link, Sebastian; Basl, Josef; Decker, Hendrik; Tjoa, A Min |
Editor_xml | – sequence: 1 givenname: Hendrik surname: Decker fullname: Decker, Hendrik email: hendrik@iti.es – sequence: 2 givenname: Lenka surname: Lhotská fullname: Lhotská, Lenka email: lhotska@fel.cvut.cz – sequence: 3 givenname: Sebastian surname: Link fullname: Link, Sebastian email: s.link@auckland.ac.nz – sequence: 4 givenname: Josef surname: Basl fullname: Basl, Josef email: basl@vse.cz – sequence: 5 givenname: A Min surname: Tjoa fullname: Tjoa, A Min email: amin@ifs.tuwien.ac.at |
EndPage | 208 |
ISBN | 9783642402845; 3642402844 |
ISSN | 0302-9743 |
IngestDate | Wed Nov 06 06:26:24 EST 2024 |
IsDoiOpenAccess | false |
IsOpenAccess | true |
IsPeerReviewed | true |
IsScholarly | true |
Language | English |
LinkModel | OpenURL |
OpenAccessLink | https://www.repo.uni-hannover.de/bitstream/123456789/1358/1/Pereira%20Nunes%20et%20al%202013%2c%20Postprint%2c%20Complex%20matching%20of%20RDF%20datatype%20properties.pdf |
PageCount | 14 |
ParticipantIDs | springer_books_10_1007_978_3_642_40285_2_18 |
PublicationCentury | 2000 |
PublicationDate | 2013 |
PublicationDateYYYYMMDD | 2013-01-01 |
PublicationDate_xml | – year: 2013 text: 2013 |
PublicationDecade | 2010 |
PublicationPlace | Berlin, Heidelberg |
PublicationPlace_xml | – name: Berlin, Heidelberg |
PublicationSeriesTitle | Lecture Notes in Computer Science |
PublicationSubtitle | 24th International Conference, DEXA 2013, Prague, Czech Republic, August 26-29, 2013. Proceedings, Part I |
PublicationTitle | Database and Expert Systems Applications |
PublicationYear | 2013 |
Publisher | Springer Berlin Heidelberg |
Publisher_xml | – name: Springer Berlin Heidelberg |
RelatedPersons | Kleinberg, Jon M.; Mattern, Friedemann; Nierstrasz, Oscar; Steffen, Bernhard; Kittler, Josef; Vardi, Moshe Y.; Weikum, Gerhard; Sudan, Madhu; Naor, Moni; Mitchell, John C.; Terzopoulos, Demetri; Pandu Rangan, C.; Kanade, Takeo; Hutchison, David; Tygar, Doug |
RelatedPersons_xml | – sequence: 1 givenname: David surname: Hutchison fullname: Hutchison, David organization: Lancaster University, Lancaster, UK – sequence: 2 givenname: Takeo surname: Kanade fullname: Kanade, Takeo organization: Carnegie Mellon University, Pittsburgh, USA – sequence: 3 givenname: Josef surname: Kittler fullname: Kittler, Josef organization: University of Surrey, Guildford, UK – sequence: 4 givenname: Jon M. surname: Kleinberg fullname: Kleinberg, Jon M. organization: Cornell University, Ithaca, USA – sequence: 5 givenname: Friedemann surname: Mattern fullname: Mattern, Friedemann organization: ETH Zurich, Zurich, Switzerland – sequence: 6 givenname: John C. surname: Mitchell fullname: Mitchell, John C. organization: Stanford University, Stanford, USA – sequence: 7 givenname: Moni surname: Naor fullname: Naor, Moni organization: Weizmann Institute of Science, Rehovot, Israel – sequence: 8 givenname: Oscar surname: Nierstrasz fullname: Nierstrasz, Oscar organization: University of Bern, Bern, Switzerland – sequence: 9 givenname: C. surname: Pandu Rangan fullname: Pandu Rangan, C. organization: Indian Institute of Technology, Madras, India – sequence: 10 givenname: Bernhard surname: Steffen fullname: Steffen, Bernhard organization: University of Dortmund, Dortmund, Germany – sequence: 11 givenname: Madhu surname: Sudan fullname: Sudan, Madhu organization: Massachusetts Institute of Technology, USA – sequence: 12 givenname: Demetri surname: Terzopoulos fullname: Terzopoulos, Demetri organization: University of California, Los Angeles, USA – sequence: 13 givenname: Doug surname: Tygar fullname: Tygar, Doug organization: University of California, Berkeley, USA – sequence: 14 givenname: Moshe Y. surname: Vardi fullname: Vardi, Moshe Y. organization: Rice University, Houston, USA – sequence: 15 givenname: Gerhard surname: Weikum fullname: Weikum, Gerhard organization: Max-Planck Institute of Computer Science, Saarbrücken, Germany |
SSID | ssj0000988110 ssib024821215 ssj0002792 |
Score | 1.8162469 |
Snippet | Property mapping is a fundamental component of ontology matching, and yet there is little support that goes beyond the identification of single property... |
SourceID | springer |
SourceType | Publisher |
StartPage | 195 |
SubjectTerms | Genetic Programming; Mutual Information; Ontology Matching; Schema Matching |
Title | Complex Matching of RDF Datatype Properties |
URI | http://link.springer.com/10.1007/978-3-642-40285-2_18 |
hasFullText | 1 |
inHoldings | 1 |
isFullTextHit | |
isPrint | |
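The abstract describes Phase 1 as building a mutual information matrix over property values to find simple 1:1 matches. The record does not give the estimator the authors use, so the following is only a minimal sketch under stated assumptions: it uses the standard plug-in (frequency-count) estimate of mutual information, and it assumes the two datasets are already aligned instance by instance. The function names (`mutual_information`, `mi_matrix`, `best_simple_matches`) and the sample properties are hypothetical and chosen for illustration.

```python
import math
from collections import Counter


def mutual_information(xs, ys):
    """Plug-in estimate of mutual information, in bits, between two value lists.

    Assumption: xs[i] and ys[i] describe the same instance.
    """
    if len(xs) != len(ys):
        raise ValueError("value lists must be aligned by instance")
    n = len(xs)
    if n == 0:
        return 0.0
    joint = Counter(zip(xs, ys))  # counts of co-occurring value pairs
    px = Counter(xs)              # marginal counts for the first property
    py = Counter(ys)              # marginal counts for the second property
    mi = 0.0
    for (x, y), c in joint.items():
        # p(x, y) * log2( p(x, y) / (p(x) * p(y)) ), written with raw counts
        mi += (c / n) * math.log2(c * n / (px[x] * py[y]))
    return mi


def mi_matrix(source, target):
    """Build a nested dict: source property -> target property -> estimated MI."""
    return {
        s: {t: mutual_information(sv, tv) for t, tv in target.items()}
        for s, sv in source.items()
    }


def best_simple_matches(matrix):
    """For each source property, pick the target property with the highest MI."""
    return {s: max(row, key=row.get) for s, row in matrix.items()}


# Hypothetical sample data: two datasets describing the same four instances.
source = {
    "firstName": ["Ana", "Bob", "Ana", "Eve"],
    "age": ["30", "30", "41", "41"],
}
target = {
    "givenName": ["Ana", "Bob", "Ana", "Eve"],
    "years": ["30", "30", "41", "41"],
}

matrix = mi_matrix(source, target)
matches = best_simple_matches(matrix)
```

In this sketch each source property is paired with the target property that shares the most information with it. Rows without a clear winner would, in the spirit of the abstract, be the ones handed on as candidates for complex matching in Phase 2; that hand-off is not modelled here.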