Strategies for Efficiently Keeping Local Linked Open Data Caches Up-To-Date

Quite often, Linked Open Data (LOD) applications pre-fetch data from the Web and store local copies of it in a cache for faster access at runtime. Yet, recent investigations have shown that data published and interlinked on the LOD cloud is subject to frequent changes. As the data in the cloud chang...

Full description

Saved in:
Bibliographic Details
Published inThe Semantic Web - ISWC 2015 pp. 356 - 373
Main Authors Dividino, Renata, Gottron, Thomas, Scherp, Ansgar
Format Book Chapter
LanguageEnglish
Published Cham Springer International Publishing 2015
SeriesLecture Notes in Computer Science
Subjects
Online AccessGet full text

Cover

Loading…
Abstract Quite often, Linked Open Data (LOD) applications pre-fetch data from the Web and store local copies of it in a cache for faster access at runtime. Yet, recent investigations have shown that data published and interlinked on the LOD cloud is subject to frequent changes. As the data in the cloud changes, local copies of the data need to be updated. However, due to limitations of the available computational resources (e.g., network bandwidth for fetching data, computation time) LOD applications may not be able to permanently visit all of the LOD sources at brief intervals in order to check for changes. These limitations imply the need to prioritize which data sources should be considered first for retrieving their data and synchronizing the local copy with the original data. In order to make best use of the resources available, it is vital to choose a good scheduling strategy to know when to fetch data of which data source. In this paper, we investigate different strategies proposed in the literature and evaluate them on a large-scale LOD dataset that is obtained from the LOD cloud by weekly crawls over the course of three years. We investigate two different setups: (i) in the single step setup, we evaluate the quality of update strategies for a single and isolated update of a local data cache, while (ii) the iterative progression setup involves measuring the quality of the local data cache when considering iterative updates over a longer period of time. Our evaluation indicates the effectiveness of each strategy for updating local copies of LOD sources, i. e, we demonstrate for given limitations of bandwidth, the strategies’ performance in terms of data accuracy and freshness. The evaluation shows that the measures capturing change behavior of LOD sources over time are most suitable for conducting updates.
AbstractList Quite often, Linked Open Data (LOD) applications pre-fetch data from the Web and store local copies of it in a cache for faster access at runtime. Yet, recent investigations have shown that data published and interlinked on the LOD cloud is subject to frequent changes. As the data in the cloud changes, local copies of the data need to be updated. However, due to limitations of the available computational resources (e.g., network bandwidth for fetching data, computation time) LOD applications may not be able to permanently visit all of the LOD sources at brief intervals in order to check for changes. These limitations imply the need to prioritize which data sources should be considered first for retrieving their data and synchronizing the local copy with the original data. In order to make best use of the resources available, it is vital to choose a good scheduling strategy to know when to fetch data of which data source. In this paper, we investigate different strategies proposed in the literature and evaluate them on a large-scale LOD dataset that is obtained from the LOD cloud by weekly crawls over the course of three years. We investigate two different setups: (i) in the single step setup, we evaluate the quality of update strategies for a single and isolated update of a local data cache, while (ii) the iterative progression setup involves measuring the quality of the local data cache when considering iterative updates over a longer period of time. Our evaluation indicates the effectiveness of each strategy for updating local copies of LOD sources, i. e, we demonstrate for given limitations of bandwidth, the strategies’ performance in terms of data accuracy and freshness. The evaluation shows that the measures capturing change behavior of LOD sources over time are most suitable for conducting updates.
Author Gottron, Thomas
Scherp, Ansgar
Dividino, Renata
Author_xml – sequence: 1
  givenname: Renata
  surname: Dividino
  fullname: Dividino, Renata
  email: dividino@uni-koblenz.de
– sequence: 2
  givenname: Thomas
  surname: Gottron
  fullname: Gottron, Thomas
– sequence: 3
  givenname: Ansgar
  surname: Scherp
  fullname: Scherp, Ansgar
BookMark eNo9kMtOwzAQRQ0UibTwByz8A4bxI068RKU81Ehd0K4tP8YltEqiJBv-nrQgViOd0YzuPXMya9oGCbnn8MABikdTlEwyyQ0TOXBg2gp1QeZyImdQXpKMa86ZlMpc_S_AqBnJQIJgplDyhsyH4QsARGFERtYfY-9G3Nc40NT2dJVSHWpsxuM3XSN2dbOnVRvckVZ1c8BINx029NmNji5d-Jyudh3btmwieEuukzsOePc3F2T3stou31i1eX1fPlVsL3I-Mp1QpzJJz5Vy4BFi4gZyCUolnUAZHlEHX3KV8jKK6L2EiLFIQfkQpJYLIn7_Dl0_5cPe-rY9DJaDPYmykygr7dTenr3Ykyj5AyRTWL4
ContentType Book Chapter
Copyright Springer International Publishing Switzerland 2015
Copyright_xml – notice: Springer International Publishing Switzerland 2015
DOI 10.1007/978-3-319-25010-6_24
DatabaseTitleList
DeliveryMethod fulltext_linktorsrc
Discipline Computer Science
EISBN 3319250108
9783319250106
EISSN 1611-3349
Editor Heflin, Jeff
Thirunarayan, Krishnaprasad
Simperl, Elena
Corcho, Oscar
Staab, Steffen
Srinivas, Kavitha
Groth, Paul
Arenas, Marcelo
Strohmaier, Markus
d'Aquin, Mathieu
Dumontier, Michel
Editor_xml – sequence: 1
  givenname: Marcelo
  surname: Arenas
  fullname: Arenas, Marcelo
  email: marenas@ing.puc.cl
– sequence: 2
  givenname: Oscar
  surname: Corcho
  fullname: Corcho, Oscar
  email: ocorcho@fi.upm.es
– sequence: 3
  givenname: Elena
  surname: Simperl
  fullname: Simperl, Elena
  email: e.simperl@soton.ac.uk
– sequence: 4
  givenname: Markus
  surname: Strohmaier
  fullname: Strohmaier, Markus
  email: markus.strohmaier@gesis.org
– sequence: 5
  givenname: Mathieu
  surname: d'Aquin
  fullname: d'Aquin, Mathieu
  email: mathieu.daquin@open.ac.uk
– sequence: 6
  givenname: Kavitha
  surname: Srinivas
  fullname: Srinivas, Kavitha
  email: ksrinivs@us.ibm.co
– sequence: 7
  givenname: Paul
  surname: Groth
  fullname: Groth, Paul
  email: pgroth@gmail.com
– sequence: 8
  givenname: Michel
  surname: Dumontier
  fullname: Dumontier, Michel
  email: michel.dumontier@gmail.com
– sequence: 9
  givenname: Jeff
  surname: Heflin
  fullname: Heflin, Jeff
  email: heflin@cse.lehigh.edu
– sequence: 10
  givenname: Krishnaprasad
  surname: Thirunarayan
  fullname: Thirunarayan, Krishnaprasad
  email: t.k.prasad@wright.edu
– sequence: 11
  givenname: Steffen
  surname: Staab
  fullname: Staab, Steffen
  email: staab@uni-koblenz.de
EndPage 373
GroupedDBID -DT
-GH
-~X
1SB
29L
2HA
2HV
5QI
875
AASHB
ABMNI
ACGFS
ADCXD
AEFIE
ALMA_UNASSIGNED_HOLDINGS
EJD
F5P
FEDTE
HVGLF
LAS
LDH
P2P
RIG
RNI
RSU
SVGTG
VI1
~02
ID FETCH-LOGICAL-g251t-6fe6f8f3b144a0be0df19053044f6f0491de6cb814f58d2dbb30ded7fc4bcc363
ISBN 3319250094
9783319250090
ISSN 0302-9743
IngestDate Tue Jul 29 20:38:14 EDT 2025
IsPeerReviewed true
IsScholarly true
Language English
LinkModel OpenURL
MergedId FETCHMERGED-LOGICAL-g251t-6fe6f8f3b144a0be0df19053044f6f0491de6cb814f58d2dbb30ded7fc4bcc363
PageCount 18
ParticipantIDs springer_books_10_1007_978_3_319_25010_6_24
PublicationCentury 2000
PublicationDate 2015
PublicationDateYYYYMMDD 2015-01-01
PublicationDate_xml – year: 2015
  text: 2015
PublicationDecade 2010
PublicationPlace Cham
PublicationPlace_xml – name: Cham
PublicationSeriesSubtitle Information Systems and Applications, incl. Internet/Web, and HCI
PublicationSeriesTitle Lecture Notes in Computer Science
PublicationSeriesTitleAlternate Lect.Notes Computer
PublicationSubtitle 14th International Semantic Web Conference, Bethlehem, PA, USA, October 11-15, 2015, Proceedings, Part II
PublicationTitle The Semantic Web - ISWC 2015
PublicationYear 2015
Publisher Springer International Publishing
Publisher_xml – name: Springer International Publishing
RelatedPersons Kleinberg, Jon M.
Mattern, Friedemann
Naor, Moni
Mitchell, John C.
Terzopoulos, Demetri
Steffen, Bernhard
Pandu Rangan, C.
Kanade, Takeo
Kittler, Josef
Weikum, Gerhard
Hutchison, David
Tygar, Doug
RelatedPersons_xml – sequence: 1
  givenname: David
  surname: Hutchison
  fullname: Hutchison, David
– sequence: 2
  givenname: Takeo
  surname: Kanade
  fullname: Kanade, Takeo
– sequence: 3
  givenname: Josef
  surname: Kittler
  fullname: Kittler, Josef
– sequence: 4
  givenname: Jon M.
  surname: Kleinberg
  fullname: Kleinberg, Jon M.
– sequence: 5
  givenname: Friedemann
  surname: Mattern
  fullname: Mattern, Friedemann
– sequence: 6
  givenname: John C.
  surname: Mitchell
  fullname: Mitchell, John C.
– sequence: 7
  givenname: Moni
  surname: Naor
  fullname: Naor, Moni
– sequence: 8
  givenname: C.
  surname: Pandu Rangan
  fullname: Pandu Rangan, C.
– sequence: 9
  givenname: Bernhard
  surname: Steffen
  fullname: Steffen, Bernhard
– sequence: 10
  givenname: Demetri
  surname: Terzopoulos
  fullname: Terzopoulos, Demetri
– sequence: 11
  givenname: Doug
  surname: Tygar
  fullname: Tygar, Doug
– sequence: 12
  givenname: Gerhard
  surname: Weikum
  fullname: Weikum, Gerhard
SSID ssj0002792
ssj0001585540
Score 2.0880075
Snippet Quite often, Linked Open Data (LOD) applications pre-fetch data from the Web and store local copies of it in a cache for faster access at runtime. Yet, recent...
SourceID springer
SourceType Publisher
StartPage 356
SubjectTerms Data Cache
Dynamics Strategy
Link Open Data
Local Copy
Schedule Strategy
Title Strategies for Efficiently Keeping Local Linked Open Data Caches Up-To-Date
URI http://link.springer.com/10.1007/978-3-319-25010-6_24
hasFullText 1
inHoldings 1
isFullTextHit
isPrint
link http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwnV07b9swECYcdyk6NH2hbdKCQzeChRw-JA8ZCiNFmrhZYjfZBFEkAwOpHcTq0A797b0jRUtxsqSLYBCGJd1H34v33RHyKa-QYKkkN945LnM7Bj2oal6NfZX7wqo8MOS-n-njuTy5VJeDwd8-u6Qxn-s_D_JK_gdVWANckSX7CGQ3PwoL8BnwhSsgDNct5_dumnUD8Ln7CZJZ1OzCGcbZt_OLCQNbqzbuKZKtFmG6NghyGXlobT_8BpPgWzVC4UQGULyJtY7rq-q2v6dSK1sXmjjgVOZF4FNe_2anzgXm1XQVUuMQ4YIni9UqsK-aik2wb_SazW_4bMVhJW4nFJJbH07bc4yzVRPKw1gaNZE0Tz81MVJbqYmUmtxKbnb5tTuxrABlAP5YFqeHJk4X6GuIeKIKdFFFa2y8KGKj01btCqV7FlzE4Sj3jEO_HgS5W3A3MEK6PJA7ZCcv1JA8-XJ0Mv3R5egUFvFlG8uOzRbjqVR8KuQKpaeOTSZ7b9HjaT50y3sn78Ghme2SZ0hyocg-AaG9IAO3fEmeJ7nTVu6vyGkHOQXIaQ9y2kJOA-Q0Qk4RcoqQ0wg57SB_TeZfj2aTY97O3-BX4PU2XHunfeGFgaC7yozLrAf3UYlMSq89hJYj63RtipH0qrAH1hiRWWdzX0tT10KLN2S4XC3dW0KtzZWuwbhW4A6Clah0Aa62kq7QY2u0e0dYkkaJ_6h1mdppg-xKUYLsyiC7EmX3_lHf3iNPu825T4bN7S_3ATzJxnxsAf8H22Bnkw
linkProvider Library Specific Holdings
openUrl ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Abook&rft.genre=bookitem&rft.title=The+Semantic+Web+-+ISWC+2015&rft.au=Dividino%2C+Renata&rft.au=Gottron%2C+Thomas&rft.au=Scherp%2C+Ansgar&rft.atitle=Strategies+for+Efficiently+Keeping+Local+Linked+Open+Data+Caches+Up-To-Date&rft.series=Lecture+Notes+in+Computer+Science&rft.date=2015-01-01&rft.pub=Springer+International+Publishing&rft.isbn=9783319250090&rft.issn=0302-9743&rft.eissn=1611-3349&rft.spage=356&rft.epage=373&rft_id=info:doi/10.1007%2F978-3-319-25010-6_24
thumbnail_l http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/lc.gif&issn=0302-9743&client=summon
thumbnail_m http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/mc.gif&issn=0302-9743&client=summon
thumbnail_s http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/sc.gif&issn=0302-9743&client=summon