Strategies for Efficiently Keeping Local Linked Open Data Caches Up-To-Date
Published in | The Semantic Web - ISWC 2015 pp. 356 - 373 |
---|---|
Main Authors | Dividino, Renata; Gottron, Thomas; Scherp, Ansgar |
Format | Book Chapter |
Language | English |
Published | Cham: Springer International Publishing, 2015 |
Series | Lecture Notes in Computer Science |
Subjects | |
Abstract | Quite often, Linked Open Data (LOD) applications pre-fetch data from the Web and store local copies of it in a cache for faster access at runtime. Yet, recent investigations have shown that data published and interlinked on the LOD cloud is subject to frequent changes. As the data in the cloud changes, local copies of the data need to be updated. However, due to limitations of the available computational resources (e.g., network bandwidth for fetching data, computation time), LOD applications may not be able to permanently visit all of the LOD sources at brief intervals in order to check for changes. These limitations imply the need to prioritize which data sources should be considered first for retrieving their data and synchronizing the local copy with the original data. In order to make best use of the resources available, it is vital to choose a good scheduling strategy to know when to fetch data of which data source. In this paper, we investigate different strategies proposed in the literature and evaluate them on a large-scale LOD dataset that is obtained from the LOD cloud by weekly crawls over the course of three years. We investigate two different setups: (i) in the single step setup, we evaluate the quality of update strategies for a single and isolated update of a local data cache, while (ii) the iterative progression setup involves measuring the quality of the local data cache when considering iterative updates over a longer period of time. Our evaluation indicates the effectiveness of each strategy for updating local copies of LOD sources, i.e., we demonstrate, for given limitations of bandwidth, the strategies’ performance in terms of data accuracy and freshness. The evaluation shows that the measures capturing change behavior of LOD sources over time are most suitable for conducting updates. |
---|---|
Author | Gottron, Thomas Scherp, Ansgar Dividino, Renata |
Author_xml | – sequence: 1 givenname: Renata surname: Dividino fullname: Dividino, Renata email: dividino@uni-koblenz.de – sequence: 2 givenname: Thomas surname: Gottron fullname: Gottron, Thomas – sequence: 3 givenname: Ansgar surname: Scherp fullname: Scherp, Ansgar |
ContentType | Book Chapter |
Copyright | Springer International Publishing Switzerland 2015 |
Copyright_xml | – notice: Springer International Publishing Switzerland 2015 |
DOI | 10.1007/978-3-319-25010-6_24 |
DatabaseTitleList | |
DeliveryMethod | fulltext_linktorsrc |
Discipline | Computer Science |
EISBN | 3319250108 9783319250106 |
EISSN | 1611-3349 |
Editor | Heflin, Jeff Thirunarayan, Krishnaprasad Simperl, Elena Corcho, Oscar Staab, Steffen Srinivas, Kavitha Groth, Paul Arenas, Marcelo Strohmaier, Markus d'Aquin, Mathieu Dumontier, Michel |
Editor_xml | – sequence: 1 givenname: Marcelo surname: Arenas fullname: Arenas, Marcelo email: marenas@ing.puc.cl – sequence: 2 givenname: Oscar surname: Corcho fullname: Corcho, Oscar email: ocorcho@fi.upm.es – sequence: 3 givenname: Elena surname: Simperl fullname: Simperl, Elena email: e.simperl@soton.ac.uk – sequence: 4 givenname: Markus surname: Strohmaier fullname: Strohmaier, Markus email: markus.strohmaier@gesis.org – sequence: 5 givenname: Mathieu surname: d'Aquin fullname: d'Aquin, Mathieu email: mathieu.daquin@open.ac.uk – sequence: 6 givenname: Kavitha surname: Srinivas fullname: Srinivas, Kavitha email: ksrinivs@us.ibm.co – sequence: 7 givenname: Paul surname: Groth fullname: Groth, Paul email: pgroth@gmail.com – sequence: 8 givenname: Michel surname: Dumontier fullname: Dumontier, Michel email: michel.dumontier@gmail.com – sequence: 9 givenname: Jeff surname: Heflin fullname: Heflin, Jeff email: heflin@cse.lehigh.edu – sequence: 10 givenname: Krishnaprasad surname: Thirunarayan fullname: Thirunarayan, Krishnaprasad email: t.k.prasad@wright.edu – sequence: 11 givenname: Steffen surname: Staab fullname: Staab, Steffen email: staab@uni-koblenz.de |
EndPage | 373 |
ISBN | 3319250094 9783319250090 |
ISSN | 0302-9743 |
IngestDate | Tue Jul 29 20:38:14 EDT 2025 |
IsPeerReviewed | true |
IsScholarly | true |
Language | English |
LinkModel | OpenURL |
PageCount | 18 |
ParticipantIDs | springer_books_10_1007_978_3_319_25010_6_24 |
PublicationCentury | 2000 |
PublicationDate | 2015 |
PublicationDateYYYYMMDD | 2015-01-01 |
PublicationDate_xml | – year: 2015 text: 2015 |
PublicationDecade | 2010 |
PublicationPlace | Cham |
PublicationPlace_xml | – name: Cham |
PublicationSeriesSubtitle | Information Systems and Applications, incl. Internet/Web, and HCI |
PublicationSeriesTitle | Lecture Notes in Computer Science |
PublicationSeriesTitleAlternate | Lect.Notes Computer |
PublicationSubtitle | 14th International Semantic Web Conference, Bethlehem, PA, USA, October 11-15, 2015, Proceedings, Part II |
PublicationTitle | The Semantic Web - ISWC 2015 |
PublicationYear | 2015 |
Publisher | Springer International Publishing |
Publisher_xml | – name: Springer International Publishing |
RelatedPersons | Kleinberg, Jon M. Mattern, Friedemann Naor, Moni Mitchell, John C. Terzopoulos, Demetri Steffen, Bernhard Pandu Rangan, C. Kanade, Takeo Kittler, Josef Weikum, Gerhard Hutchison, David Tygar, Doug |
RelatedPersons_xml | – sequence: 1 givenname: David surname: Hutchison fullname: Hutchison, David – sequence: 2 givenname: Takeo surname: Kanade fullname: Kanade, Takeo – sequence: 3 givenname: Josef surname: Kittler fullname: Kittler, Josef – sequence: 4 givenname: Jon M. surname: Kleinberg fullname: Kleinberg, Jon M. – sequence: 5 givenname: Friedemann surname: Mattern fullname: Mattern, Friedemann – sequence: 6 givenname: John C. surname: Mitchell fullname: Mitchell, John C. – sequence: 7 givenname: Moni surname: Naor fullname: Naor, Moni – sequence: 8 givenname: C. surname: Pandu Rangan fullname: Pandu Rangan, C. – sequence: 9 givenname: Bernhard surname: Steffen fullname: Steffen, Bernhard – sequence: 10 givenname: Demetri surname: Terzopoulos fullname: Terzopoulos, Demetri – sequence: 11 givenname: Doug surname: Tygar fullname: Tygar, Doug – sequence: 12 givenname: Gerhard surname: Weikum fullname: Weikum, Gerhard |
SSID | ssj0002792 ssj0001585540 |
Score | 2.0880075 |
SourceID | springer |
SourceType | Publisher |
StartPage | 356 |
SubjectTerms | Data Cache Dynamics Strategy Link Open Data Local Copy Schedule Strategy |
Title | Strategies for Efficiently Keeping Local Linked Open Data Caches Up-To-Date |
URI | http://link.springer.com/10.1007/978-3-319-25010-6_24 |
hasFullText | 1 |
inHoldings | 1 |
isFullTextHit | |
isPrint | |
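
The scheduling problem described in the abstract (choosing which sources to refresh under a limited bandwidth budget) can be illustrated with a minimal sketch. This is not the authors' method: the `Source` class, the `change_score` field, and the greedy selection rule are all hypothetical, chosen only to show how a change-based priority might drive a single update step. The budget and the source sizes are assumed to be measured in the same unit, for example the number of triples fetched.

```python
from dataclasses import dataclass


@dataclass
class Source:
    name: str            # hypothetical identifier of a LOD source
    size: int            # assumed cost of fetching the source (e.g. triples)
    change_score: float  # hypothetical estimate of how much the source changes


def schedule_updates(sources, bandwidth):
    """Greedily select sources by descending change score within the budget."""
    selected = []
    remaining = bandwidth
    for src in sorted(sources, key=lambda s: s.change_score, reverse=True):
        # Skip sources that no longer fit into the remaining budget.
        if src.size <= remaining:
            selected.append(src.name)
            remaining -= src.size
    return selected


# Illustrative data only; none of these values come from the paper.
sources = [
    Source("a", size=500, change_score=0.9),
    Source("b", size=800, change_score=0.1),
    Source("c", size=300, change_score=0.6),
    Source("d", size=400, change_score=0.4),
]
plan = schedule_updates(sources, bandwidth=1000)
print(plan)
```

Calling `schedule_updates` once corresponds roughly to the single step setup; the iterative progression setup would repeat the call over successive points in time, recomputing the scores from the observed changes before each round.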