Data cleaning approaches in Web2.0 VGI application

Bibliographic Details
Published in 2009 17th International Conference on Geoinformatics, pp. 1-4
Main Authors Xinlin Qian, Liping Di, Deren Li, Pingxiang Li, Lite Shi, Liefei Cai
Format Conference Proceeding
Language English
Published IEEE 01.08.2009
ISBN 1424445620, 9781424445622
ISSN 2161-024X
DOI 10.1109/GEOINFORMATICS.2009.5293442

Abstract As the Web provides a more flexible and sophisticated platform for disseminating and exchanging information, the idea of allowing users to add and upload geospatial data on GIS-enabled websites, in order to promote public sharing of geographic information, is becoming a new research topic in GIS. Some real-world websites, such as wikimapia.org and openstreetmap.org, already partially implement the idea of volunteered geographic information (VGI). Given the new characteristics of VGI data, however, new methods are needed, especially data cleaning approaches, to turn data generated by non-professional users into usable information. VGI data have the following features: because general users can add and change data, the stored data are updated frequently, resulting in an abundant and continually updated geographic dataset; the data are acquired by non-professionals with non-professional equipment, so their quality is not guaranteed; and because contributions are spontaneous, data density is unpredictable and the data distribution is uneven and inconsistent. These characteristics pose challenges for VGI data management, and new data cleaning methods must be used to solve problems such as data clutter. Data cleaning comprises merge, delete, and correct operations on geographic features to bring them into an effective and consistent form. In general, merge and delete operations can be performed automatically, whereas the correct operation requires some human interaction. Geographic data cleaning tasks, such as finding duplicated points and merging repeated descriptions, must take into account both spatial and attribute information. On this basis, we discuss several data cleaning methods for VGI data, their experimental results, and some possible application environments for these methods.
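The abstract does not specify how duplicated points are found, so the following is only an illustrative sketch of the general idea that both spatial and attribute information are considered before two point features are merged. It is not the authors' method. The record layout (`name`, `lat`, `lon`), the function names, and the thresholds (50 m, 0.8 name similarity) are all assumptions chosen for illustration.

```python
import math
from difflib import SequenceMatcher


def haversine_m(lat1, lon1, lat2, lon2):
    """Great-circle distance in metres between two WGS84 coordinates."""
    radius_m = 6371000.0  # mean Earth radius
    phi1, phi2 = math.radians(lat1), math.radians(lat2)
    dphi = phi2 - phi1
    dlmb = math.radians(lon2 - lon1)
    a = math.sin(dphi / 2) ** 2 + math.cos(phi1) * math.cos(phi2) * math.sin(dlmb / 2) ** 2
    return 2 * radius_m * math.asin(math.sqrt(a))


def name_similarity(a, b):
    """Similarity ratio in [0, 1] between two names, ignoring case and outer spaces."""
    return SequenceMatcher(None, a.strip().lower(), b.strip().lower()).ratio()


def is_duplicate(p, q, max_dist_m=50.0, min_name_sim=0.8):
    """Treat two points as duplicates only if BOTH tests pass:
    spatial (close together) and attribute (similar names).
    The thresholds are hypothetical and would need tuning on real data."""
    close = haversine_m(p["lat"], p["lon"], q["lat"], q["lon"]) <= max_dist_m
    similar = name_similarity(p["name"], q["name"]) >= min_name_sim
    return close and similar


def group_duplicates(points, max_dist_m=50.0, min_name_sim=0.8):
    """Greedy grouping: each point joins the first group whose first member
    (the representative) it duplicates, otherwise it starts a new group."""
    groups = []
    for p in points:
        for group in groups:
            if is_duplicate(p, group[0], max_dist_m, min_name_sim):
                group.append(p)
                break
        else:
            groups.append([p])
    return groups
```

Each returned group could then be merged into a single feature automatically, while groups that pass only one of the two tests could be queued for human correction. The greedy pass compares every point with every group representative, which is quadratic in the worst case; a spatial index would be the usual remedy for large datasets.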
Author Affiliations
Xinlin Qian, Center for Spatial Inf. Sci. & Syst., George Mason Univ., Greenbelt, MD, USA
Liping Di, Center for Spatial Inf. Sci. & Syst., George Mason Univ., Greenbelt, MD, USA
Deren Li, State Key Lab. of Inf. Eng. in Surveying, Mapping & Remote Sensing, Wuhan Univ., Wuhan, China
Pingxiang Li, State Key Lab. of Inf. Eng. in Surveying, Mapping & Remote Sensing, Wuhan Univ., Wuhan, China
Lite Shi, State Key Lab. of Inf. Eng. in Surveying, Mapping & Remote Sensing, Wuhan Univ., Wuhan, China
Liefei Cai, State Key Lab. of Inf. Eng. in Surveying, Mapping & Remote Sensing, Wuhan Univ., Wuhan, China
EISBN 9781424445639, 1424445639
LCCN 2009903980
Subject Terms Character generation, data cleaning, data correction, Data engineering, Geographic Information Systems, Government, Green cleaning, Humans, Information science, Laboratories, Merging, Remote sensing, user generated geographic data, VGI, WebGIS
URI https://ieeexplore.ieee.org/document/5293442