Crawling and Detecting Community Structure in Online Social Networks Using Local Information

As Online Social Networks (OSNs) become an intensive subject of research for example in computer science, networking, social sciences etc., a growing need for valid and useful datasets is present. The time taken to crawl the network is however introducing a bias which should be minimized. Usual ways...

Full description

Saved in:
Bibliographic Details
Published inNETWORKING 2012 pp. 56 - 67
Main Authors Blenn, Norbert, Doerr, Christian, Van Kester, Bas, Van Mieghem, Piet
Format Book Chapter
LanguageEnglish
Published Berlin, Heidelberg Springer Berlin Heidelberg 2012
SeriesLecture Notes in Computer Science
Subjects
Online AccessGet full text

Cover

Loading…
Abstract As Online Social Networks (OSNs) become an intensive subject of research for example in computer science, networking, social sciences etc., a growing need for valid and useful datasets is present. The time taken to crawl the network is however introducing a bias which should be minimized. Usual ways of addressing this problem are sampling based on the nodes (users) ids in the network or crawling the network until one “feels” a sufficient amount of data has been obtained. In this paper we introduce a new way of directing the crawling procedure to selectively obtain communities of the network. Thus, a researcher is able to obtain those users belonging to the same community and rapidly begin with the evaluation. As all users involved in the same community are crawled first, the bias introduced by the time taken to crawl the network and the evolution of the network itself is less. Our presented technique is also detecting communities during runtime. We compare our method called Mutual Friend Crawling (MFC) to the standard methods Breadth First Search (BFS) and Depth First Search (DFS) and different community detection algorithms. The presented results are very promising as our method takes only linear runtime but is detecting equal structures as modularity based community detection algorithms.
AbstractList As Online Social Networks (OSNs) become an intensive subject of research for example in computer science, networking, social sciences etc., a growing need for valid and useful datasets is present. The time taken to crawl the network is however introducing a bias which should be minimized. Usual ways of addressing this problem are sampling based on the nodes (users) ids in the network or crawling the network until one “feels” a sufficient amount of data has been obtained. In this paper we introduce a new way of directing the crawling procedure to selectively obtain communities of the network. Thus, a researcher is able to obtain those users belonging to the same community and rapidly begin with the evaluation. As all users involved in the same community are crawled first, the bias introduced by the time taken to crawl the network and the evolution of the network itself is less. Our presented technique is also detecting communities during runtime. We compare our method called Mutual Friend Crawling (MFC) to the standard methods Breadth First Search (BFS) and Depth First Search (DFS) and different community detection algorithms. The presented results are very promising as our method takes only linear runtime but is detecting equal structures as modularity based community detection algorithms.
Author Van Kester, Bas
Blenn, Norbert
Van Mieghem, Piet
Doerr, Christian
Author_xml – sequence: 1
  givenname: Norbert
  surname: Blenn
  fullname: Blenn, Norbert
  email: N.Blenn@tudelft.nl
  organization: Department of Telecommunication, TU Delft, Delft, The Netherlands
– sequence: 2
  givenname: Christian
  surname: Doerr
  fullname: Doerr, Christian
  email: C.Doerr@tudelft.nl
  organization: Department of Telecommunication, TU Delft, Delft, The Netherlands
– sequence: 3
  givenname: Bas
  surname: Van Kester
  fullname: Van Kester, Bas
  email: S.vanKester@student.tudelft.nl
  organization: Department of Telecommunication, TU Delft, Delft, The Netherlands
– sequence: 4
  givenname: Piet
  surname: Van Mieghem
  fullname: Van Mieghem, Piet
  email: P.F.A.VanMieghem@tudelft.nl
  organization: Department of Telecommunication, TU Delft, Delft, The Netherlands
BookMark eNpFkMlOwzAQhg0UibT0Cbj4BQx2vCQ-orBVquih9IZkOWYCoa2NHFcVb49bkJg5jP7ZNPON0cgHDwhdMXrNKK1udFUTTpQoCadUSCKNPEFjnhNHrU5RwRRjhHOhz_4Loh6hgnJaEl0JfoGmw_BJsymlpNAFem2i3W96_46tf8N3kMClg2rCdrvzffrGyxR3Lu0i4N7jhc-9gJfB9XaDnyHtQ1wPeDUcZubB5eTMdyFubeqDv0Tnnd0MMP2LE7R6uH9pnsh88Thrbufkg2kmibbW1dAxaFmplHWy7KxuZS2ZLYUVbedEK2pXQqUFONq1tQBLJQihFNj85wSx373DV8yHQDRtCOvBMGoO7ExmZ7jJRMyRlcnOfwB7VGCL
ContentType Book Chapter
Copyright IFIP International Federation for Information Processing 2012
Copyright_xml – notice: IFIP International Federation for Information Processing 2012
DOI 10.1007/978-3-642-30045-5_5
DatabaseTitleList
DeliveryMethod fulltext_linktorsrc
Discipline Engineering
Computer Science
EISBN 3642300456
9783642300455
EISSN 1611-3349
Editor Widmer, Joerg
Bestak, Robert
Yin, Hao
Li, Li Erran
Kencl, Lukas
Editor_xml – sequence: 1
  givenname: Robert
  surname: Bestak
  fullname: Bestak, Robert
  email: robert.bestak@fel.cvut.cz
– sequence: 2
  givenname: Lukas
  surname: Kencl
  fullname: Kencl, Lukas
  email: lukas.kencl@fel.cvut.cz
– sequence: 3
  givenname: Li Erran
  surname: Li
  fullname: Li, Li Erran
  email: erranlli@research.bell-labs.com
– sequence: 4
  givenname: Joerg
  surname: Widmer
  fullname: Widmer, Joerg
  email: joerg.widmer@imdea.org
– sequence: 5
  givenname: Hao
  surname: Yin
  fullname: Yin, Hao
  email: h-yin@mail.cs.tsinghua.edu.cn
EndPage 67
GroupedDBID -DT
-GH
-~X
1SB
29L
2HA
2HV
5QI
875
AASHB
ABMNI
ACGFS
ADCXD
AEFIE
ALMA_UNASSIGNED_HOLDINGS
EJD
F5P
FEDTE
HVGLF
LAS
LDH
P2P
RIG
RNI
RSU
SVGTG
VI1
~02
ID FETCH-LOGICAL-h1915-9aac8ef1eb1266ac52fa9b5851a24a4bfc4b48c2e794ec0fb84ea05e4466ea423
ISBN 3642300448
9783642300448
ISSN 0302-9743
IngestDate Tue Jul 29 20:12:10 EDT 2025
IsDoiOpenAccess false
IsOpenAccess true
IsPeerReviewed false
IsScholarly false
Language English
LinkModel OpenURL
MergedId FETCHMERGED-LOGICAL-h1915-9aac8ef1eb1266ac52fa9b5851a24a4bfc4b48c2e794ec0fb84ea05e4466ea423
OpenAccessLink https://inria.hal.science/hal-01531140
PageCount 12
ParticipantIDs springer_books_10_1007_978_3_642_30045_5_5
PublicationCentury 2000
PublicationDate 2012
PublicationDateYYYYMMDD 2012-01-01
PublicationDate_xml – year: 2012
  text: 2012
PublicationDecade 2010
PublicationPlace Berlin, Heidelberg
PublicationPlace_xml – name: Berlin, Heidelberg
PublicationSeriesTitle Lecture Notes in Computer Science
PublicationSubtitle 11th International IFIP TC 6 Networking Conference, Prague, Czech Republic, May 21-25, 2012, Proceedings, Part I
PublicationTitle NETWORKING 2012
PublicationYear 2012
Publisher Springer Berlin Heidelberg
Publisher_xml – name: Springer Berlin Heidelberg
RelatedPersons Kleinberg, Jon M.
Mattern, Friedemann
Nierstrasz, Oscar
Steffen, Bernhard
Kittler, Josef
Vardi, Moshe Y.
Weikum, Gerhard
Sudan, Madhu
Naor, Moni
Mitchell, John C.
Terzopoulos, Demetri
Pandu Rangan, C.
Kanade, Takeo
Hutchison, David
Tygar, Doug
RelatedPersons_xml – sequence: 1
  givenname: David
  surname: Hutchison
  fullname: Hutchison, David
  organization: Lancaster University, Lancaster, UK
– sequence: 2
  givenname: Takeo
  surname: Kanade
  fullname: Kanade, Takeo
  organization: Carnegie Mellon University, Pittsburgh, USA
– sequence: 3
  givenname: Josef
  surname: Kittler
  fullname: Kittler, Josef
  organization: University of Surrey, Guildford, UK
– sequence: 4
  givenname: Jon M.
  surname: Kleinberg
  fullname: Kleinberg, Jon M.
  organization: Cornell University, Ithaca, USA
– sequence: 5
  givenname: Friedemann
  surname: Mattern
  fullname: Mattern, Friedemann
  organization: ETH Zurich, Zurich, Switzerland
– sequence: 6
  givenname: John C.
  surname: Mitchell
  fullname: Mitchell, John C.
  organization: Stanford University, Stanford, USA
– sequence: 7
  givenname: Moni
  surname: Naor
  fullname: Naor, Moni
  organization: Weizmann Institute of Science, Rehovot, Israel
– sequence: 8
  givenname: Oscar
  surname: Nierstrasz
  fullname: Nierstrasz, Oscar
  organization: University of Bern, Bern, Switzerland
– sequence: 9
  givenname: C.
  surname: Pandu Rangan
  fullname: Pandu Rangan, C.
  organization: Indian Institute of Technology, Madras, India
– sequence: 10
  givenname: Bernhard
  surname: Steffen
  fullname: Steffen, Bernhard
  organization: University of Dortmund, Dortmund, Germany
– sequence: 11
  givenname: Madhu
  surname: Sudan
  fullname: Sudan, Madhu
  organization: Massachusetts Institute of Technology, USA
– sequence: 12
  givenname: Demetri
  surname: Terzopoulos
  fullname: Terzopoulos, Demetri
  organization: University of California, Los Angeles, USA
– sequence: 13
  givenname: Doug
  surname: Tygar
  fullname: Tygar, Doug
  organization: University of California, Berkeley, USA
– sequence: 14
  givenname: Moshe Y.
  surname: Vardi
  fullname: Vardi, Moshe Y.
  organization: Rice University, Houston, USA
– sequence: 15
  givenname: Gerhard
  surname: Weikum
  fullname: Weikum, Gerhard
  organization: Max-Planck Institute of Computer Science, Saarbrücken, Germany
SSID ssj0000666549
ssj0002792
Score 1.4069551
Snippet As Online Social Networks (OSNs) become an intensive subject of research for example in computer science, networking, social sciences etc., a growing need for...
SourceID springer
SourceType Publisher
StartPage 56
SubjectTerms Community Detection
Crawling
Social Networks
Title Crawling and Detecting Community Structure in Online Social Networks Using Local Information
URI http://link.springer.com/10.1007/978-3-642-30045-5_5
hasFullText 1
inHoldings 1
isFullTextHit
isPrint
link http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwnV1Lb9NAEF6l4QIcgAKC8tAeOGEZOe6u6xw4UAiq2jQgSEsPSNZ6M0sjVY6UBCHxN_jDzOwrS4uQiiJZiZ3E1uy3M7Pz-JaxF62pZjj_Bjmo2uRC0JRSYpgbM9OyNaKEGcUhjyfVwYk4PJNnvd6vpGrp-7p9pX_-ta_kf0YVz-G4UpfsNUY2_imewPc4vnjEEcbjJef3zzCr65gdTb98-HRE0SY0r7FIeB_NSOczMlQ0HT3VBSyXGzqBBBWnOMWP4g4d-2qVXjiew7dzsKD5OA_N0p7YYKl-XIQex3dA2QhfREAtJ-jcf7bctJShmHeZ4zTNfDvwxFWfrzJXsjAmi5r51qgIFZIhrF6PfZpjsljb6rEs7EQRFFMaubAlIGnkIkQus38Qe9kmE_T4KO-cqGnUTGWOKyGnGsGp7ooIGXcdAapXx7JKDLvb9uOKyUirRPBWOd1L5rKRW2xrr5Z9duPN6HB8GgN3tOCzqVFv7omB0aWq3CNRA1F8ZEfxtPkcea8ctfGlO17JxlsnZ3qX3abGF04dKSjce6wH3Ta7E4TNvbC32a2Ew_I--xpgwBEGPMKARxjwCAM-77iDAXcw4AEG3MKAWxjwBAYP2Mn70fTtQe637MjPceEvcZ4rXYMZoAeAnp_SsjRq2FLqWZVCidZo0Ypal4BmAHRh2lqAKiRQVQEolNJD1u8WHTxi3FStweWJxuuFgLIaFvijshBmoIdqr5g9Zi-DsBqahKsmMHCjZJvdBiXbWMk2-Nq5zpefsJsbuD5lfZQSPEPXc90-92D4DYWVfG4
linkProvider Library Specific Holdings
openUrl ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Abook&rft.genre=bookitem&rft.title=NETWORKING+2012&rft.au=Blenn%2C+Norbert&rft.au=Doerr%2C+Christian&rft.au=Van+Kester%2C+Bas&rft.au=Van+Mieghem%2C+Piet&rft.atitle=Crawling+and+Detecting+Community+Structure+in+Online+Social+Networks+Using+Local+Information&rft.series=Lecture+Notes+in+Computer+Science&rft.date=2012-01-01&rft.pub=Springer+Berlin+Heidelberg&rft.isbn=9783642300448&rft.issn=0302-9743&rft.eissn=1611-3349&rft.spage=56&rft.epage=67&rft_id=info:doi/10.1007%2F978-3-642-30045-5_5
thumbnail_l http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/lc.gif&issn=0302-9743&client=summon
thumbnail_m http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/mc.gif&issn=0302-9743&client=summon
thumbnail_s http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/sc.gif&issn=0302-9743&client=summon