Faster Random Walks by Rewiring Online Social Networks On-the-Fly

Many online social networks feature restrictive web interfaces that only allow the query of a user’s local neighborhood. To enable analytics over such an online social network through its web interface, many recent efforts use Markov Chain Monte Carlo (MCMC) methods such as random walks to sample us...

Full description

Saved in:
Bibliographic Details
Published inACM transactions on database systems Vol. 40; no. 4; pp. 1 - 36
Main Authors Zhou, Zhuojie, Zhang, Nan, Gong, Zhiguo, Das, Gautam
Format Journal Article
LanguageEnglish
Published 01.01.2016
Subjects
Online AccessGet full text
ISSN0362-5915
1557-4644
DOI10.1145/2847526

Cover

Abstract Many online social networks feature restrictive web interfaces that only allow the query of a user’s local neighborhood. To enable analytics over such an online social network through its web interface, many recent efforts use Markov Chain Monte Carlo (MCMC) methods such as random walks to sample users in the social network and thereby support analytics based on the samples. The problem with such an approach, however, is the large amount of queries often required for a random walk to converge to a desired (stationary) sampling distribution. In this article, we consider a novel problem of enabling a faster random walk over online social networks by “rewiring” the social network on-the-fly. Specifically, we develop a Modified TOpology Sampling (MTO-Sampling) scheme that, by using only information exposed by the restrictive web interface, constructs a “virtual” random-walk-friendly overlay topology of the social network while performing a random walk and ensures that the random walk follows the modified overlay topology rather than the original one. We describe in this article instantiations of MTO-Sampling for various types of random walks, such as Simple Random Walk (MTO-SRW), Metropolis-Hastings Random Walk (MTO-MHRW), and General Random Walk (MTO-GRW). We not only rigidly prove that MTO-Sampling improves the efficiency of sampling, but we also demonstrate the significance of such improvement through experiments on real-world online social networks such as Google Plus, Epinion, Facebook, etc.
AbstractList Many online social networks feature restrictive web interfaces that only allow the query of a user's local neighborhood. To enable analytics over such an online social network through its web interface, many recent efforts use Markov Chain Monte Carlo (MCMC) methods such as random walks to sample users in the social network and thereby support analytics based on the samples. The problem with such an approach, however, is the large amount of queries often required for a random walk to converge to a desired (stationary) sampling distribution. In this article, we consider a novel problem of enabling a faster random walk over online social networks by "rewiring" the social network on-the-fly. Specifically, we develop a Modified TOpology Sampling (MTO-Sampling) scheme that, by using only information exposed by the restrictive web interface, constructs a "virtual" random-walk-friendly overlay topology of the social network while performing a random walk and ensures that the random walk follows the modified overlay topology rather than the original one. We describe in this article instantiations of MTO-Sampling for various types of random walks, such as Simple Random Walk (MTO-SRW), Metropolis-Hastings Random Walk (MTO-MHRW), and General Random Walk (MTO-GRW). We not only rigidly prove that MTO-Sampling improves the efficiency of sampling, but we also demonstrate the significance of such improvement through experiments on real-world online social networks such as Google Plus, Epinion, Facebook, etc.
Author Zhou, Zhuojie
Gong, Zhiguo
Das, Gautam
Zhang, Nan
Author_xml – sequence: 1
  givenname: Zhuojie
  orcidid: 0000-0002-3312-7732
  surname: Zhou
  fullname: Zhou, Zhuojie
  organization: George Washington University, Washington, DC
– sequence: 2
  givenname: Nan
  surname: Zhang
  fullname: Zhang, Nan
  organization: George Washington University, Washington, DC
– sequence: 3
  givenname: Zhiguo
  surname: Gong
  fullname: Gong, Zhiguo
  organization: University of Macau
– sequence: 4
  givenname: Gautam
  surname: Das
  fullname: Das, Gautam
  organization: University of Texas at Arlington, Arlington, TX
BookMark eNpl0E1LAzEYBOAgFWyr-Bf2ppdospuP3WMpVoVioSoelzT7RqNpUpOU0n_vSnvS01weBmZGaOCDB4QuKbmhlPHbsmaSl-IEDSnnEjPB2AANSSVKzBvKz9AopU9CCKsbOUSTmUoZYrFUvgvr4k25r1Ss9sUSdjZa_14svLMeiuegrXLFE-RdiD1ZeJw_AM_c_hydGuUSXBxzjF5ndy_TBzxf3D9OJ3OsKyEzrqGrZFetDDW0AmBUMd3ohnRaGlIpRTpTCg6yMSsQDe8UMSvZECZAq1p0UI3R9aF3E8P3FlJu1zZpcE55CNvU0prU_ShJaE-vDlTHkFIE026iXau4bylpf09qjyf1Ev-R2maVbfA5Kuv--R_B9Wlf
CitedBy_id crossref_primary_10_1007_s13278_018_0520_3
crossref_primary_10_1007_s10115_022_01691_8
crossref_primary_10_1145_3561388
crossref_primary_10_1007_s41109_019_0201_9
crossref_primary_10_1007_s42486_019_00021_2
crossref_primary_10_1145_3524105
crossref_primary_10_1145_3299877
crossref_primary_10_1093_comnet_cny032
crossref_primary_10_1109_TKDE_2021_3126906
crossref_primary_10_1109_TNNLS_2021_3083318
crossref_primary_10_1145_3322205_3311086
crossref_primary_10_1016_j_knosys_2020_105891
Cites_doi 10.1145/1879141.1879191
10.1145/1117454.1117459
10.1063/1.1699114
10.1016/j.laa.2006.07.018
10.1145/2000172.2000178
10.1145/2254756.2254795
10.5555/1833515.1833840
10.1145/1117454.1117457
10.1080/15427951.2009.10129177
10.1214/09-AOS735
10.1145/1378533.1378557
10.1093/biomet/57.1.97
10.1109/ICDE.2013.6544873
10.1007/BF02579166
10.1007/978-3-540-39718-2_23
10.1145/1150402.1150479
10.1137/S0036144503423264
10.1214/ss/1177011137
10.1145/1963405.1963489
10.1145/1993744.1993773
10.1145/1772690.1772778
ContentType Journal Article
DBID AAYXX
CITATION
7SC
8FD
JQ2
L7M
L~C
L~D
DOI 10.1145/2847526
DatabaseName CrossRef
Computer and Information Systems Abstracts
Technology Research Database
ProQuest Computer Science Collection
Advanced Technologies Database with Aerospace
Computer and Information Systems Abstracts – Academic
Computer and Information Systems Abstracts Professional
DatabaseTitle CrossRef
Computer and Information Systems Abstracts
Technology Research Database
Computer and Information Systems Abstracts – Academic
Advanced Technologies Database with Aerospace
ProQuest Computer Science Collection
Computer and Information Systems Abstracts Professional
DatabaseTitleList Computer and Information Systems Abstracts
CrossRef
DeliveryMethod fulltext_linktorsrc
Discipline Sciences (General)
Computer Science
EISSN 1557-4644
EndPage 36
ExternalDocumentID 10_1145_2847526
GroupedDBID --Z
-DZ
-~X
.DC
23M
4.4
5GY
5VS
6J9
8US
8VB
AAKMM
AALFJ
AAYFX
AAYXX
ABPPZ
ACGFO
ACGOD
ACM
ADBCU
ADL
ADMLS
AEBYY
AEFXT
AEGXH
AEJOY
AEMOZ
AENEX
AENSD
AETEA
AFWIH
AFWXC
AHQJS
AIAGR
AIKLT
AKRVB
AKVCP
ALMA_UNASSIGNED_HOLDINGS
ASPBG
AVWKF
BDXCO
CCLIF
CITATION
CS3
D0L
EBS
EJD
FEDTE
GUFHI
HGAVV
H~9
I07
IAO
ICD
IEA
IGS
IOF
K1G
LHSKQ
N95
P1C
P2P
PQQKQ
QWB
RNS
ROL
RXW
TAE
TH9
U5U
UPT
WH7
X6Y
XH6
XSW
ZCA
ZL0
7SC
8FD
JQ2
L7M
L~C
L~D
ID FETCH-LOGICAL-c367t-8ed37d3bf1f13ee41a4c9c90dc7f03aa0df265e79fbe695da0fb79046eca86de3
ISSN 0362-5915
IngestDate Fri Jul 11 08:05:36 EDT 2025
Thu Jul 03 08:16:16 EDT 2025
Thu Apr 24 22:59:24 EDT 2025
IsPeerReviewed true
IsScholarly true
Issue 4
Language English
LinkModel OpenURL
MergedId FETCHMERGED-LOGICAL-c367t-8ed37d3bf1f13ee41a4c9c90dc7f03aa0df265e79fbe695da0fb79046eca86de3
Notes ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 23
ORCID 0000-0002-3312-7732
PQID 1808048701
PQPubID 23500
PageCount 36
ParticipantIDs proquest_miscellaneous_1808048701
crossref_primary_10_1145_2847526
crossref_citationtrail_10_1145_2847526
PublicationCentury 2000
PublicationDate 2016-01-01
PublicationDateYYYYMMDD 2016-01-01
PublicationDate_xml – month: 01
  year: 2016
  text: 2016-01-01
  day: 01
PublicationDecade 2010
PublicationTitle ACM transactions on database systems
PublicationYear 2016
References Chung Fan (e_1_2_1_8_1)
Chung Fan R. K. (e_1_2_1_9_1) 1996
John (e_1_2_1_14_1) 1954
e_1_2_1_20_1
e_1_2_1_23_1
Lovász L. (e_1_2_1_22_1) 1993; 2
e_1_2_1_24_1
e_1_2_1_21_1
e_1_2_1_27_1
e_1_2_1_25_1
e_1_2_1_26_1
e_1_2_1_29_1
Geweke John (e_1_2_1_11_1)
Sarkar Purnamrita (e_1_2_1_28_1) 2010
e_1_2_1_7_1
e_1_2_1_31_1
e_1_2_1_30_1
e_1_2_1_5_1
e_1_2_1_6_1
e_1_2_1_3_1
e_1_2_1_12_1
e_1_2_1_4_1
e_1_2_1_13_1
e_1_2_1_1_1
e_1_2_1_10_1
e_1_2_1_2_1
e_1_2_1_16_1
e_1_2_1_17_1
e_1_2_1_15_1
e_1_2_1_18_1
e_1_2_1_19_1
References_xml – ident: e_1_2_1_24_1
  doi: 10.1145/1879141.1879191
– ident: e_1_2_1_29_1
  doi: 10.1145/1117454.1117459
– ident: e_1_2_1_26_1
  doi: 10.1063/1.1699114
– ident: e_1_2_1_7_1
  doi: 10.1016/j.laa.2006.07.018
– volume-title: Bayesian Statistics
  ident: e_1_2_1_11_1
– ident: e_1_2_1_16_1
  doi: 10.1145/2000172.2000178
– volume-title: Hammersley and K William Morton
  year: 1954
  ident: e_1_2_1_14_1
– ident: e_1_2_1_19_1
  doi: 10.1145/2254756.2254795
– ident: e_1_2_1_23_1
– ident: e_1_2_1_13_1
  doi: 10.5555/1833515.1833840
– ident: e_1_2_1_2_1
  doi: 10.1145/1117454.1117457
– ident: e_1_2_1_21_1
  doi: 10.1080/15427951.2009.10129177
– ident: e_1_2_1_10_1
  doi: 10.1214/09-AOS735
– ident: e_1_2_1_4_1
  doi: 10.1145/1378533.1378557
– ident: e_1_2_1_1_1
– ident: e_1_2_1_15_1
  doi: 10.1093/biomet/57.1.97
– ident: e_1_2_1_31_1
  doi: 10.1109/ICDE.2013.6544873
– volume-title: Moore
  year: 2010
  ident: e_1_2_1_28_1
– volume-title: Paul Erdos is Eighty 2, 157--172
  year: 1996
  ident: e_1_2_1_9_1
– ident: e_1_2_1_3_1
  doi: 10.1007/BF02579166
– ident: e_1_2_1_25_1
  doi: 10.1007/978-3-540-39718-2_23
– ident: e_1_2_1_6_1
– ident: e_1_2_1_20_1
  doi: 10.1145/1150402.1150479
– ident: e_1_2_1_5_1
  doi: 10.1137/S0036144503423264
– ident: e_1_2_1_12_1
  doi: 10.1214/ss/1177011137
– ident: e_1_2_1_17_1
  doi: 10.1145/1963405.1963489
– volume: 2
  start-page: 1
  year: 1993
  ident: e_1_2_1_22_1
  article-title: Random walks on graphs: A survey
  publication-title: Combinatorics, Paul Erdos Is Eighty
– ident: e_1_2_1_30_1
– ident: e_1_2_1_18_1
  doi: 10.1145/1993744.1993773
– volume-title: The small world phenomenon in hybrid power law graphs
  ident: e_1_2_1_8_1
– ident: e_1_2_1_27_1
  doi: 10.1145/1772690.1772778
SSID ssj0004897
Score 2.2283733
Snippet Many online social networks feature restrictive web interfaces that only allow the query of a user’s local neighborhood. To enable analytics over such an...
Many online social networks feature restrictive web interfaces that only allow the query of a user's local neighborhood. To enable analytics over such an...
SourceID proquest
crossref
SourceType Aggregation Database
Enrichment Source
Index Database
StartPage 1
SubjectTerms Online
Random walk
Sampling
Social networks
Topology
User interfaces
Websites
World Wide Web
Title Faster Random Walks by Rewiring Online Social Networks On-the-Fly
URI https://www.proquest.com/docview/1808048701
Volume 40
hasFullText 1
inHoldings 1
isFullTextHit
isPrint
link http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwnV3db9MwELdge-GFsQFiAyYjIQSqPOJ82Mljta2bEO3D2MS0l8h2nLXQJYgmQvDXc46dpO0m8aFIUWTFiZL76e7su_sdQq8ZVZLBQQAgPjEuNpGUU-Ir8CWEYjltsgnHE3Z6EX64jC7XqksqeaB-3VlX8j9ShTGQq6mS_QfJdg-FAbgG-cIZJAznv5LxSBiag8GZKLLyZvBZzL8ujDt5pn_MmrQ6yyM6cCW4E5vxvYBhAm4fGc1XQrrDw7FpGNF2D2_CCCZ_1Ng5R_jc-d9X07JuwhrTuvwy0_2w232e9Jg7cTm_V9PZdV32G-N2R17UlbhZ3nmgyzsPfcVVlNhyzAPtFGjEScgsp2OrYS0hk0NSuKQu6ZLdtTwotzV6aMgvjBGN_Ds4s9dsWZdhaOuto9RNvI82fc5NHH9zeDT--KkvnY2b_jvdp9i6ajP1vZu66rCs2uvGCTl_hB661QMeWihso3u62EFbbWcO7BT1Dtp2Vwv81vGKv3uMhhYu2MIFN3DB8idu4YItXLCFC27hgnu4PEEXo-Pzw1PiOmgQFTBekVhnAc8CmdOcBlqHVIQqUYmXKZ57gRBelvss0jzJpWZJlAkvlzzxQqaViFmmg6dooygL_Qxhj8aShjJPBDixVCQCVrpKRkLGsIDQMthFb9rflCpHL2-6nMzTNVHsItzd-M0yqty-5VX7n1PQdiaEJQpd1ouUGhpUeLNH9_78mOfoQQ_bF2ij-l7rl-BCVnLfweA3k5dwJA
linkProvider EBSCOhost
openUrl ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=Faster+Random+Walks+by+Rewiring+Online+Social+Networks+On-the-Fly&rft.jtitle=ACM+transactions+on+database+systems&rft.au=Zhou%2C+Zhuojie&rft.au=Zhang%2C+Nan&rft.au=Gong%2C+Zhiguo&rft.au=Das%2C+Gautam&rft.date=2016-01-01&rft.issn=0362-5915&rft.eissn=1557-4644&rft.volume=40&rft.issue=4&rft.spage=1&rft.epage=36&rft_id=info:doi/10.1145%2F2847526&rft.externalDBID=n%2Fa&rft.externalDocID=10_1145_2847526
thumbnail_l http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/lc.gif&issn=0362-5915&client=summon
thumbnail_m http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/mc.gif&issn=0362-5915&client=summon
thumbnail_s http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/sc.gif&issn=0362-5915&client=summon