Fast Distributed PageRank Computation

Over the last decade, PageRank has gained importance in a wide range of applications and domains, ever since it first proved to be effective in determining node importance in large graphs (and was a pioneering idea behind Google’s search engine). In distributed computing alone, PageRank vectors, or...

Full description

Saved in:
Bibliographic Details
Published inDistributed Computing and Networking Vol. 7730; pp. 11 - 26
Main Authors Das Sarma, Atish, Molla, Anisur Rahaman, Pandurangan, Gopal, Upfal, Eli
Format Book Chapter
LanguageEnglish
Published Germany Springer Berlin / Heidelberg 2012
Springer Berlin Heidelberg
SeriesLecture Notes in Computer Science
Subjects
Online AccessGet full text
ISBN9783642356674
3642356672
ISSN0302-9743
1611-3349
DOI10.1007/978-3-642-35668-1_2

Cover

Abstract Over the last decade, PageRank has gained importance in a wide range of applications and domains, ever since it first proved to be effective in determining node importance in large graphs (and was a pioneering idea behind Google’s search engine). In distributed computing alone, PageRank vectors, or more generally random walk based quantities have been used for several different applications ranging from determining important nodes, load balancing, search, and identifying connectivity structures. Surprisingly, however, there has been little work towards designing provably efficient fully-distributed algorithms for computing PageRank. The difficulty is that traditional matrix-vector multiplication style iterative methods may not always adapt well to the distributed setting owing to communication bandwidth restrictions and convergence rates. In this paper, we present fast random walk-based distributed algorithms for computing PageRank in general graphs and prove strong bounds on the round complexity. We first present an algorithm that takes O(logn/ε) rounds with high probability on any graph (directed or undirected), where n is the network size and ε is the reset probability used in the PageRank computation (typically ε is a fixed constant). We then present a faster algorithm that takes $O(\sqrt{\log n}/{\epsilon})$ rounds in undirected graphs. Both of the above algorithms are scalable, as each node processes and sends only small (polylogarithmic in n, the network size) number of bits per round and hence work in the CONGEST distributed computing model. For directed graphs, we present an algorithm that has a running time of $O(\sqrt{\log n/{\epsilon}})$ , but it requires a polynomial number of bits to processed and sent per node in a round. To the best of our knowledge, these are the first fully distributed algorithms for computing PageRank vectors with provably efficient running time.
AbstractList Over the last decade, PageRank has gained importance in a wide range of applications and domains, ever since it first proved to be effective in determining node importance in large graphs (and was a pioneering idea behind Google’s search engine). In distributed computing alone, PageRank vectors, or more generally random walk based quantities have been used for several different applications ranging from determining important nodes, load balancing, search, and identifying connectivity structures. Surprisingly, however, there has been little work towards designing provably efficient fully-distributed algorithms for computing PageRank. The difficulty is that traditional matrix-vector multiplication style iterative methods may not always adapt well to the distributed setting owing to communication bandwidth restrictions and convergence rates. In this paper, we present fast random walk-based distributed algorithms for computing PageRank in general graphs and prove strong bounds on the round complexity. We first present an algorithm that takes O(logn/ε) rounds with high probability on any graph (directed or undirected), where n is the network size and ε is the reset probability used in the PageRank computation (typically ε is a fixed constant). We then present a faster algorithm that takes $O(\sqrt{\log n}/{\epsilon})$ rounds in undirected graphs. Both of the above algorithms are scalable, as each node processes and sends only small (polylogarithmic in n, the network size) number of bits per round and hence work in the CONGEST distributed computing model. For directed graphs, we present an algorithm that has a running time of $O(\sqrt{\log n/{\epsilon}})$ , but it requires a polynomial number of bits to processed and sent per node in a round. To the best of our knowledge, these are the first fully distributed algorithms for computing PageRank vectors with provably efficient running time.
Author Molla, Anisur Rahaman
Pandurangan, Gopal
Upfal, Eli
Das Sarma, Atish
Author_xml – sequence: 1
  givenname: Atish
  surname: Das Sarma
  fullname: Das Sarma, Atish
  email: atish.dassarma@gmail.com
  organization: eBay Research Labs, eBay Inc., USA
– sequence: 2
  givenname: Anisur Rahaman
  surname: Molla
  fullname: Molla, Anisur Rahaman
  email: anisurpm@gmail.com
  organization: Division of Mathematical Sciences, Nanyang Technological University, Singapore
– sequence: 3
  givenname: Gopal
  surname: Pandurangan
  fullname: Pandurangan, Gopal
  organization: Department of Computer Science, Brown University, Providence, USA
– sequence: 4
  givenname: Eli
  surname: Upfal
  fullname: Upfal, Eli
  email: eli@cs.brown.edu
  organization: Department of Computer Science, Brown University, Providence, USA
BookMark eNpVkMFOwzAQRA0URCj9Ai69cDR4vU4cH1GhgFQJhOBs2bFTQksSYuf_cVsu7GWlWb3VzFyQSdu1npArYDfAmLxVsqRIC8Ep5kVRUtD8iMySiknbS3BMMigAKKJQJ_9uUkxIxpBxqqTAM5KVXBSQc2DnZBbCF0tTSqWUyMj10oQ4v29CHBo7Ru_mr2bt30y7mS-6736MJjZde0lOa7MNfva3p-Rj-fC-eKKrl8fnxd2K9pDnnNYCFNSVtc5zQMedc1WdO4Wc1clxyVEVYJhFabkta2-rnLkqGQZpvZKIUwKHv6EfmnbtB227bhM0ML2rRaeQGnWKqfcd6FRLYviB6YfuZ_Qhar-DKt_GwWyrT9NHPwSNrFCMJ0RorvAX2nFgpA
ContentType Book Chapter
Copyright Springer-Verlag Berlin Heidelberg 2013
Copyright_xml – notice: Springer-Verlag Berlin Heidelberg 2013
DBID FFUUA
DEWEY 004.36
DOI 10.1007/978-3-642-35668-1_2
DatabaseName ProQuest Ebook Central - Book Chapters - Demo use only
DatabaseTitleList
DeliveryMethod fulltext_linktorsrc
Discipline Engineering
Computer Science
EISBN 9783642356681
3642356680
EISSN 1611-3349
Editor Sinha, Prasun
Shyamasundar, Rudrapatna K
Frey, Davide
Sarkar, Saswati
Raynal, Michel
Editor_xml – sequence: 1
  fullname: Sinha, Prasun
– sequence: 2
  fullname: Frey, Davide
– sequence: 3
  fullname: Sarkar, Saswati
– sequence: 4
  fullname: Shyamasundar, Rudrapatna K
– sequence: 5
  fullname: Raynal, Michel
EndPage 26
ExternalDocumentID EBC3069028_14_29
GroupedDBID -EI
-JY
-K2
089
0D6
0DA
2HV
38.
A4J
AABBV
AARVG
AAUBL
AAWHR
ABBVZ
ABFTD
ABMLC
ABMNI
ACPRQ
ADHDZ
ADNMO
AEDXK
AEJLV
AEKFX
AETDV
AEZAY
AFJMS
AFPTF
ALMA_UNASSIGNED_HOLDINGS
ARZOH
AZZ
BBABE
CZZ
FFUUA
I4C
IEZ
IX0
JJU
MA.
N2R
SBO
TPJZQ
TSXQS
Z5O
Z7R
Z7S
Z7U
Z7V
Z7W
Z7X
Z7Y
Z7Z
Z81
Z83
Z84
Z85
Z87
Z88
-DT
-GH
-~X
1SB
29L
2HA
5QI
875
AASHB
ACGFS
ADCXD
AEFIE
EJD
F5P
FEDTE
HVGLF
LAS
LDH
P2P
RIG
RNI
RSU
SVGTG
VI1
~02
ID FETCH-LOGICAL-p1552-f4191fcbbde213d2dddcf5d9320f681823961a0b37b2b8febc50dc66717be9733
ISBN 9783642356674
3642356672
ISSN 0302-9743
IngestDate Tue Jul 29 19:51:30 EDT 2025
Thu May 29 00:16:09 EDT 2025
IsPeerReviewed false
IsScholarly false
LCCallNum TK5105.5-5105.9
Language English
LinkModel OpenURL
MergedId FETCHMERGED-LOGICAL-p1552-f4191fcbbde213d2dddcf5d9320f681823961a0b37b2b8febc50dc66717be9733
Notes Original Abstract: Over the last decade, PageRank has gained importance in a wide range of applications and domains, ever since it first proved to be effective in determining node importance in large graphs (and was a pioneering idea behind Google’s search engine). In distributed computing alone, PageRank vectors, or more generally random walk based quantities have been used for several different applications ranging from determining important nodes, load balancing, search, and identifying connectivity structures. Surprisingly, however, there has been little work towards designing provably efficient fully-distributed algorithms for computing PageRank. The difficulty is that traditional matrix-vector multiplication style iterative methods may not always adapt well to the distributed setting owing to communication bandwidth restrictions and convergence rates. In this paper, we present fast random walk-based distributed algorithms for computing PageRank in general graphs and prove strong bounds on the round complexity. We first present an algorithm that takes O(logn/ε) rounds with high probability on any graph (directed or undirected), where n is the network size and ε is the reset probability used in the PageRank computation (typically ε is a fixed constant). We then present a faster algorithm that takes \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$O(\sqrt{\log n}/{\epsilon})$\end{document} rounds in undirected graphs. Both of the above algorithms are scalable, as each node processes and sends only small (polylogarithmic in n, the network size) number of bits per round and hence work in the CONGEST distributed computing model. For directed graphs, we present an algorithm that has a running time of \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$O(\sqrt{\log n/{\epsilon}})$\end{document}, but it requires a polynomial number of bits to processed and sent per node in a round. To the best of our knowledge, these are the first fully distributed algorithms for computing PageRank vectors with provably efficient running time.
OCLC 824615210
PQID EBC3069028_14_29
PageCount 16
ParticipantIDs springer_books_10_1007_978_3_642_35668_1_2
proquest_ebookcentralchapters_3069028_14_29
PublicationCentury 2000
PublicationDate 2012
2013
PublicationDateYYYYMMDD 2012-01-01
2013-01-01
PublicationDate_xml – year: 2012
  text: 2012
PublicationDecade 2010
PublicationPlace Germany
PublicationPlace_xml – name: Germany
– name: Berlin, Heidelberg
PublicationSeriesTitle Lecture Notes in Computer Science
PublicationSubtitle 14th International Conference, ICDCN 2013, Mumbai, India, January 3-6, 2013. Proceedings
PublicationTitle Distributed Computing and Networking
PublicationYear 2012
2013
Publisher Springer Berlin / Heidelberg
Springer Berlin Heidelberg
Publisher_xml – name: Springer Berlin / Heidelberg
– name: Springer Berlin Heidelberg
RelatedPersons Kleinberg, Jon M.
Mattern, Friedemann
Nierstrasz, Oscar
Steffen, Bernhard
Kittler, Josef
Vardi, Moshe Y.
Weikum, Gerhard
Sudan, Madhu
Naor, Moni
Mitchell, John C.
Terzopoulos, Demetri
Pandu Rangan, C.
Kanade, Takeo
Hutchison, David
Tygar, Doug
RelatedPersons_xml – sequence: 1
  givenname: David
  surname: Hutchison
  fullname: Hutchison, David
  organization: Lancaster University, Lancaster, UK
– sequence: 2
  givenname: Takeo
  surname: Kanade
  fullname: Kanade, Takeo
  organization: Carnegie Mellon University, Pittsburgh, USA
– sequence: 3
  givenname: Josef
  surname: Kittler
  fullname: Kittler, Josef
  organization: University of Surrey, Guildford, UK
– sequence: 4
  givenname: Jon M.
  surname: Kleinberg
  fullname: Kleinberg, Jon M.
  organization: Cornell University, Ithaca, USA
– sequence: 5
  givenname: Friedemann
  surname: Mattern
  fullname: Mattern, Friedemann
  organization: ETH Zurich, Zurich, Switzerland
– sequence: 6
  givenname: John C.
  surname: Mitchell
  fullname: Mitchell, John C.
  organization: Stanford University, Stanford, USA
– sequence: 7
  givenname: Moni
  surname: Naor
  fullname: Naor, Moni
  organization: Weizmann Institute of Science, Rehovot, Israel
– sequence: 8
  givenname: Oscar
  surname: Nierstrasz
  fullname: Nierstrasz, Oscar
  organization: University of Bern, Bern, Switzerland
– sequence: 9
  givenname: C.
  surname: Pandu Rangan
  fullname: Pandu Rangan, C.
  organization: Indian Institute of Technology, Madras, India
– sequence: 10
  givenname: Bernhard
  surname: Steffen
  fullname: Steffen, Bernhard
  organization: University of Dortmund, Dortmund, Germany
– sequence: 11
  givenname: Madhu
  surname: Sudan
  fullname: Sudan, Madhu
  organization: Massachusetts Institute of Technology, USA
– sequence: 12
  givenname: Demetri
  surname: Terzopoulos
  fullname: Terzopoulos, Demetri
  organization: University of California, Los Angeles, USA
– sequence: 13
  givenname: Doug
  surname: Tygar
  fullname: Tygar, Doug
  organization: University of California, Berkeley, USA
– sequence: 14
  givenname: Moshe Y.
  surname: Vardi
  fullname: Vardi, Moshe Y.
  organization: Rice University, Houston, USA
– sequence: 15
  givenname: Gerhard
  surname: Weikum
  fullname: Weikum, Gerhard
  organization: Max-Planck Institute of Computer Science, Saarbrücken, Germany
SSID ssj0000879994
ssj0002792
Score 1.5936643
Snippet Over the last decade, PageRank has gained importance in a wide range of applications and domains, ever since it first proved to be effective in determining...
SourceID springer
proquest
SourceType Publisher
StartPage 11
SubjectTerms Distributed Algorithm
Monte Carlo Method
PageRank
Random Walk
Title Fast Distributed PageRank Computation
URI http://ebookcentral.proquest.com/lib/SITE_ID/reader.action?docID=3069028&ppg=29
http://link.springer.com/10.1007/978-3-642-35668-1_2
Volume 7730
hasFullText 1
inHoldings 1
isFullTextHit
isPrint
link http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwnV3PT8IwFG4EL-pBRY34KzvoRTIz2m4dRyUQQpQYA4Zbs_66mKARvPjX-7q1bCAXvSxkoUv3vqXv6-t730PomtIY3IaIQxUrGYILyMIMkyRMUkIEo1SwXL74aZQMJnQ4jadlvXZeXbIQd_J7Y13Jf1CFe4CrrZL9A7LLh8IN-A34whUQhusa-V0Ns7pclnnRrcrGaPPeDL7ecFSkdnun5D6IfjZftKqDnmEpeclmb250CZELAuTZFNUggA8CtpxGllU836STlddsAIECDlf0xvFrIGPF6civBbWaQwEjQzsUtp0cl_7Dn5m7yMWqenXvoUusGDJOYZPBcaeGaiyldbR93xs-vi6jYVHKgKnaHlzL-eFCHqmc71IzqpAFXpvPyg5h7VA75wrjA7Rn60cCW9gBUzxEW3rWQPu-e0bgFtMG2q1IQR6hGwtPUIEn8PAEFXiO0aTfG3cHoWthEX5YbbvQUNgPGymE0rhNFFZKSRMrIM2RSYArYdJJ2lkkCBNYpEYLGUdKwtu2mdAdRsgJqs_eZ_oUBZpI4KqRwaZtKDZxlmlwDtiIWBpNWaeJWt4APD9od9m9snjdOV_BoYluvY24_fOce_1qsC0nHGzLc9tysO3Znx59jnbKT_QC1RefX_oSmNtCXDnYfwDW-D4f
linkProvider Library Specific Holdings
openUrl ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Abook&rft.genre=bookitem&rft.title=Distributed+Computing+and+Networking&rft.atitle=Fast+Distributed+PageRank+Computation&rft.date=2012-01-01&rft.pub=Springer+Berlin+%2F+Heidelberg&rft.isbn=9783642356674&rft.volume=7730&rft_id=info:doi/10.1007%2F978-3-642-35668-1_2&rft.externalDBID=29&rft.externalDocID=EBC3069028_14_29
thumbnail_s http://utb.summon.serialssolutions.com/2.0.0/image/custom?url=https%3A%2F%2Febookcentral.proquest.com%2Fcovers%2F3069028-l.jpg