Fast Distributed PageRank Computation
Over the last decade, PageRank has gained importance in a wide range of applications and domains, ever since it first proved to be effective in determining node importance in large graphs (and was a pioneering idea behind Google’s search engine). In distributed computing alone, PageRank vectors, or...
Saved in:
Published in | Distributed Computing and Networking Vol. 7730; pp. 11 - 26 |
---|---|
Main Authors | , , , |
Format | Book Chapter |
Language | English |
Published |
Germany
Springer Berlin / Heidelberg
2012
Springer Berlin Heidelberg |
Series | Lecture Notes in Computer Science |
Subjects | |
Online Access | Get full text |
ISBN | 9783642356674 3642356672 |
ISSN | 0302-9743 1611-3349 |
DOI | 10.1007/978-3-642-35668-1_2 |
Cover
Abstract | Over the last decade, PageRank has gained importance in a wide range of applications and domains, ever since it first proved to be effective in determining node importance in large graphs (and was a pioneering idea behind Google’s search engine). In distributed computing alone, PageRank vectors, or more generally random walk based quantities have been used for several different applications ranging from determining important nodes, load balancing, search, and identifying connectivity structures. Surprisingly, however, there has been little work towards designing provably efficient fully-distributed algorithms for computing PageRank. The difficulty is that traditional matrix-vector multiplication style iterative methods may not always adapt well to the distributed setting owing to communication bandwidth restrictions and convergence rates.
In this paper, we present fast random walk-based distributed algorithms for computing PageRank in general graphs and prove strong bounds on the round complexity. We first present an algorithm that takes O(logn/ε) rounds with high probability on any graph (directed or undirected), where n is the network size and ε is the reset probability used in the PageRank computation (typically ε is a fixed constant). We then present a faster algorithm that takes $O(\sqrt{\log n}/{\epsilon})$ rounds in undirected graphs. Both of the above algorithms are scalable, as each node processes and sends only small (polylogarithmic in n, the network size) number of bits per round and hence work in the CONGEST distributed computing model. For directed graphs, we present an algorithm that has a running time of $O(\sqrt{\log n/{\epsilon}})$ , but it requires a polynomial number of bits to processed and sent per node in a round. To the best of our knowledge, these are the first fully distributed algorithms for computing PageRank vectors with provably efficient running time. |
---|---|
AbstractList | Over the last decade, PageRank has gained importance in a wide range of applications and domains, ever since it first proved to be effective in determining node importance in large graphs (and was a pioneering idea behind Google’s search engine). In distributed computing alone, PageRank vectors, or more generally random walk based quantities have been used for several different applications ranging from determining important nodes, load balancing, search, and identifying connectivity structures. Surprisingly, however, there has been little work towards designing provably efficient fully-distributed algorithms for computing PageRank. The difficulty is that traditional matrix-vector multiplication style iterative methods may not always adapt well to the distributed setting owing to communication bandwidth restrictions and convergence rates.
In this paper, we present fast random walk-based distributed algorithms for computing PageRank in general graphs and prove strong bounds on the round complexity. We first present an algorithm that takes O(logn/ε) rounds with high probability on any graph (directed or undirected), where n is the network size and ε is the reset probability used in the PageRank computation (typically ε is a fixed constant). We then present a faster algorithm that takes $O(\sqrt{\log n}/{\epsilon})$ rounds in undirected graphs. Both of the above algorithms are scalable, as each node processes and sends only small (polylogarithmic in n, the network size) number of bits per round and hence work in the CONGEST distributed computing model. For directed graphs, we present an algorithm that has a running time of $O(\sqrt{\log n/{\epsilon}})$ , but it requires a polynomial number of bits to processed and sent per node in a round. To the best of our knowledge, these are the first fully distributed algorithms for computing PageRank vectors with provably efficient running time. |
Author | Molla, Anisur Rahaman Pandurangan, Gopal Upfal, Eli Das Sarma, Atish |
Author_xml | – sequence: 1 givenname: Atish surname: Das Sarma fullname: Das Sarma, Atish email: atish.dassarma@gmail.com organization: eBay Research Labs, eBay Inc., USA – sequence: 2 givenname: Anisur Rahaman surname: Molla fullname: Molla, Anisur Rahaman email: anisurpm@gmail.com organization: Division of Mathematical Sciences, Nanyang Technological University, Singapore – sequence: 3 givenname: Gopal surname: Pandurangan fullname: Pandurangan, Gopal organization: Department of Computer Science, Brown University, Providence, USA – sequence: 4 givenname: Eli surname: Upfal fullname: Upfal, Eli email: eli@cs.brown.edu organization: Department of Computer Science, Brown University, Providence, USA |
BookMark | eNpVkMFOwzAQRA0URCj9Ai69cDR4vU4cH1GhgFQJhOBs2bFTQksSYuf_cVsu7GWlWb3VzFyQSdu1npArYDfAmLxVsqRIC8Ep5kVRUtD8iMySiknbS3BMMigAKKJQJ_9uUkxIxpBxqqTAM5KVXBSQc2DnZBbCF0tTSqWUyMj10oQ4v29CHBo7Ru_mr2bt30y7mS-6736MJjZde0lOa7MNfva3p-Rj-fC-eKKrl8fnxd2K9pDnnNYCFNSVtc5zQMedc1WdO4Wc1clxyVEVYJhFabkta2-rnLkqGQZpvZKIUwKHv6EfmnbtB227bhM0ML2rRaeQGnWKqfcd6FRLYviB6YfuZ_Qhar-DKt_GwWyrT9NHPwSNrFCMJ0RorvAX2nFgpA |
ContentType | Book Chapter |
Copyright | Springer-Verlag Berlin Heidelberg 2013 |
Copyright_xml | – notice: Springer-Verlag Berlin Heidelberg 2013 |
DBID | FFUUA |
DEWEY | 004.36 |
DOI | 10.1007/978-3-642-35668-1_2 |
DatabaseName | ProQuest Ebook Central - Book Chapters - Demo use only |
DatabaseTitleList | |
DeliveryMethod | fulltext_linktorsrc |
Discipline | Engineering Computer Science |
EISBN | 9783642356681 3642356680 |
EISSN | 1611-3349 |
Editor | Sinha, Prasun Shyamasundar, Rudrapatna K Frey, Davide Sarkar, Saswati Raynal, Michel |
Editor_xml | – sequence: 1 fullname: Sinha, Prasun – sequence: 2 fullname: Frey, Davide – sequence: 3 fullname: Sarkar, Saswati – sequence: 4 fullname: Shyamasundar, Rudrapatna K – sequence: 5 fullname: Raynal, Michel |
EndPage | 26 |
ExternalDocumentID | EBC3069028_14_29 |
GroupedDBID | -EI -JY -K2 089 0D6 0DA 2HV 38. A4J AABBV AARVG AAUBL AAWHR ABBVZ ABFTD ABMLC ABMNI ACPRQ ADHDZ ADNMO AEDXK AEJLV AEKFX AETDV AEZAY AFJMS AFPTF ALMA_UNASSIGNED_HOLDINGS ARZOH AZZ BBABE CZZ FFUUA I4C IEZ IX0 JJU MA. N2R SBO TPJZQ TSXQS Z5O Z7R Z7S Z7U Z7V Z7W Z7X Z7Y Z7Z Z81 Z83 Z84 Z85 Z87 Z88 -DT -GH -~X 1SB 29L 2HA 5QI 875 AASHB ACGFS ADCXD AEFIE EJD F5P FEDTE HVGLF LAS LDH P2P RIG RNI RSU SVGTG VI1 ~02 |
ID | FETCH-LOGICAL-p1552-f4191fcbbde213d2dddcf5d9320f681823961a0b37b2b8febc50dc66717be9733 |
ISBN | 9783642356674 3642356672 |
ISSN | 0302-9743 |
IngestDate | Tue Jul 29 19:51:30 EDT 2025 Thu May 29 00:16:09 EDT 2025 |
IsPeerReviewed | false |
IsScholarly | false |
LCCallNum | TK5105.5-5105.9 |
Language | English |
LinkModel | OpenURL |
MergedId | FETCHMERGED-LOGICAL-p1552-f4191fcbbde213d2dddcf5d9320f681823961a0b37b2b8febc50dc66717be9733 |
Notes | Original Abstract: Over the last decade, PageRank has gained importance in a wide range of applications and domains, ever since it first proved to be effective in determining node importance in large graphs (and was a pioneering idea behind Google’s search engine). In distributed computing alone, PageRank vectors, or more generally random walk based quantities have been used for several different applications ranging from determining important nodes, load balancing, search, and identifying connectivity structures. Surprisingly, however, there has been little work towards designing provably efficient fully-distributed algorithms for computing PageRank. The difficulty is that traditional matrix-vector multiplication style iterative methods may not always adapt well to the distributed setting owing to communication bandwidth restrictions and convergence rates. In this paper, we present fast random walk-based distributed algorithms for computing PageRank in general graphs and prove strong bounds on the round complexity. We first present an algorithm that takes O(logn/ε) rounds with high probability on any graph (directed or undirected), where n is the network size and ε is the reset probability used in the PageRank computation (typically ε is a fixed constant). We then present a faster algorithm that takes \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$O(\sqrt{\log n}/{\epsilon})$\end{document} rounds in undirected graphs. Both of the above algorithms are scalable, as each node processes and sends only small (polylogarithmic in n, the network size) number of bits per round and hence work in the CONGEST distributed computing model. For directed graphs, we present an algorithm that has a running time of \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$O(\sqrt{\log n/{\epsilon}})$\end{document}, but it requires a polynomial number of bits to processed and sent per node in a round. To the best of our knowledge, these are the first fully distributed algorithms for computing PageRank vectors with provably efficient running time. |
OCLC | 824615210 |
PQID | EBC3069028_14_29 |
PageCount | 16 |
ParticipantIDs | springer_books_10_1007_978_3_642_35668_1_2 proquest_ebookcentralchapters_3069028_14_29 |
PublicationCentury | 2000 |
PublicationDate | 2012 2013 |
PublicationDateYYYYMMDD | 2012-01-01 2013-01-01 |
PublicationDate_xml | – year: 2012 text: 2012 |
PublicationDecade | 2010 |
PublicationPlace | Germany |
PublicationPlace_xml | – name: Germany – name: Berlin, Heidelberg |
PublicationSeriesTitle | Lecture Notes in Computer Science |
PublicationSubtitle | 14th International Conference, ICDCN 2013, Mumbai, India, January 3-6, 2013. Proceedings |
PublicationTitle | Distributed Computing and Networking |
PublicationYear | 2012 2013 |
Publisher | Springer Berlin / Heidelberg Springer Berlin Heidelberg |
Publisher_xml | – name: Springer Berlin / Heidelberg – name: Springer Berlin Heidelberg |
RelatedPersons | Kleinberg, Jon M. Mattern, Friedemann Nierstrasz, Oscar Steffen, Bernhard Kittler, Josef Vardi, Moshe Y. Weikum, Gerhard Sudan, Madhu Naor, Moni Mitchell, John C. Terzopoulos, Demetri Pandu Rangan, C. Kanade, Takeo Hutchison, David Tygar, Doug |
RelatedPersons_xml | – sequence: 1 givenname: David surname: Hutchison fullname: Hutchison, David organization: Lancaster University, Lancaster, UK – sequence: 2 givenname: Takeo surname: Kanade fullname: Kanade, Takeo organization: Carnegie Mellon University, Pittsburgh, USA – sequence: 3 givenname: Josef surname: Kittler fullname: Kittler, Josef organization: University of Surrey, Guildford, UK – sequence: 4 givenname: Jon M. surname: Kleinberg fullname: Kleinberg, Jon M. organization: Cornell University, Ithaca, USA – sequence: 5 givenname: Friedemann surname: Mattern fullname: Mattern, Friedemann organization: ETH Zurich, Zurich, Switzerland – sequence: 6 givenname: John C. surname: Mitchell fullname: Mitchell, John C. organization: Stanford University, Stanford, USA – sequence: 7 givenname: Moni surname: Naor fullname: Naor, Moni organization: Weizmann Institute of Science, Rehovot, Israel – sequence: 8 givenname: Oscar surname: Nierstrasz fullname: Nierstrasz, Oscar organization: University of Bern, Bern, Switzerland – sequence: 9 givenname: C. surname: Pandu Rangan fullname: Pandu Rangan, C. organization: Indian Institute of Technology, Madras, India – sequence: 10 givenname: Bernhard surname: Steffen fullname: Steffen, Bernhard organization: University of Dortmund, Dortmund, Germany – sequence: 11 givenname: Madhu surname: Sudan fullname: Sudan, Madhu organization: Massachusetts Institute of Technology, USA – sequence: 12 givenname: Demetri surname: Terzopoulos fullname: Terzopoulos, Demetri organization: University of California, Los Angeles, USA – sequence: 13 givenname: Doug surname: Tygar fullname: Tygar, Doug organization: University of California, Berkeley, USA – sequence: 14 givenname: Moshe Y. surname: Vardi fullname: Vardi, Moshe Y. organization: Rice University, Houston, USA – sequence: 15 givenname: Gerhard surname: Weikum fullname: Weikum, Gerhard organization: Max-Planck Institute of Computer Science, Saarbrücken, Germany |
SSID | ssj0000879994 ssj0002792 |
Score | 1.5936643 |
Snippet | Over the last decade, PageRank has gained importance in a wide range of applications and domains, ever since it first proved to be effective in determining... |
SourceID | springer proquest |
SourceType | Publisher |
StartPage | 11 |
SubjectTerms | Distributed Algorithm Monte Carlo Method PageRank Random Walk |
Title | Fast Distributed PageRank Computation |
URI | http://ebookcentral.proquest.com/lib/SITE_ID/reader.action?docID=3069028&ppg=29 http://link.springer.com/10.1007/978-3-642-35668-1_2 |
Volume | 7730 |
hasFullText | 1 |
inHoldings | 1 |
isFullTextHit | |
isPrint | |
link | http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwnV3PT8IwFG4EL-pBRY34KzvoRTIz2m4dRyUQQpQYA4Zbs_66mKARvPjX-7q1bCAXvSxkoUv3vqXv6-t730PomtIY3IaIQxUrGYILyMIMkyRMUkIEo1SwXL74aZQMJnQ4jadlvXZeXbIQd_J7Y13Jf1CFe4CrrZL9A7LLh8IN-A34whUQhusa-V0Ns7pclnnRrcrGaPPeDL7ecFSkdnun5D6IfjZftKqDnmEpeclmb250CZELAuTZFNUggA8CtpxGllU836STlddsAIECDlf0xvFrIGPF6civBbWaQwEjQzsUtp0cl_7Dn5m7yMWqenXvoUusGDJOYZPBcaeGaiyldbR93xs-vi6jYVHKgKnaHlzL-eFCHqmc71IzqpAFXpvPyg5h7VA75wrjA7Rn60cCW9gBUzxEW3rWQPu-e0bgFtMG2q1IQR6hGwtPUIEn8PAEFXiO0aTfG3cHoWthEX5YbbvQUNgPGymE0rhNFFZKSRMrIM2RSYArYdJJ2lkkCBNYpEYLGUdKwtu2mdAdRsgJqs_eZ_oUBZpI4KqRwaZtKDZxlmlwDtiIWBpNWaeJWt4APD9od9m9snjdOV_BoYluvY24_fOce_1qsC0nHGzLc9tysO3Znx59jnbKT_QC1RefX_oSmNtCXDnYfwDW-D4f |
linkProvider | Library Specific Holdings |
openUrl | ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Abook&rft.genre=bookitem&rft.title=Distributed+Computing+and+Networking&rft.atitle=Fast+Distributed+PageRank+Computation&rft.date=2012-01-01&rft.pub=Springer+Berlin+%2F+Heidelberg&rft.isbn=9783642356674&rft.volume=7730&rft_id=info:doi/10.1007%2F978-3-642-35668-1_2&rft.externalDBID=29&rft.externalDocID=EBC3069028_14_29 |
thumbnail_s | http://utb.summon.serialssolutions.com/2.0.0/image/custom?url=https%3A%2F%2Febookcentral.proquest.com%2Fcovers%2F3069028-l.jpg |