Treecode and fast multipole method for N-body simulation with CUDA

GPU Computing Gems Emerald Edition, (Morgan Kaufmann/Elsevier, 2011) pp. 113-132 Due to the variety and importance of applications of treecodes and FMM, the combination of algorithmic acceleration with hardware acceleration can have tremendous impact. Alas, programming these algorithms efficiently i...

Full description

Saved in:
Bibliographic Details
Main Authors Yokota, Rio, Barba, Lorena
Format Journal Article
LanguageEnglish
Published 07.10.2010
Subjects
Online AccessGet full text
DOI10.48550/arxiv.1010.1482

Cover

Abstract GPU Computing Gems Emerald Edition, (Morgan Kaufmann/Elsevier, 2011) pp. 113-132 Due to the variety and importance of applications of treecodes and FMM, the combination of algorithmic acceleration with hardware acceleration can have tremendous impact. Alas, programming these algorithms efficiently is no piece of cake. In this contribution, we aim to present GPU kernels for treecode and FMM in, as much as possible, an uncomplicated, accessible way. The interested reader should consult some of the copious literature on the subject for a deeper understanding of the algorithms themselves. Here, we will offer the briefest of summaries. We will focus our attention on achieving a GPU implementation that is efficient in its utilization of the architecture, but without applying the most advanced techniques known in the field (which would complicate the presentation).
AbstractList GPU Computing Gems Emerald Edition, (Morgan Kaufmann/Elsevier, 2011) pp. 113-132 Due to the variety and importance of applications of treecodes and FMM, the combination of algorithmic acceleration with hardware acceleration can have tremendous impact. Alas, programming these algorithms efficiently is no piece of cake. In this contribution, we aim to present GPU kernels for treecode and FMM in, as much as possible, an uncomplicated, accessible way. The interested reader should consult some of the copious literature on the subject for a deeper understanding of the algorithms themselves. Here, we will offer the briefest of summaries. We will focus our attention on achieving a GPU implementation that is efficient in its utilization of the architecture, but without applying the most advanced techniques known in the field (which would complicate the presentation).
Author Yokota, Rio
Barba, Lorena
Author_xml – sequence: 1
  givenname: Rio
  surname: Yokota
  fullname: Yokota, Rio
– sequence: 2
  givenname: Lorena
  surname: Barba
  fullname: Barba, Lorena
BackLink https://doi.org/10.48550/arXiv.1010.1482$$DView paper in arXiv
BookMark eNrjYmDJy89LZWCQMDTQM7EwNTXQTyyqyCzTMzQAChiaWBhxMjiFFKWmJuenpCok5qUopCUWlyjkluaUZBbk56Qq5KaWZOQDRfOLFPx0k_JTKhWKM4GyiSWZ-XkK5ZklGQrOoS6OPAysaYk5xam8UJqbQc7NNcTZQxdsWXxBUWZuYlFlPMjSeJClxgQVAADi2Tfq
ContentType Journal Article
Copyright http://arxiv.org/licenses/nonexclusive-distrib/1.0
Copyright_xml – notice: http://arxiv.org/licenses/nonexclusive-distrib/1.0
DBID GOX
DOI 10.48550/arxiv.1010.1482
DatabaseName arXiv.org
DatabaseTitleList
Database_xml – sequence: 1
  dbid: GOX
  name: arXiv.org
  url: http://arxiv.org/find
  sourceTypes: Open Access Repository
DeliveryMethod fulltext_linktorsrc
ExternalDocumentID 1010_1482
GroupedDBID GOX
ID FETCH-arxiv_primary_1010_14823
IEDL.DBID GOX
IngestDate Tue Jul 22 23:10:40 EDT 2025
IsDoiOpenAccess true
IsOpenAccess true
IsPeerReviewed false
IsScholarly false
Language English
LinkModel DirectLink
MergedId FETCHMERGED-arxiv_primary_1010_14823
OpenAccessLink https://arxiv.org/abs/1010.1482
ParticipantIDs arxiv_primary_1010_1482
PublicationCentury 2000
PublicationDate 2010-10-07
PublicationDateYYYYMMDD 2010-10-07
PublicationDate_xml – month: 10
  year: 2010
  text: 2010-10-07
  day: 07
PublicationDecade 2010
PublicationYear 2010
Score 2.892621
SecondaryResourceType preprint
Snippet GPU Computing Gems Emerald Edition, (Morgan Kaufmann/Elsevier, 2011) pp. 113-132 Due to the variety and importance of applications of treecodes and FMM, the...
SourceID arxiv
SourceType Open Access Repository
SubjectTerms Physics - Computational Physics
Title Treecode and fast multipole method for N-body simulation with CUDA
URI https://arxiv.org/abs/1010.1482
hasFullText 1
inHoldings 1
isFullTextHit
isPrint
link http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwdV1LSwMxEB5qT15E8f2oc_AacLPZ7O6xVmsRrJcW9iZJdwIFtbJdRf-9k2QVL72FZEhCwmS-ybwArq7rWhLpRFiXFULpmluZccKZnOGI0QlZH5z8ONWTuXqosqoHl7-xMKb5Wn7G_MDW1xzxTK0KfmO3pPS61f1TFY2NIRNXR_5Hxggz9PwTEeNd2OmwHQ7jZexBj9724WbWEPnAcWSdHZ1Ztxid-FYvhLF-MzJwxKmwq_ob18vXrqAW-i9SHM1vhwcwGN_NRhMRVn1-jwkivNsUQ2veT3oIfdbi6RgwVSUxL1ldpE4tSmlLJ_Ncs7YkF6Sz5ASONkxyunHkDLaDMdvbs_Nz6LfNB12wjGztIJzUD-x3a1Q
linkProvider Cornell University
openUrl ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=Treecode+and+fast+multipole+method+for+N-body+simulation+with+CUDA&rft.au=Yokota%2C+Rio&rft.au=Barba%2C+Lorena&rft.date=2010-10-07&rft_id=info:doi/10.48550%2Farxiv.1010.1482&rft.externalDocID=1010_1482