Treecode and fast multipole method for N-body simulation with CUDA

GPU Computing Gems Emerald Edition, (Morgan Kaufmann/Elsevier, 2011) pp. 113-132 Due to the variety and importance of applications of treecodes and FMM, the combination of algorithmic acceleration with hardware acceleration can have tremendous impact. Alas, programming these algorithms efficiently i...

Full description

Saved in:

Bibliographic Details
Main Authors	Yokota, Rio, Barba, Lorena
Format	Journal Article
Language	English
Published	07.10.2010
Subjects	Physics - Computational Physics
Online Access	Get full text
DOI	10.48550/arxiv.1010.1482

Cover

Abstract	GPU Computing Gems Emerald Edition, (Morgan Kaufmann/Elsevier, 2011) pp. 113-132 Due to the variety and importance of applications of treecodes and FMM, the combination of algorithmic acceleration with hardware acceleration can have tremendous impact. Alas, programming these algorithms efficiently is no piece of cake. In this contribution, we aim to present GPU kernels for treecode and FMM in, as much as possible, an uncomplicated, accessible way. The interested reader should consult some of the copious literature on the subject for a deeper understanding of the algorithms themselves. Here, we will offer the briefest of summaries. We will focus our attention on achieving a GPU implementation that is efficient in its utilization of the architecture, but without applying the most advanced techniques known in the field (which would complicate the presentation).
AbstractList	GPU Computing Gems Emerald Edition, (Morgan Kaufmann/Elsevier, 2011) pp. 113-132 Due to the variety and importance of applications of treecodes and FMM, the combination of algorithmic acceleration with hardware acceleration can have tremendous impact. Alas, programming these algorithms efficiently is no piece of cake. In this contribution, we aim to present GPU kernels for treecode and FMM in, as much as possible, an uncomplicated, accessible way. The interested reader should consult some of the copious literature on the subject for a deeper understanding of the algorithms themselves. Here, we will offer the briefest of summaries. We will focus our attention on achieving a GPU implementation that is efficient in its utilization of the architecture, but without applying the most advanced techniques known in the field (which would complicate the presentation).
Author	Yokota, Rio Barba, Lorena
Author_xml	– sequence: 1 givenname: Rio surname: Yokota fullname: Yokota, Rio – sequence: 2 givenname: Lorena surname: Barba fullname: Barba, Lorena
BackLink	https://doi.org/10.48550/arXiv.1010.1482$$DView paper in arXiv
BookMark	eNrjYmDJy89LZWCQMDTQM7EwNTXQTyyqyCzTMzQAChiaWBhxMjiFFKWmJuenpCok5qUopCUWlyjkluaUZBbk56Qq5KaWZOQDRfOLFPx0k_JTKhWKM4GyiSWZ-XkK5ZklGQrOoS6OPAysaYk5xam8UJqbQc7NNcTZQxdsWXxBUWZuYlFlPMjSeJClxgQVAADi2Tfq
ContentType	Journal Article
Copyright	http://arxiv.org/licenses/nonexclusive-distrib/1.0
Copyright_xml	– notice: http://arxiv.org/licenses/nonexclusive-distrib/1.0
DBID	GOX
DOI	10.48550/arxiv.1010.1482
DatabaseName	arXiv.org
DatabaseTitleList
Database_xml	– sequence: 1 dbid: GOX name: arXiv.org url: http://arxiv.org/find sourceTypes: Open Access Repository
DeliveryMethod	fulltext_linktorsrc
ExternalDocumentID	1010_1482
GroupedDBID	GOX
ID	FETCH-arxiv_primary_1010_14823
IEDL.DBID	GOX
IngestDate	Tue Jul 22 23:10:40 EDT 2025
IsDoiOpenAccess	true
IsOpenAccess	true
IsPeerReviewed	false
IsScholarly	false
Language	English
LinkModel	DirectLink
MergedId	FETCHMERGED-arxiv_primary_1010_14823
OpenAccessLink	https://arxiv.org/abs/1010.1482
ParticipantIDs	arxiv_primary_1010_1482
PublicationCentury	2000
PublicationDate	2010-10-07
PublicationDateYYYYMMDD	2010-10-07
PublicationDate_xml	– month: 10 year: 2010 text: 2010-10-07 day: 07
PublicationDecade	2010
PublicationYear	2010
Score	2.892621
SecondaryResourceType	preprint
Snippet	GPU Computing Gems Emerald Edition, (Morgan Kaufmann/Elsevier, 2011) pp. 113-132 Due to the variety and importance of applications of treecodes and FMM, the...
SourceID	arxiv
SourceType	Open Access Repository
SubjectTerms	Physics - Computational Physics
Title	Treecode and fast multipole method for N-body simulation with CUDA
URI	https://arxiv.org/abs/1010.1482
hasFullText	1
inHoldings	1
isFullTextHit
isPrint
link	http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwdV1LSwMxEB5qT15E8f2oc_AacLPZ7O6xVmsRrJcW9iZJdwIFtbJdRf-9k2QVL72FZEhCwmS-ybwArq7rWhLpRFiXFULpmluZccKZnOGI0QlZH5z8ONWTuXqosqoHl7-xMKb5Wn7G_MDW1xzxTK0KfmO3pPS61f1TFY2NIRNXR_5Hxggz9PwTEeNd2OmwHQ7jZexBj9724WbWEPnAcWSdHZ1Ztxid-FYvhLF-MzJwxKmwq_ob18vXrqAW-i9SHM1vhwcwGN_NRhMRVn1-jwkivNsUQ2veT3oIfdbi6RgwVSUxL1ldpE4tSmlLJ_Ncs7YkF6Sz5ASONkxyunHkDLaDMdvbs_Nz6LfNB12wjGztIJzUD-x3a1Q
linkProvider	Cornell University
openUrl	ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=Treecode+and+fast+multipole+method+for+N-body+simulation+with+CUDA&rft.au=Yokota%2C+Rio&rft.au=Barba%2C+Lorena&rft.date=2010-10-07&rft_id=info:doi/10.48550%2Farxiv.1010.1482&rft.externalDocID=1010_1482