Treecode and fast multipole method for N-body simulation with CUDA
GPU Computing Gems Emerald Edition, (Morgan Kaufmann/Elsevier, 2011) pp. 113-132 Due to the variety and importance of applications of treecodes and FMM, the combination of algorithmic acceleration with hardware acceleration can have tremendous impact. Alas, programming these algorithms efficiently i...
Saved in:
Main Authors | , |
---|---|
Format | Journal Article |
Language | English |
Published |
07.10.2010
|
Subjects | |
Online Access | Get full text |
DOI | 10.48550/arxiv.1010.1482 |
Cover
Summary: | GPU Computing Gems Emerald Edition, (Morgan Kaufmann/Elsevier,
2011) pp. 113-132 Due to the variety and importance of applications of treecodes and FMM, the
combination of algorithmic acceleration with hardware acceleration can have
tremendous impact. Alas, programming these algorithms efficiently is no piece
of cake. In this contribution, we aim to present GPU kernels for treecode and
FMM in, as much as possible, an uncomplicated, accessible way. The interested
reader should consult some of the copious literature on the subject for a
deeper understanding of the algorithms themselves. Here, we will offer the
briefest of summaries. We will focus our attention on achieving a GPU
implementation that is efficient in its utilization of the architecture, but
without applying the most advanced techniques known in the field (which would
complicate the presentation). |
---|---|
DOI: | 10.48550/arxiv.1010.1482 |