TSM2X: High-performance tall-and-skinny matrix–matrix multiplication on GPUs
Linear algebra operations have been widely used in big data analytics and scientific computations. Many works have been done on optimizing linear algebra operations on GPUs with regular-shaped input. However, few works focus on fully utilizing GPU resources when the input is not regular-shaped. Curr...
Saved in:
Published in | Journal of parallel and distributed computing Vol. 151; pp. 70 - 85 |
---|---|
Main Authors | , , , , , |
Format | Journal Article |
Language | English |
Published |
Elsevier Inc
01.05.2021
|
Subjects | |
Online Access | Get full text |
Cover
Loading…
Be the first to leave a comment!