TSM2X: High-performance tall-and-skinny matrix–matrix multiplication on GPUs

Linear algebra operations have been widely used in big data analytics and scientific computations. Many works have been done on optimizing linear algebra operations on GPUs with regular-shaped input. However, few works focus on fully utilizing GPU resources when the input is not regular-shaped. Curr...

Bibliographic Details
Published in Journal of Parallel and Distributed Computing, Vol. 151; pp. 70–85
Main Authors Rivera, Cody; Chen, Jieyang; Xiong, Nan; Zhang, Jing; Song, Shuaiwen Leon; Tao, Dingwen
Format Journal Article
Language English
Published Elsevier Inc 01.05.2021