Program optimization carving for GPU computing
Ryoo, Shane, Rodrigues, Christopher I., Stone, Sam S., Stratton, John A., Ueng, Sain-Zee, Baghsorkhi, Sara S., Hwu, Wen-mei W.
Published in Journal of parallel and distributed computing (01.10.2008)
Published in Journal of parallel and distributed computing (01.10.2008)
Get full text
Journal Article
SAVE: Sparsity-Aware Vector Engine for Accelerating DNN Training and Inference on CPUs
Gong, Zhangxiaowen, Ji, Houxiang, Fletcher, Christopher W., Hughes, Christopher J., Baghsorkhi, Sara, Torrellas, Josep
Published in 2020 53rd Annual IEEE/ACM International Symposium on Microarchitecture (MICRO) (01.10.2020)
Published in 2020 53rd Annual IEEE/ACM International Symposium on Microarchitecture (MICRO) (01.10.2020)
Get full text
Conference Proceeding
Implicitly parallel programming models for thousand-core microprocessors
Hwu, Wen-mei, Ryoo, Shane, Ueng, Sain-Zee, Kelm, John H., Gelado, Isaac, Stone, Sam S., Kidd, Robert E., Baghsorkhi, Sara S., Mahesri, Aqeel A., Tsao, Stephanie C., Navarro, Nacho, Lumetta, Steve S., Frank, Matthew I., Patel, Sanjay J.
Published in 2007 44th ACM/IEEE Design Automation Conference (04.06.2007)
Published in 2007 44th ACM/IEEE Design Automation Conference (04.06.2007)
Get full text
Conference Proceeding
INSTRUCTION TO REDUCE ELEMENTS IN A VECTOR REGISTER WITH STRIDED ACCESS PATTERN
VASUDEVAN NALINI, BAGHSORKHI SARA S, LEE VICTOR W, KIM, DAE HYUN, BHARADWAJ JAYASHANKAR, HARTONO ALBERT
Year of Publication 01.07.2015
Get full text
Year of Publication 01.07.2015
Patent
C3-FIow: Compute Compression Co-Design FIow for Deep Neural Networks
Sotoudeh, Matthew, Baghsorkhi, Sara S.
Published in 2019 56th ACM/IEEE Design Automation Conference (DAC) (01.06.2019)
Get full text
Published in 2019 56th ACM/IEEE Design Automation Conference (DAC) (01.06.2019)
Conference Proceeding