Program optimization carving for GPU computing
Ryoo, Shane, Rodrigues, Christopher I., Stone, Sam S., Stratton, John A., Ueng, Sain-Zee, Baghsorkhi, Sara S., Hwu, Wen-mei W.
Published in Journal of parallel and distributed computing (01.10.2008)
Published in Journal of parallel and distributed computing (01.10.2008)
Get full text
Journal Article
INSTRUCTION TO REDUCE ELEMENTS IN A VECTOR REGISTER WITH STRIDED ACCESS PATTERN
VASUDEVAN NALINI, BAGHSORKHI SARA S, LEE VICTOR W, KIM, DAE HYUN, BHARADWAJ JAYASHANKAR, HARTONO ALBERT
Year of Publication 01.07.2015
Get full text
Year of Publication 01.07.2015
Patent
Implicitly parallel programming models for thousand-core microprocessors
Hwu, Wen-mei, Ryoo, Shane, Ueng, Sain-Zee, Kelm, John H., Gelado, Isaac, Stone, Sam S., Kidd, Robert E., Baghsorkhi, Sara S., Mahesri, Aqeel A., Tsao, Stephanie C., Navarro, Nacho, Lumetta, Steve S., Frank, Matthew I., Patel, Sanjay J.
Published in 2007 44th ACM/IEEE Design Automation Conference (04.06.2007)
Published in 2007 44th ACM/IEEE Design Automation Conference (04.06.2007)
Get full text
Conference Proceeding
C3-FIow: Compute Compression Co-Design FIow for Deep Neural Networks
Sotoudeh, Matthew, Baghsorkhi, Sara S.
Published in 2019 56th ACM/IEEE Design Automation Conference (DAC) (01.06.2019)
Get full text
Published in 2019 56th ACM/IEEE Design Automation Conference (DAC) (01.06.2019)
Conference Proceeding
Program optimization carving for GPU computing: General-Purpose Processing using Graphics Processing Units
RYOO, Shane, RODRIGUES, Christopher I, STONE, Sam S, STRATTON, John A, UENG, Sain-Zee, BAGHSORKHI, Sara S, HWU, Wen-Mei W
Published in Journal of parallel and distributed computing (2008)
Get full text
Published in Journal of parallel and distributed computing (2008)
Journal Article
Specialized fixed function hardware for efficient convolution
Chen, Xiaoming, Ould-Ahmed-Vall, Elmoustapha, Nealis, Kevin, Srivastava, Dhawal, Yao, Anbang, Vembu, Balaji, Nurvitadhi, Eriko, Barik, Rajkishore, Baghsorkhi, Sara S, Tang, Ping T, Shpeisman, Tatiana
Year of Publication 30.07.2024
Get full text
Year of Publication 30.07.2024
Patent
EFFICIENT SHARING AND COMPRESSION OF DATA ACROSS PROCESSING SYSTEMS
HURD, Linda L, MACPHERSON, Mike B, SAKTHIVEL, Chandrasekaran, WEAST, John C, BAGHSORKHI, Sara S, KOKER, Altug, APPU, Abhishek R, GOTTSCHLICH, Justin E, SURTI, Prasoonkumar, KIM, Dukhwan, RAY, Joydeep
Year of Publication 03.07.2024
Get full text
Year of Publication 03.07.2024
Patent
COMPUTE OPTIMIZATIONS FOR NEURAL NETWORKS
NURVITADHI, ERIKO, OULD-AHMED-VALL, ELMOUSTAPHA, VEMBU, BALAJI, GALOPPO VON BORRIES, NICOLAS C, LIN, TSUNG-HAN, YAO, ANBANG, CHEN, XIAOMING, NEALIS, KEVIN, BAGHSORKHI, SARA S, BARIK, RAJKISHORE, SINHA, KAMAL
Year of Publication 29.07.2024
Get full text
Year of Publication 29.07.2024
Patent