Cross-Feature Transfer Learning for Efficient Tensor Program Generation
Verma, Gaurav, Raskar, Siddhisanket, Emani, Murali, Chapman, Barbara
Published in Applied sciences (01.01.2024)
Published in Applied sciences (01.01.2024)
Get full text
Journal Article
Throughput-oriented and Accuracy-aware DNN Training with BFloat16 on GPU
Xie, Zhen, Raskar, Siddhisanket, Emani, Murali
Published in 2022 IEEE International Parallel and Distributed Processing Symposium Workshops (IPDPSW) (01.05.2022)
Published in 2022 IEEE International Parallel and Distributed Processing Symposium Workshops (IPDPSW) (01.05.2022)
Get full text
Conference Proceeding
Transfer Learning Across Heterogeneous Features For Efficient Tensor Program Generation
Verma, Gaurav, Raskar, Siddhisanket, Xie, Zhen, Malik, Abid M, Emani, Murali, Chapman, Barbara
Published in arXiv.org (26.12.2023)
Published in arXiv.org (26.12.2023)
Get full text
Paper
Journal Article
A Comprehensive Performance Study of Large Language Models on Novel AI Accelerators
Emani, Murali, Foreman, Sam, Sastry, Varuni, Xie, Zhen, Raskar, Siddhisanket, Arnold, William, Thakur, Rajeev, Vishwanath, Venkatram, Papka, Michael E
Year of Publication 06.10.2023
Year of Publication 06.10.2023
Get full text
Journal Article
Position Paper: Extending Codelet Model for Dataflow Software Pipelining using Software-Hardware Co-Design
Raskar, Siddhisanket, Applencourt, Thomas, Kumaran, Kalyan, Gao, Guang
Published in 2019 IEEE 43rd Annual Computer Software and Applications Conference (COMPSAC) (01.07.2019)
Published in 2019 IEEE 43rd Annual Computer Software and Applications Conference (COMPSAC) (01.07.2019)
Get full text
Conference Proceeding
A Comprehensive Evaluation of Novel AI Accelerators for Deep Learning Workloads
Emani, Murali, Xie, Zhen, Raskar, Siddhisanket, Sastry, Varuni, Arnold, William, Wilson, Bruce, Thakur, Rajeev, Vishwanath, Venkatram, Liu, Zhengchun, Papka, Michael E., Bohorquez, Cindy Orozco, Weisner, Rick, Li, Karen, Sheng, Yongning, Du, Yun, Zhang, Jian, Tsyplikhin, Alexander, Khaira, Gurdaman, Fowers, Jeremy, Sivakumar, Ramakrishnan, Godsoe, Victoria, Macias, Adrian, Tekur, Chetan, Boyd, Matthew
Published in 2022 IEEE/ACM International Workshop on Performance Modeling, Benchmarking and Simulation of High Performance Computer Systems (PMBS) (01.11.2022)
Published in 2022 IEEE/ACM International Workshop on Performance Modeling, Benchmarking and Simulation of High Performance Computer Systems (PMBS) (01.11.2022)
Get full text
Conference Proceeding
Toward A High-Performance Emulation Platformfor Brain-Inspired Intelligent SystemsExploring Dataflow-Based Execution Model and Beyond
Zeng, Sihan, Monsalve Diaz, Jose M, Raskar, Siddhisanket
Published in 2019 IEEE 43rd Annual Computer Software and Applications Conference (COMPSAC) (01.07.2019)
Published in 2019 IEEE 43rd Annual Computer Software and Applications Conference (COMPSAC) (01.07.2019)
Get full text
Conference Proceeding
Toward a Holistic Performance Evaluation of Large Language Models Across Diverse AI Accelerators
Emani, Murali, Foreman, Sam, Sastry, Varuni, Xie, Zhen, Raskar, Siddhisanket, Arnold, William, Thakur, Rajeev, Vishwanath, Venkatram, Papka, Michael E., Shanmugavelu, Sanjif, Gandhi, Darshan, Zhao, Hengyu, Ma, Dun, Ranganath, Kiran, Weisner, Rick, Chen, Jiunn-yeu, Yang, Yuting, Vassilieva, Natalia, Zhang, Bin C., Howland, Sylvia, Tsyplikhin, Alexander
Published in 2024 IEEE International Parallel and Distributed Processing Symposium Workshops (IPDPSW) (27.05.2024)
Published in 2024 IEEE International Parallel and Distributed Processing Symposium Workshops (IPDPSW) (27.05.2024)
Get full text
Conference Proceeding
CODIR: Towards an MLIR Codelet Model Dialect
Kabrick, Ryan, Perdomo, Diego A. Roa, Raskar, Siddhisanket, Diaz, Jose M. Monsalve, Fox, Dawson, Gao, Guang R.
Published in 2020 IEEE/ACM Fourth Annual Workshop on Emerging Parallel and Distributed Runtime Systems and Middleware (IPDRM) (01.11.2020)
Published in 2020 IEEE/ACM Fourth Annual Workshop on Emerging Parallel and Distributed Runtime Systems and Middleware (IPDRM) (01.11.2020)
Get full text
Conference Proceeding
DEMAC: A Modular Platform for HW-SW Co-Design
Perdomo, Diego A. Roa, Kabrick, Ryan, Diaz, Jose M. Monsalve, Raskar, Siddhisanket, Fox, Dawson, Gao, Guang R.
Published in 2020 IEEE/ACM Fourth Annual Workshop on Emerging Parallel and Distributed Runtime Systems and Middleware (IPDRM) (01.11.2020)
Published in 2020 IEEE/ACM Fourth Annual Workshop on Emerging Parallel and Distributed Runtime Systems and Middleware (IPDRM) (01.11.2020)
Get full text
Conference Proceeding