Toward a Holistic Performance Evaluation of Large Language Models Across Diverse AI Accelerators
Emani, Murali, Foreman, Sam, Sastry, Varuni, Xie, Zhen, Raskar, Siddhisanket, Arnold, William, Thakur, Rajeev, Vishwanath, Venkatram, Papka, Michael E., Shanmugavelu, Sanjif, Gandhi, Darshan, Zhao, Hengyu, Ma, Dun, Ranganath, Kiran, Weisner, Rick, Chen, Jiunn-yeu, Yang, Yuting, Vassilieva, Natalia, Zhang, Bin C., Howland, Sylvia, Tsyplikhin, Alexander
Published in 2024 IEEE International Parallel and Distributed Processing Symposium Workshops (IPDPSW) (27.05.2024)
Published in 2024 IEEE International Parallel and Distributed Processing Symposium Workshops (IPDPSW) (27.05.2024)
Get full text
Conference Proceeding