Automated systolic array architecture synthesis for high throughput CNN inference on FPGAs
Xuechao Wei, Yu, Cody Hao, Peng Zhang, Youxiang Chen, Yuxin Wang, Han Hu, Yun Liang, Cong, Jason
Published in 2017 54th ACM/EDAC/IEEE Design Automation Conference (DAC) (18.06.2017)
Published in 2017 54th ACM/EDAC/IEEE Design Automation Conference (DAC) (18.06.2017)
Get full text
Conference Proceeding
Customizable Computing-From Single Chip to Datacenters
Cong, Jason, Fang, Zhenman, Huang, Muhuan, Wei, Peng, Wu, Di, Yu, Cody Hao
Published in Proceedings of the IEEE (01.01.2019)
Published in Proceedings of the IEEE (01.01.2019)
Get full text
Journal Article
Thermal-Aware On-Line Scheduler for 3-D Many-Core Processor Throughput Optimization
Yu, Cody Hao, Chiao-Ling Lung, Yi-Lun Ho, Ruei-Siang Hsu, Ding-Ming Kwai, Shih-Chieh Chang
Published in IEEE transactions on computer-aided design of integrated circuits and systems (01.05.2014)
Published in IEEE transactions on computer-aided design of integrated circuits and systems (01.05.2014)
Get full text
Journal Article
Hidet: Task-Mapping Programming Paradigm for Deep Learning Tensor Programs
Ding, Yaoyao, Cody Hao Yu, Zheng, Bojian, Liu, Yizhi, Wang, Yida, Pekhimenko, Gennady
Published in arXiv.org (15.02.2023)
Published in arXiv.org (15.02.2023)
Get full text
Paper
Journal Article
Programming and Runtime Support to Blaze FPGA Accelerator Deployment at Datacenter Scale
Huang, Muhuan, Wu, Di, Yu, Cody Hao, Fang, Zhenman, Interlandi, Matteo, Condie, Tyson, Cong, Jason
Published in Proceedings of the ... ACM Symposium on Cloud Computing [electronic resource] : SOCC ... ... SoCC (Conference) (05.10.2016)
Published in Proceedings of the ... ACM Symposium on Cloud Computing [electronic resource] : SOCC ... ... SoCC (Conference) (05.10.2016)
Get more information
Journal Article
TGPA: Tile-Grained Pipeline Architecture for Low Latency CNN Inference
Wei, Xuechao, Liang, Yun, Li, Xiuhong, Yu, Cody Hao, Zhang, Peng, Cong, Jason
Published in 2018 IEEE/ACM International Conference on Computer-Aided Design (ICCAD) (01.11.2018)
Published in 2018 IEEE/ACM International Conference on Computer-Aided Design (ICCAD) (01.11.2018)
Get full text
Conference Proceeding
Analysis and Optimization of the Implicit Broadcasts in FPGA HLS to Improve Maximum Frequency
Guo, Licheng, Lau, Jason, Chi, Yuze, Wang, Jie, Yu, Cody Hao, Chen, Zhe, Zhang, Zhiru, Cong, Jason
Published in 2020 57th ACM/IEEE Design Automation Conference (DAC) (01.07.2020)
Published in 2020 57th ACM/IEEE Design Automation Conference (DAC) (01.07.2020)
Get full text
Conference Proceeding
Bandwidth optimization through on-chip memory restructuring for HLS
Jason Cong, Peng Wei, Yu, Cody Hao, Peipei Zhou
Published in 2017 54th ACM/EDAC/IEEE Design Automation Conference (DAC) (01.06.2017)
Published in 2017 54th ACM/EDAC/IEEE Design Automation Conference (DAC) (01.06.2017)
Get full text
Conference Proceeding
Invited: Heterogeneous datacenters: Options and opportunities
Cong, Jason, Muhuan Huang, Di Wu, Yu, Cody Hao
Published in 2016 53nd ACM/EDAC/IEEE Design Automation Conference (DAC) (05.06.2016)
Published in 2016 53nd ACM/EDAC/IEEE Design Automation Conference (DAC) (05.06.2016)
Get full text
Conference Proceeding
Automated Deep Learning Optimization via DSL-Based Source Code Transformation
Wang, Ruixin, Lu, Minghai, Yu, Cody Hao, Lai, Yi-Hsiang, Zhang, Tianyi
Year of Publication 05.05.2024
Year of Publication 05.05.2024
Get full text
Journal Article
Latte: Locality Aware Transformation for High-Level Synthesis
Cong, Jason, Wei, Peng, Yu, Cody Hao, Zhou, Peipei
Published in 2018 IEEE 26th Annual International Symposium on Field-Programmable Custom Computing Machines (FCCM) (01.04.2018)
Published in 2018 IEEE 26th Annual International Symposium on Field-Programmable Custom Computing Machines (FCCM) (01.04.2018)
Get full text
Conference Proceeding
Grape: Practical and Efficient Graph-based Executions for Dynamic Deep Neural Networks on GPUs
Zheng, Bojian, Yu, Cody Hao, Wang, Jie, Ding, Yaoyao, Liu, Yizhi, Wang, Yida, Pekhimenko, Gennady
Published in 2023 56th IEEE/ACM International Symposium on Microarchitecture (MICRO) (28.10.2023)
Get full text
Published in 2023 56th IEEE/ACM International Symposium on Microarchitecture (MICRO) (28.10.2023)
Conference Proceeding
Slapo: A Schedule Language for Progressive Optimization of Large Deep Learning Model Training
Chen, Hongzheng, Yu, Cody Hao, Zheng, Shuai, Zhang, Zhen, Zhang, Zhiru, Wang, Yida
Year of Publication 15.02.2023
Year of Publication 15.02.2023
Get full text
Journal Article
Efficient Memory Management for Large Language Model Serving with PagedAttention
Kwon, Woosuk, Li, Zhuohan, Zhuang, Siyuan, Sheng, Ying, Zheng, Lianmin, Yu, Cody Hao, Gonzalez, Joseph E, Zhang, Hao, Stoica, Ion
Year of Publication 12.09.2023
Year of Publication 12.09.2023
Get full text
Journal Article
SGLang: Efficient Execution of Structured Language Model Programs
Zheng, Lianmin, Yin, Liangsheng, Xie, Zhiqiang, Sun, Chuyue, Huang, Jeff, Yu, Cody Hao, Cao, Shiyi, Kozyrakis, Christos, Stoica, Ion, Gonzalez, Joseph E, Barrett, Clark, Sheng, Ying
Year of Publication 12.12.2023
Year of Publication 12.12.2023
Get full text
Journal Article
Impact of loop transformations on software reliability
Cong, Jason, Yu, Cody Hao
Published in 2015 IEEE/ACM International Conference on Computer-Aided Design (ICCAD) (01.11.2015)
Published in 2015 IEEE/ACM International Conference on Computer-Aided Design (ICCAD) (01.11.2015)
Get full text
Conference Proceeding
RAF: Holistic Compilation for Deep Learning Model Training
Yu, Cody Hao, Fan, Haozheng, Huang, Guangtai, Jia, Zhen, Liu, Yizhi, Wang, Jie, Zheng, Zach, Zhou, Yuan, Shen, Haichen, Shao, Junru, Li, Mu, Wang, Yida
Year of Publication 08.03.2023
Year of Publication 08.03.2023
Get full text
Journal Article