Towards Highly Efficient DGEMM on the Emerging SW26010 Many-Core Processor
Lijuan Jiang, Chao Yang, Yulong Ao, Wanwang Yin, Wenjing Ma, Qiao Sun, Fangfang Liu, Rongfen Lin, Peng Zhang
Published in 2017 46th International Conference on Parallel Processing (ICPP) (01.08.2017)
Published in 2017 46th International Conference on Parallel Processing (ICPP) (01.08.2017)
Get full text
Conference Proceeding
Pattern-Driven Hybrid Multi- and Many-Core Acceleration in the MPAS Shallow-Water Model
Zhang, Peng, Ao, Yulong, Yang, Chao, Liu, Yiqun, Liu, Fangfang, Wu, Changmao, Zhao, Haitao
Published in 2015 44th International Conference on Parallel Processing (01.09.2015)
Published in 2015 44th International Conference on Parallel Processing (01.09.2015)
Get full text
Conference Proceeding
Journal Article
10M-Core Scalable Fully-Implicit Solver for Nonhydrostatic Atmospheric Dynamics
Chao Yang, Wei Xue, Haohuan Fu, Hongtao You, Xinliang Wang, Yulong Ao, Fangfang Liu, Lin Gan, Ping Xu, Lanning Wang, Guangwen Yang, Weimin Zheng
Published in SC16: International Conference for High Performance Computing, Networking, Storage and Analysis (01.11.2016)
Published in SC16: International Conference for High Performance Computing, Networking, Storage and Analysis (01.11.2016)
Get full text
Conference Proceeding
Adaptive SpMV/SpMSpV on GPUs for Input Vectors of Varied Sparsity
Li, Min, Ao, Yulong, Yang, Chao
Published in IEEE transactions on parallel and distributed systems (01.07.2021)
Published in IEEE transactions on parallel and distributed systems (01.07.2021)
Get full text
Journal Article
26 PFLOPS Stencil Computations for Atmospheric Modeling on Sunway TaihuLight
Yulong Ao, Chao Yang, Xinliang Wang, Wei Xue, Haohuan Fu, Fangfang Liu, Lin Gan, Ping Xu, Wenjing Ma
Published in 2017 IEEE International Parallel and Distributed Processing Symposium (IPDPS) (01.05.2017)
Published in 2017 IEEE International Parallel and Distributed Processing Symposium (IPDPS) (01.05.2017)
Get full text
Conference Proceeding
Aquila2 Technical Report
Zhang, Bo-Wen, Wang, Liangdong, Li, Jijie, Gu, Shuhao, Wu, Xinya, Zhang, Zhengduo, Gao, Boyan, Ao, Yulong, Liu, Guang
Year of Publication 14.08.2024
Year of Publication 14.08.2024
Get full text
Journal Article
Emu3: Next-Token Prediction is All You Need
Wang, Xinlong, Zhang, Xiaosong, Luo, Zhengxiong, Sun, Quan, Cui, Yufeng, Wang, Jinsheng, Zhang, Fan, Wang, Yueze, Li, Zhen, Yu, Qiying, Zhao, Yingli, Ao, Yulong, Min, Xuebin, Li, Tao, Wu, Boya, Zhao, Bo, Zhang, Bowen, Wang, Liangdong, Liu, Guang, He, Zheqi, Yang, Xi, Liu, Jingjing, Lin, Yonghua, Huang, Tiejun, Wang, Zhongyuan
Year of Publication 27.09.2024
Year of Publication 27.09.2024
Get full text
Journal Article
AquilaMoE: Efficient Training for MoE Models with Scale-Up and Scale-Out Strategies
Zhang, Bo-Wen, Wang, Liangdong, Yuan, Ye, Li, Jijie, Gu, Shuhao, Zhao, Mengdi, Wu, Xinya, Liu, Guang, Wu, Chengwei, Zhao, Hanyu, Du, Li, Ju, Yiming, Ma, Quanyue, Ao, Yulong, Zhao, Yingli, Zhu, Songhe, Cao, Zhou, Liang, Dong, Lin, Yonghua, Zhang, Ming, Wang, Shunfei, Zhou, Yanxin, Ye, Min, Chen, Xuekai, Yu, Xinyang, Huang, Xiangjun, Yang, Jian
Year of Publication 12.08.2024
Year of Publication 12.08.2024
Get full text
Journal Article
End-to-end Adaptive Distributed Training on PaddlePaddle
Ao, Yulong, Wu, Zhihua, Yu, Dianhai, Gong, Weibao, Kui, Zhiqing, Zhang, Minxu, Ye, Zilingfeng, Shen, Liang, Ma, Yanjun, Wu, Tian, Wang, Haifeng, Zeng, Wei, Yang, Chao
Year of Publication 05.12.2021
Year of Publication 05.12.2021
Get full text
Journal Article
Aquila2 Technical Report
Bo-Wen, Zhang, Wang, Liangdong, Li, Jijie, Gu, Shuhao, Wu, Xinya, Zhang, Zhengduo, Gao, Boyan, Ao, Yulong, Liu, Guang
Published in arXiv.org (14.08.2024)
Get full text
Published in arXiv.org (14.08.2024)
Paper
Emu3: Next-Token Prediction is All You Need
Wang, Xinlong, Zhang, Xiaosong, Luo, Zhengxiong, Sun, Quan, Cui, Yufeng, Wang, Jinsheng, Zhang, Fan, Wang, Yueze, Li, Zhen, Yu, Qiying, Zhao, Yingli, Ao, Yulong, Min, Xuebin, Li, Tao, Wu, Boya, Zhao, Bo, Bowen, Zhang, Wang, Liangdong, Liu, Guang, He, Zheqi, Yang, Xi, Liu, Jingjing, Lin, Yonghua, Huang, Tiejun, Wang, Zhongyuan
Published in arXiv.org (27.09.2024)
Get full text
Published in arXiv.org (27.09.2024)
Paper
AquilaMoE: Efficient Training for MoE Models with Scale-Up and Scale-Out Strategies
Bo-Wen, Zhang, Wang, Liangdong, Ye Yuan, Li, Jijie, Gu, Shuhao, Zhao, Mengdi, Wu, Xinya, Liu, Guang, Wu, Chengwei, Zhao, Hanyu, Du, Li, Ju, Yiming, Ma, Quanyue, Ao, Yulong, Zhao, Yingli, Zhu, Songhe, Cao, Zhou, Liang, Dong, Lin, Yonghua, Zhang, Ming, Wang, Shunfei, Zhou, Yanxin, Ye, Min, Chen, Xuekai, Yu, Xinyang, Huang, Xiangjun, Yang, Jian
Published in arXiv.org (13.08.2024)
Get full text
Published in arXiv.org (13.08.2024)
Paper
End-to-end Adaptive Distributed Training on PaddlePaddle
Ao, Yulong, Wu, Zhihua, Yu, Dianhai, Gong, Weibao, Zhiqing Kui, Zhang, Minxu, Ye, Zilingfeng, Shen, Liang, Ma, Yanjun, Wu, Tian, Wang, Haifeng, Zeng, Wei, Yang, Chao
Published in arXiv.org (06.12.2021)
Get full text
Published in arXiv.org (06.12.2021)
Paper