Graph neural networks meet with distributed graph partitioners and reconciliations
Mu, Zongshen, Tang, Siliang, Zong, Chang, Yu, Dianhai, Zhuang, Yueting
Published in Neurocomputing (Amsterdam) (21.01.2023)
Published in Neurocomputing (Amsterdam) (21.01.2023)
Get full text
Journal Article
Exploiting Cross-Modal Prediction and Relation Consistency for Semisupervised Image Captioning
Yang, Yang, Wei, Hongchen, Zhu, Hengshu, Yu, Dianhai, Xiong, Hui, Yang, Jian
Published in IEEE transactions on cybernetics (01.02.2024)
Published in IEEE transactions on cybernetics (01.02.2024)
Get full text
Journal Article
HeterPS: Distributed deep learning with reinforcement learning based scheduling in heterogeneous environments
Liu, Ji, Wu, Zhihua, Feng, Danlei, Zhang, Minxu, Wu, Xinxuan, Yao, Xuefeng, Yu, Dianhai, Ma, Yanjun, Zhao, Feng, Dou, Dejing
Published in Future generation computer systems (01.11.2023)
Published in Future generation computer systems (01.11.2023)
Get full text
Journal Article
MoESys: A Distributed and Efficient Mixture-of-Experts Training and Inference System for Internet Services
Yu, Dianhai, Shen, Liang, Hao, Hongxiang, Gong, Weibao, Wu, Huachao, Bian, Jiang, Dai, Lirong, Xiong, Haoyi
Published in IEEE transactions on services computing (01.09.2024)
Published in IEEE transactions on services computing (01.09.2024)
Get full text
Journal Article
Large‐scale knowledge distillation with elastic heterogeneous computing resources
Liu, Ji, Dong, Daxiang, Wang, Xi, Qin, An, Li, Xingjian, Valduriez, Patrick, Dou, Dejing, Yu, Dianhai
Published in Concurrency and computation (30.11.2023)
Published in Concurrency and computation (30.11.2023)
Get full text
Journal Article
Distributed training for Conditional Random Fields
Xiaojun Lin, Liang Zhao, Dianhai Yu, Xihong Wu
Published in Proceedings of the 6th International Conference on Natural Language Processing and Knowledge Engineering(NLPKE-2010) (01.08.2010)
Published in Proceedings of the 6th International Conference on Natural Language Processing and Knowledge Engineering(NLPKE-2010) (01.08.2010)
Get full text
Conference Proceeding
A Framework for Cost-Effective and Self-Adaptive LLM Shaking and Recovery Mechanism
Chen, Zhiyu, Li, Yu, Zhang, Suochao, Zhou, Jingbo, Zhou, Jiwen, Bao, Chenfu, Yu, Dianhai
Year of Publication 11.03.2024
Year of Publication 11.03.2024
Get full text
Journal Article
FlashMask: Efficient and Rich Mask Extension of FlashAttention
Wang, Guoxia, Zeng, Jinle, Xiao, Xiyuan, Wu, Siming, Yang, Jiabin, Zheng, Lujing, Chen, Zeyu, Bian, Jiang, Yu, Dianhai, Wang, Haifeng
Year of Publication 02.10.2024
Year of Publication 02.10.2024
Get full text
Journal Article
NACL: A General and Effective KV Cache Eviction Framework for LLMs at Inference Time
Chen, Yilong, Wang, Guoxia, Shang, Junyuan, Cui, Shiyao, Zhang, Zhenyu, Liu, Tingwen, Wang, Shuohuan, Sun, Yu, Yu, Dianhai, Wu, Hua
Year of Publication 07.08.2024
Year of Publication 07.08.2024
Get full text
Journal Article
Spectral Heterogeneous Graph Convolutions via Positive Noncommutative Polynomials
He, Mingguo, Wei, Zhewei, Feng, Shikun, Huang, Zhengjie, Li, Weibin, Sun, Yu, Yu, Dianhai
Year of Publication 31.05.2023
Year of Publication 31.05.2023
Get full text
Journal Article
PP-YOLOE-R: An Efficient Anchor-Free Rotated Object Detector
Wang, Xinxin, Wang, Guanzhong, Dang, Qingqing, Liu, Yi, Hu, Xiaoguang, Yu, Dianhai
Year of Publication 04.11.2022
Year of Publication 04.11.2022
Get full text
Journal Article
Efficient AlphaFold2 Training using Parallel Evoformer and Branch Parallelism
Wang, Guoxia, Wu, Zhihua, Fang, Xiaomin, Xiang, Yingfei, Liu, Yiqun, Yu, Dianhai, Ma, Yanjun
Year of Publication 31.10.2022
Year of Publication 31.10.2022
Get full text
Journal Article
Boosting Distributed Training Performance of the Unpadded BERT Model
Zeng, Jinle, Li, Min, Wu, Zhihua, Liu, Jiaqi, Liu, Yuang, Yu, Dianhai, Ma, Yanjun
Year of Publication 17.08.2022
Year of Publication 17.08.2022
Get full text
Journal Article
Large-scale Knowledge Distillation with Elastic Heterogeneous Computing Resources
Liu, Ji, Dong, Daxiang, Wang, Xi, Qin, An, Li, Xingjian, Valduriez, Patrick, Dou, Dejing, Yu, Dianhai
Year of Publication 14.07.2022
Year of Publication 14.07.2022
Get full text
Journal Article
MoESys: A Distributed and Efficient Mixture-of-Experts Training and Inference System for Internet Services
Yu, Dianhai, Shen, Liang, Hao, Hongxiang, Gong, Weibao, Wu, Huachao, Bian, Jiang, Dai, Lirong, Xiong, Haoyi
Year of Publication 20.05.2022
Year of Publication 20.05.2022
Get full text
Journal Article