Sustainable AI Processing at the Edge
Ollivier, Sébastien, Li, Sheng, Tang, Yue, Chaudhuri, Chayanika, Zhou, Peipei, Tang, Xulong, Hu, Jingtong, Jones, Alex K
Year of Publication 04.07.2022
Year of Publication 04.07.2022
Get full text
Journal Article
YOLObile: Real-Time Object Detection on Mobile Devices via Compression-Compilation Co-Design
Cai, Yuxuan, Li, Hongjia, Yuan, Geng, Niu, Wei, Li, Yanyu, Tang, Xulong, Ren, Bin, Wang, Yanzhi
Year of Publication 11.09.2020
Year of Publication 11.09.2020
Get full text
Journal Article
Improving Multi-Instance GPU Efficiency via Sub-Entry Sharing TLB Design
Li, Bingyao, Wang, Yueqi, Wang, Tianyu, Eeckhout, Lieven, Yang, Jun, Jaleel, Aamer, Tang, Xulong
Published in arXiv.org (29.04.2024)
Get full text
Published in arXiv.org (29.04.2024)
Paper
Automatic Mapping of the Best-Suited DNN Pruning Schemes for Real-Time Mobile Acceleration
Gong, Yifan, Yuan, Geng, Zhan, Zheng, Niu, Wei, Li, Zhengang, Zhao, Pu, Cai, Yuxuan, Liu, Sijia, Ren, Bin, Lin, Xue, Tang, Xulong, Wang, Yanzhi
Year of Publication 22.11.2021
Year of Publication 22.11.2021
Get full text
Journal Article
SmartFRZ: An Efficient Training Framework using Attention-Based Layer Freezing
Li, Sheng, Geng Yuan, Dai, Yue, Zhang, Youtao, Wang, Yanzhi, Tang, Xulong
Published in arXiv.org (30.01.2024)
Get full text
Published in arXiv.org (30.01.2024)
Paper
Minimizing Photonic Cluster State Depth in Measurement-Based Quantum Computing
Li, Yingheng, Pawar, Aditya, Zewei Mo, Zhang, Youtao, Yang, Jun, Tang, Xulong
Published in arXiv.org (18.12.2023)
Get full text
Published in arXiv.org (18.12.2023)
Paper
Integrated Qubit Reuse and Circuit Cutting for Large Quantum Circuit Evaluation
Pawar, Aditya, Li, Yingheng, Zewei Mo, Guo, Yanan, Zhang, Youtao, Tang, Xulong, Yang, Jun
Published in arXiv.org (16.12.2023)
Get full text
Published in arXiv.org (16.12.2023)
Paper
Improving GPU Multi-Tenancy Through Dynamic Multi-Instance GPU Reconfiguration
Wang, Tianyu, Li, Sheng, Li, Bingyao, Dai, Yue, Ao, Li, Geng Yuan, Ding, Yufei, Zhang, Youtao, Tang, Xulong
Published in arXiv.org (18.07.2024)
Get full text
Published in arXiv.org (18.07.2024)
Paper
Demystifying Arch-hints for Model Extraction: An Attack in Unified Memory System
Wang, Zhendong, Zeng, Xiaoming, Tang, Xulong, Zhang, Danfeng, Hu, Xing, Hu, Yang
Published in arXiv.org (29.08.2022)
Get full text
Published in arXiv.org (29.08.2022)
Paper
SupeRBNN: Randomized Binary Neural Network Using Adiabatic Superconductor Josephson Devices
Li, Zhengang, Geng Yuan, Yamauchi, Tomoharu, Zabihi Masoud, Xie, Yanyue, Dong, Peiyan, Tang, Xulong, Yoshikawa, Nobuyuki, Tiwari, Devesh, Wang, Yanzhi, Chen, Olivia
Published in arXiv.org (21.09.2023)
Get full text
Published in arXiv.org (21.09.2023)
Paper
Automated Runtime-Aware Scheduling for Multi-Tenant DNN Inference on GPU
Yu, Fuxun, Bray, Shawn, Wang, Di, Shangguan, Longfei, Tang, Xulong, Liu, Chenchen, Chen, Xiang
Published in arXiv.org (28.11.2021)
Get full text
Published in arXiv.org (28.11.2021)
Paper