Model-driven Cluster Resource Management for AI Workloads in Edge Clouds

Since emerging edge applications such as Internet of Things (IoT) analytics and augmented reality have tight latency constraints, hardware AI accelerators have been recently proposed to speed up deep neural network (DNN) inference run by these applications. Resource-constrained edge servers and acce...

Bibliographic Details
Published in ACM Transactions on Autonomous and Adaptive Systems Vol. 18; no. 1; pp. 1-26
Main Authors Liang, Qianlin; Hanafy, Walid A.; Ali-Eldin, Ahmed; Shenoy, Prashant
Format Journal Article
Language English
Published New York, NY ACM 27.03.2023
ISSN 1556-4665, 1556-4703
DOI 10.1145/3582080
