NestDNN: Resource-Aware Multi-Tenant On-Device Deep Learning for Continuous Mobile Vision

Mobile vision systems such as smartphones, drones, and augmented-reality headsets are revolutionizing our lives. These systems usually run multiple applications concurrently and their available resources at runtime are dynamic due to events such as starting new applications, closing existing applica...

Full description

Saved in:

Bibliographic Details
Published in	arXiv.org
Main Authors	Fang, Biyi, Zeng, Xiao, Zhang, Mi
Format	Paper Journal Article
Language	English
Published	Ithaca Cornell University Library, arXiv.org 23.10.2018
Subjects	Accuracy Augmented reality Computer Science - Computer Vision and Pattern Recognition Deep learning Earphones Energy consumption Model accuracy Run time (computers) Smartphones Tradeoffs Vision systems
Online Access	Get full text

Cover

Loading…

More Information
Summary:	Mobile vision systems such as smartphones, drones, and augmented-reality headsets are revolutionizing our lives. These systems usually run multiple applications concurrently and their available resources at runtime are dynamic due to events such as starting new applications, closing existing applications, and application priority changes. In this paper, we present NestDNN, a framework that takes the dynamics of runtime resources into account to enable resource-aware multi-tenant on-device deep learning for mobile vision systems. NestDNN enables each deep learning model to offer flexible resource-accuracy trade-offs. At runtime, it dynamically selects the optimal resource-accuracy trade-off for each deep learning model to fit the model's resource demand to the system's available runtime resources. In doing so, NestDNN efficiently utilizes the limited resources in mobile vision systems to jointly maximize the performance of all the concurrently running applications. Our experiments show that compared to the resource-agnostic status quo approach, NestDNN achieves as much as 4.2% increase in inference accuracy, 2.0x increase in video frame processing rate and 1.7x reduction on energy consumption.
ISSN:	2331-8422
DOI:	10.48550/arxiv.1810.10090