Collaborative image compression and classification with multi-task learning for visual Internet of Things

Widespread deployment of the Internet of Things (IoT) has changed the way that network services are developed, deployed, and operated. Most onboard advanced IoT devices are equipped with visual sensors that form the so-called visual IoT. Typically, the sender would compress images, and then through...

Full description

Saved in:
Bibliographic Details
Published inChinese journal of aeronautics Vol. 35; no. 5; pp. 390 - 399
Main Authors DU, Bing, DUAN, Yiping, ZHANG, Hang, TAO, Xiaoming, WU, Yue, RU, Congchong
Format Journal Article
LanguageEnglish
Published Elsevier Ltd 01.05.2022
School of Computer and Communication Engineering,University of Science and Technology Beijing,Beijing 100083,China%Department of Electronic Engineering,Tsinghua University,Beijing 100084,China%School of Computer Science and Technology,Xidian University,Xian 710071,China%Smart Network Computing Lab,Cross-strait Tsinghua Research Institution,Beijing 100084,China
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:Widespread deployment of the Internet of Things (IoT) has changed the way that network services are developed, deployed, and operated. Most onboard advanced IoT devices are equipped with visual sensors that form the so-called visual IoT. Typically, the sender would compress images, and then through the communication network, the receiver would decode images, and then analyze the images for applications. However, image compression and semantic inference are generally conducted separately, and thus, current compression algorithms cannot be transplanted for the use of semantic inference directly. A collaborative image compression and classification framework for visual IoT applications is proposed, which combines image compression with semantic inference by using multi-task learning. In particular, the multi-task Generative Adversarial Networks (GANs) are described, which include encoder, quantizer, generator, discriminator, and classifier to conduct simultaneously image compression and classification. The key to the proposed framework is the quantized latent representation used for compression and classification. GANs with perceptual quality can achieve low bitrate compression and reduce the amount of data transmitted. In addition, the design in which two tasks share the same feature can greatly reduce computing resources, which is especially applicable for environments with extremely limited resources. Using extensive experiments, the collaborative compression and classification framework is effective and useful for visual IoT applications.
ISSN:1000-9361
DOI:10.1016/j.cja.2021.10.003