EENet: Energy Efficient Neural Networks with Run-time Power Management
Deep learning approaches, such as convolution neural networks (CNNs), have achieved tremendous success in versatile applications. However, one of the challenges to deploy the deep learning models on resource-constrained systems is its huge energy cost. As a dynamic inference approach, early exit add...
Saved in:
Published in | 2023 60th ACM/IEEE Design Automation Conference (DAC) pp. 1 - 6 |
---|---|
Main Authors | , , , |
Format | Conference Proceeding |
Language | English |
Published |
IEEE
09.07.2023
|
Subjects | |
Online Access | Get full text |
DOI | 10.1109/DAC56929.2023.10247701 |
Cover
Abstract | Deep learning approaches, such as convolution neural networks (CNNs), have achieved tremendous success in versatile applications. However, one of the challenges to deploy the deep learning models on resource-constrained systems is its huge energy cost. As a dynamic inference approach, early exit adds exiting layers to the networks, which can terminate the inference earlier with accurate results to save energy. The current passive decision-making for energy regulation of early exit cannot adapt to ongoing inference status, varying inference workloads, and timing constraints, let alone guide the reasonable configuration of the computing platforms alongside the inference proceeds for potential energy saving. In this paper, we propose an Energy Efficient Neural Networks (EENet), which introduces a plug-in module to the state-of-the-art networks by incorporating run-time power management. Within each inference, we establish prediction of where the network will exit and adjust computing configurations (i.e., frequency and voltage) accordingly over a small timescale. Considering multiple inferences over a large timescale, we provide frequency and voltage calibration advice, given inference workloads and timing constraints. Finally, the dynamic voltage and frequency scaling (DVFS) governor configures voltage and frequency to execute the network according to the prediction and calibration. Extensive experimental results demonstrate that EENet achieves up to 63.8% energy-saving compared with classic deep learning networks and 21.5% energy-saving compared with the early exit under state-of-the-art exiting strategies, together with improved timing performance. |
---|---|
AbstractList | Deep learning approaches, such as convolution neural networks (CNNs), have achieved tremendous success in versatile applications. However, one of the challenges to deploy the deep learning models on resource-constrained systems is its huge energy cost. As a dynamic inference approach, early exit adds exiting layers to the networks, which can terminate the inference earlier with accurate results to save energy. The current passive decision-making for energy regulation of early exit cannot adapt to ongoing inference status, varying inference workloads, and timing constraints, let alone guide the reasonable configuration of the computing platforms alongside the inference proceeds for potential energy saving. In this paper, we propose an Energy Efficient Neural Networks (EENet), which introduces a plug-in module to the state-of-the-art networks by incorporating run-time power management. Within each inference, we establish prediction of where the network will exit and adjust computing configurations (i.e., frequency and voltage) accordingly over a small timescale. Considering multiple inferences over a large timescale, we provide frequency and voltage calibration advice, given inference workloads and timing constraints. Finally, the dynamic voltage and frequency scaling (DVFS) governor configures voltage and frequency to execute the network according to the prediction and calibration. Extensive experimental results demonstrate that EENet achieves up to 63.8% energy-saving compared with classic deep learning networks and 21.5% energy-saving compared with the early exit under state-of-the-art exiting strategies, together with improved timing performance. |
Author | Zou, An Shen, Yingtao Ma, Yehan Li, Xiangjie |
Author_xml | – sequence: 1 givenname: Xiangjie surname: Li fullname: Li, Xiangjie organization: Shanghai Jiao Tong University – sequence: 2 givenname: Yingtao surname: Shen fullname: Shen, Yingtao organization: Shanghai Jiao Tong University – sequence: 3 givenname: An surname: Zou fullname: Zou, An organization: Shanghai Jiao Tong University – sequence: 4 givenname: Yehan surname: Ma fullname: Ma, Yehan organization: Shanghai Jiao Tong University |
BookMark | eNo1z91KwzAYxvEICursHYjkBjrffLWJZ6N2Kswposcjzd7M4JpK2lF29xbUo__JjweeS3Iau4iE3DCYMwbm9n5RqcJwM-fAxZwBl2UJ7IRkpjRaKBBcSM3OSdb3oYEClJZQyAuyrOs1Dne0jph2R1p7H1zAONA1HpLdTxnGLn31dAzDJ307xHwILdLXbsREn220O2wnfkXOvN33mP11Rj6W9Xv1mK9eHp6qxSq33MCQa2O9VZI54QsE40pRSGis4xLcFhrOFfOaKYGTdl4qVaKd8FY0GgSiFjNy_bsbEHHznUJr03Hzf1f8ADEOTI8 |
ContentType | Conference Proceeding |
DBID | 6IE 6IH CBEJK RIE RIO |
DOI | 10.1109/DAC56929.2023.10247701 |
DatabaseName | IEEE Electronic Library (IEL) Conference Proceedings IEEE Proceedings Order Plan (POP) 1998-present by volume IEEE Xplore All Conference Proceedings IEEE Electronic Library (IEL) IEEE Proceedings Order Plans (POP) 1998-present |
DatabaseTitleList | |
Database_xml | – sequence: 1 dbid: RIE name: IEEE/IET Electronic Library url: https://proxy.k.utb.cz/login?url=https://ieeexplore.ieee.org/ sourceTypes: Publisher |
DeliveryMethod | fulltext_linktorsrc |
EISBN | 9798350323481 |
EndPage | 6 |
ExternalDocumentID | 10247701 |
Genre | orig-research |
GroupedDBID | 6IE 6IH ACM ALMA_UNASSIGNED_HOLDINGS CBEJK RIE RIO |
ID | FETCH-LOGICAL-a290t-89afa541c3f6e09c73640bac240cd0b2251f8153e290cf4557ea1c3d3b803ee83 |
IEDL.DBID | RIE |
IngestDate | Wed Aug 27 02:51:00 EDT 2025 |
IsPeerReviewed | false |
IsScholarly | true |
Language | English |
LinkModel | DirectLink |
MergedId | FETCHMERGED-LOGICAL-a290t-89afa541c3f6e09c73640bac240cd0b2251f8153e290cf4557ea1c3d3b803ee83 |
PageCount | 6 |
ParticipantIDs | ieee_primary_10247701 |
PublicationCentury | 2000 |
PublicationDate | 2023-July-9 |
PublicationDateYYYYMMDD | 2023-07-09 |
PublicationDate_xml | – month: 07 year: 2023 text: 2023-July-9 day: 09 |
PublicationDecade | 2020 |
PublicationTitle | 2023 60th ACM/IEEE Design Automation Conference (DAC) |
PublicationTitleAbbrev | DAC |
PublicationYear | 2023 |
Publisher | IEEE |
Publisher_xml | – name: IEEE |
SSID | ssib060584064 |
Score | 2.2320938 |
Snippet | Deep learning approaches, such as convolution neural networks (CNNs), have achieved tremendous success in versatile applications. However, one of the... |
SourceID | ieee |
SourceType | Publisher |
StartPage | 1 |
SubjectTerms | Deep learning Early Exit Energy efficiency Feedback Control Inference Time Neural networks Potential energy Power system management Regulation Time-frequency analysis |
Title | EENet: Energy Efficient Neural Networks with Run-time Power Management |
URI | https://ieeexplore.ieee.org/document/10247701 |
hasFullText | 1 |
inHoldings | 1 |
isFullTextHit | |
isPrint | |
link | http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwjV3PS8MwGA26kycVJ_4mB6-paZM2iTeZHUNwDHGw20iTrxelE20v_vV-yVaHguCppSS0-dVH8r33PUKuoUYckSJj1uiaSSkk05UWLK9x75w63EO7oB1-nBaTuXxY5IuNWD1qYQAgks8gCbcxlu9XrgtHZbjCM6lUUGvt4jxbi7X6yRPCewhOcqMCTrm5ub8b5QXCfxIswpO-8g8blYgi430y7d-_Jo-8JF1bJe7zV2rGf3_gARluBXt09g1Fh2QHmiMyLssptLe0jPI-WsZkEVifhoQc9hUvkQH-QcNZLH3qGhaM5uks-KbRLS1mSObj8nk0YRvbBGYzw1umja1tjh0t6gK4cUoUklfWIXY7zytcwGmt8UcHWNrVMs8VWCzsRaW5ANDimAyaVQMnhErtlAmh2iL1EoyzquDWp-CV1ZnX9pQMQycs39aZMZZ9-8_-eH5O9sJYRLqruSCD9r2DSwT1trqKg_kFwhuffQ |
linkProvider | IEEE |
linkToHtml | http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwjV3PS8MwGA0yD3pSceJvc_Ca2jZpm3iT2TF1K0M22G2k6deL0om2F_96v6SrQ0Hw1FISSPOjr8n33vcIuYYScUTwkGklSyYEF0zmkrOoxL1zYHAPbax2eJLFo7l4XESLtVjdaWEAwJHPwLO3LpZfrExjj8pwhYciSaxaaxuBX0StXKubPjbAh_Ak1jrgwFc393eDKMYfAM-ahHtd9R9GKg5Hhnsk61rQ0kdevKbOPfP5Kznjv5u4T_obyR6dfoPRAdmC6pAM0zSD-pamTuBHU5cuAutTm5JDv-LFccA_qD2Npc9NxazVPJ1a5zS6Icb0yXyYzgYjtjZOYDpUfs2k0qWOsKt5GYOvTMJj4efaIHqbws9xCQelxE8dYGlTiihKQGPhgufS5wCSH5FetargmFAhTaJssDYOCgHK6CT2dRFAkWgZFlKfkL7thOVbmxtj2b3_6R_Pr8jOaDYZL8cP2dMZ2bXj4siv6pz06vcGLhDi6_zSDewXTyaiyg |
openUrl | ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Abook&rft.genre=proceeding&rft.title=2023+60th+ACM%2FIEEE+Design+Automation+Conference+%28DAC%29&rft.atitle=EENet%3A+Energy+Efficient+Neural+Networks+with+Run-time+Power+Management&rft.au=Li%2C+Xiangjie&rft.au=Shen%2C+Yingtao&rft.au=Zou%2C+An&rft.au=Ma%2C+Yehan&rft.date=2023-07-09&rft.pub=IEEE&rft.spage=1&rft.epage=6&rft_id=info:doi/10.1109%2FDAC56929.2023.10247701&rft.externalDocID=10247701 |