Overload Control for Scaling WeChat Microservices
Effective overload control for large-scale online service system is crucial for protecting the system backend from overload. Conventionally, the design of overload control is ad-hoc for individual service. However, service-specific overload control could be detrimental to the overall system due to i...
Saved in:
Published in | arXiv.org |
---|---|
Main Authors | , , , , , , , , |
Format | Paper Journal Article |
Language | English |
Published |
Ithaca
Cornell University Library, arXiv.org
24.12.2018
|
Subjects | |
Online Access | Get full text |
Cover
Loading…
Abstract | Effective overload control for large-scale online service system is crucial for protecting the system backend from overload. Conventionally, the design of overload control is ad-hoc for individual service. However, service-specific overload control could be detrimental to the overall system due to intricate service dependencies or flawed implementation of service. Service developers usually have difficulty to accurately estimate the dynamics of actual workload during the development of service. Therefore, it is essential to decouple the overload control from service logic. In this paper, we propose DAGOR, an overload control scheme designed for the account-oriented microservice architecture. DAGOR is service agnostic and system-centric. It manages overload at the microservice granule such that each microservice monitors its load status in real time and triggers load shedding in a collaborative manner among its relevant services when overload is detected. DAGOR has been used in the WeChat backend for five years. Experimental results show that DAGOR can benefit high success rate of service even when the system is experiencing overload, while ensuring fairness in the overload control. |
---|---|
AbstractList | ACM SoCC (2018) 149-161 Effective overload control for large-scale online service system is crucial
for protecting the system backend from overload. Conventionally, the design of
overload control is ad-hoc for individual service. However, service-specific
overload control could be detrimental to the overall system due to intricate
service dependencies or flawed implementation of service. Service developers
usually have difficulty to accurately estimate the dynamics of actual workload
during the development of service. Therefore, it is essential to decouple the
overload control from service logic. In this paper, we propose DAGOR, an
overload control scheme designed for the account-oriented microservice
architecture. DAGOR is service agnostic and system-centric. It manages overload
at the microservice granule such that each microservice monitors its load
status in real time and triggers load shedding in a collaborative manner among
its relevant services when overload is detected. DAGOR has been used in the
WeChat backend for five years. Experimental results show that DAGOR can benefit
high success rate of service even when the system is experiencing overload,
while ensuring fairness in the overload control. Effective overload control for large-scale online service system is crucial for protecting the system backend from overload. Conventionally, the design of overload control is ad-hoc for individual service. However, service-specific overload control could be detrimental to the overall system due to intricate service dependencies or flawed implementation of service. Service developers usually have difficulty to accurately estimate the dynamics of actual workload during the development of service. Therefore, it is essential to decouple the overload control from service logic. In this paper, we propose DAGOR, an overload control scheme designed for the account-oriented microservice architecture. DAGOR is service agnostic and system-centric. It manages overload at the microservice granule such that each microservice monitors its load status in real time and triggers load shedding in a collaborative manner among its relevant services when overload is detected. DAGOR has been used in the WeChat backend for five years. Experimental results show that DAGOR can benefit high success rate of service even when the system is experiencing overload, while ensuring fairness in the overload control. |
Author | Zhou, Hao Chen, Ming Lin, Qian Liu, Sifan Gu, Rui Beng Chin Ooi She, Xiaobin Wang, Yong Yang, Junfeng |
Author_xml | – sequence: 1 givenname: Hao surname: Zhou fullname: Zhou, Hao – sequence: 2 givenname: Ming surname: Chen fullname: Chen, Ming – sequence: 3 givenname: Qian surname: Lin fullname: Lin, Qian – sequence: 4 givenname: Yong surname: Wang fullname: Wang, Yong – sequence: 5 givenname: Xiaobin surname: She fullname: She, Xiaobin – sequence: 6 givenname: Sifan surname: Liu fullname: Liu, Sifan – sequence: 7 givenname: Rui surname: Gu fullname: Gu, Rui – sequence: 8 fullname: Beng Chin Ooi – sequence: 9 givenname: Junfeng surname: Yang fullname: Yang, Junfeng |
BackLink | https://doi.org/10.48550/arXiv.1806.04075$$DView paper in arXiv https://doi.org/10.1145/3267809.3267823$$DView published paper (Access to full text may be restricted) |
BookMark | eNotj81Kw0AYRQdRsNY-gCsDrhO_-Z8sJfgHlS4suAyTmS-aEjN1Ji369sbW1d0cLudckNMhDEjIFYVCGCnh1sbvbl9QA6oAAVqekBnjnOZGMHZOFiltAIApzaTkM0JXe4x9sD6rwjDG0GdtiNmrs303vGdvWH3YMXvpXAwJ475zmC7JWWv7hIv_nZP1w_26esqXq8fn6m6ZW8lUThlyr1phlDcOLRWm0S2nDIxwrhFGM01t6Xwjm8kLJUWvUZZK-AmR0PA5uT7eHnrqbew-bfyp_7rqQ9dE3ByJbQxfO0xjvQm7OExONQPNuSpBKP4L8zVQhA |
ContentType | Paper Journal Article |
Copyright | 2018. This work is published under http://arxiv.org/licenses/nonexclusive-distrib/1.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License. http://arxiv.org/licenses/nonexclusive-distrib/1.0 |
Copyright_xml | – notice: 2018. This work is published under http://arxiv.org/licenses/nonexclusive-distrib/1.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License. – notice: http://arxiv.org/licenses/nonexclusive-distrib/1.0 |
DBID | 8FE 8FG ABJCF ABUWG AFKRA AZQEC BENPR BGLVJ CCPQU DWQXO HCIFZ L6V M7S PIMPY PQEST PQQKQ PQUKI PRINS PTHSS AKY GOX |
DOI | 10.48550/arxiv.1806.04075 |
DatabaseName | ProQuest SciTech Collection ProQuest Technology Collection Materials Science & Engineering Collection ProQuest Central (Alumni) ProQuest Central UK/Ireland ProQuest Central Essentials ProQuest Central Technology Collection ProQuest One Community College ProQuest Central Korea SciTech Premium Collection ProQuest Engineering Collection Engineering Database Publicly Available Content Database ProQuest One Academic Eastern Edition (DO NOT USE) ProQuest One Academic ProQuest One Academic UKI Edition ProQuest Central China Engineering Collection arXiv Computer Science arXiv.org |
DatabaseTitle | Publicly Available Content Database Engineering Database Technology Collection ProQuest Central Essentials ProQuest One Academic Eastern Edition ProQuest Central (Alumni Edition) SciTech Premium Collection ProQuest One Community College ProQuest Technology Collection ProQuest SciTech Collection ProQuest Central China ProQuest Central ProQuest Engineering Collection ProQuest One Academic UKI Edition ProQuest Central Korea Materials Science & Engineering Collection ProQuest One Academic Engineering Collection |
DatabaseTitleList | Publicly Available Content Database |
Database_xml | – sequence: 1 dbid: GOX name: arXiv.org url: http://arxiv.org/find sourceTypes: Open Access Repository – sequence: 2 dbid: 8FG name: ProQuest Technology Collection url: https://search.proquest.com/technologycollection1 sourceTypes: Aggregation Database |
DeliveryMethod | fulltext_linktorsrc |
Discipline | Physics |
EISSN | 2331-8422 |
ExternalDocumentID | 1806_04075 |
Genre | Working Paper/Pre-Print |
GroupedDBID | 8FE 8FG ABJCF ABUWG AFKRA ALMA_UNASSIGNED_HOLDINGS AZQEC BENPR BGLVJ CCPQU DWQXO FRJ HCIFZ L6V M7S M~E PIMPY PQEST PQQKQ PQUKI PRINS PTHSS AKY GOX |
ID | FETCH-LOGICAL-a526-12e3d6f486d8cea148b7f312084ccb487271a9cdb5b422e51ed7e5964d20850b3 |
IEDL.DBID | 8FG |
IngestDate | Mon Jan 08 05:49:30 EST 2024 Thu Oct 10 18:55:42 EDT 2024 |
IsDoiOpenAccess | true |
IsOpenAccess | true |
IsPeerReviewed | false |
IsScholarly | false |
Language | English |
LinkModel | DirectLink |
MergedId | FETCHMERGED-LOGICAL-a526-12e3d6f486d8cea148b7f312084ccb487271a9cdb5b422e51ed7e5964d20850b3 |
OpenAccessLink | https://www.proquest.com/docview/2073369046?pq-origsite=%requestingapplication% |
PQID | 2073369046 |
PQPubID | 2050157 |
ParticipantIDs | arxiv_primary_1806_04075 proquest_journals_2073369046 |
PublicationCentury | 2000 |
PublicationDate | 20181224 |
PublicationDateYYYYMMDD | 2018-12-24 |
PublicationDate_xml | – month: 12 year: 2018 text: 20181224 day: 24 |
PublicationDecade | 2010 |
PublicationPlace | Ithaca |
PublicationPlace_xml | – name: Ithaca |
PublicationTitle | arXiv.org |
PublicationYear | 2018 |
Publisher | Cornell University Library, arXiv.org |
Publisher_xml | – name: Cornell University Library, arXiv.org |
SSID | ssj0002672553 |
Score | 1.7241658 |
SecondaryResourceType | preprint |
Snippet | Effective overload control for large-scale online service system is crucial for protecting the system backend from overload. Conventionally, the design of... ACM SoCC (2018) 149-161 Effective overload control for large-scale online service system is crucial for protecting the system backend from overload.... |
SourceID | arxiv proquest |
SourceType | Open Access Repository Aggregation Database |
SubjectTerms | Computer Science - Distributed, Parallel, and Cluster Computing Load shedding On-line systems |
SummonAdditionalLinks | – databaseName: arXiv.org dbid: GOX link: http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwdV1NSwMxEB3anryIotJqlRy8Lu7ma5OjLNYiaA9W3NuSr6IgKt1a_PlOsls8iNeQHPLy8eaRlxmAy9w56zwPWS6Uy7hnIrOa6cwjFXMffDAmGWQf5PyJ39WiHgDZ_YUx6-_XbZcf2LZXhYpvBag5xBCGlEbL1u2i7h4nUyquvv9vP4wxU9OfqzXxxewA9vtAj1x3K3MIg_B-BMUCd87bh_Gk6iziBGNG8og4IYOQ51C9mA25jx65tj_Dx7Cc3SyredYXLciMoLGkTWBerriSXrlgUGzYcsUKmiuOqKA6oGVhtPNWWE5pEEXwZRBach-LZeaWncAIdX8YA9HaO8cEXkBGcuUYokZLx5HhXV5SU05gnKbafHZ5KZqIQpNQmMB0N_um35NtQ2N9RhTDXJ7-P_IM9jAkUNGwQfkURpv1VzhH2t3Yi4T9D98Vgfg priority: 102 providerName: Cornell University |
Title | Overload Control for Scaling WeChat Microservices |
URI | https://www.proquest.com/docview/2073369046 https://arxiv.org/abs/1806.04075 |
hasFullText | 1 |
inHoldings | 1 |
isFullTextHit | |
isPrint | |
link | http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwfV1JS8NAFH5og-DNlVZrycHraDJLlpNg6YLQBa3YW5itKEhbmyqe_O2-maZ6ELwEkpzmy-St37wP4DLSWmnDLYlEpgk3TBCVs5wYdMXcWGOl9ATZYdJ_5HdTMa0KbmVFq9zaRG-ozUK7GrmrhDCGqRxPbpZvxKlGue5qJaGxC0HsJuG5k-Ld3k-NhSYpRsxs08z0o7uu5erz5eMqzlwPAnMZdCmBf_THFHv_0j2AYCyXdnUIO3Z-BHuelqnLY4hHuNNeF9KE7Q2lPMQYM3xAXNHjhE-2_SzX4cBx6srqnz-BSbczafdJJXJApKBOAscyk8x4lphMW4nJiUpnLKZRxhFFzCZoGstcGyUUp9SK2JrUijzhxolrRoqdQm2-mNs6hHlutGYCDZZMeKYZokxTzTEi0FFKZdqAul9qsdzMsSgcCoVHoQHN7eqLag-XxS_iZ_-_Pod9DCMyR_KgvAm19erdXqCrXquW_x4tCG47w_E93vVGU7wOvjrfgzOWbg |
link.rule.ids | 228,230,786,790,891,12792,21416,27956,33406,33777,43633,43838 |
linkProvider | ProQuest |
linkToHtml | http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwfV1NT8JAEJ0ojdGbnxFF7cFrtd2P7vZkIoGgAhLFyK3Z7i7RxABSNP58Z5eiBxOv7Wlft_Pm42UewHmsdaENs1HMpY6YoTwqMppFBqmYGWusUl4g2087T-x2xEdVw62sZJWrmOgDtZlq1yN3nRBKsZRj6dXsPXKuUW66WllorEPgVm7KGgTXrf7g4afLQlKBOTNdjjP98q5LNf96_bxIpJtCYDWDpBL4R3-CsWeY9jYEAzWz8x1Ys5Nd2PDCTF3uQXKPd-1tqkzYXIrKQ8wyw0dEFjknfLbNF7UIe05VV1Z__T4M261hsxNVNgeR4sSZ4Fhq0jGTqZHaKixPCjGmCYklQxyxniAiUZk2BS8YIZYn1gjLs5QZZ68ZF_QAapPpxB5CmGVGa8oxZKmUSU0RZyI0w5xAx4IoUYdDf9R8ttxkkTsUco9CHRqr0-fVLS7zX8yP_n99BpudYa-bd2_6d8ewhUmFdJIPwhpQW8w_7AkS96I4rb7ON4_Wlus |
openUrl | ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=Overload+Control+for+Scaling+WeChat+Microservices&rft.jtitle=arXiv.org&rft.au=Zhou%2C+Hao&rft.au=Chen%2C+Ming&rft.au=Lin%2C+Qian&rft.au=Wang%2C+Yong&rft.date=2018-12-24&rft.pub=Cornell+University+Library%2C+arXiv.org&rft.eissn=2331-8422&rft_id=info:doi/10.48550%2Farxiv.1806.04075 |