Real-time monitoring of IO load and latency

Providers of web services and other types of software as a service may be subject to service-level agreements requiring that response times be within a defined range. For efficiency, multiple services may be hosted on the same set of computing nodes, which may jeopardize adherence to service-level a...

Full description

Saved in:
Bibliographic Details
Main Authors Lu, Yijun, Muniswamy-Reddy, Kiran-Kumar, Xiao, Wei, Filipe, Miguel Mascarenhas, Swift, Bjorn Patrick
Format Patent
LanguageEnglish
Published 16.02.2021
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:Providers of web services and other types of software as a service may be subject to service-level agreements requiring that response times be within a defined range. For efficiency, multiple services may be hosted on the same set of computing nodes, which may jeopardize adherence to service-level agreements. A control system may involve classifying service requests and determining desired values for measurements such as latency. An error value may be calculated based on the difference between measured and desired values. A controller may adjust a rate of capacity utilization for the computing nodes based on the current error, a history of past errors, and a prediction of future errors.
Bibliography:Application Number: US201313886025