Multi-cluster warehouse

Bibliographic Details
Main Authors: Dageville, Benoit; Povinec, Peter; Cruanes, Thierry; Funke, Florian Andreas
Format: Patent
Language: English
Published: 14.05.2024

Summary: A method implementing a fault-tolerant data warehouse using availability zones includes allocating a plurality of processing units to a data warehouse, the processing units located in different availability zones, an availability zone comprising one or more data centers. The method further includes routing a query to a processing unit within the data warehouse, the query having a common session identifier with a query previously provided to the processing unit, the processing unit determined to be caching a data segment associated with a cloud storage resource independent of the plurality of processing units. The method further includes, as a result of monitoring a number of queries running at an input degree of parallelism, determining that the processing capacity of the processing units has reached a threshold; and changing a total number of processing units using the input degree of parallelism and the number of queries.
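Illustration: the routing and scaling steps named in the summary can be sketched roughly as below. This is a minimal sketch, not the patented method. The class `ProcessingUnit`, the functions `route_query` and `target_unit_count`, the parameters `slots_per_unit` and `threshold`, the fallback routing rule, and the capacity formula are all hypothetical assumptions; the summary states only that the unit count is changed using the degree of parallelism and the number of queries.

```python
import math
from dataclasses import dataclass, field


@dataclass
class ProcessingUnit:
    """Hypothetical stand-in for one processing unit of the warehouse."""
    name: str
    availability_zone: str
    cached_segments: set = field(default_factory=set)
    sessions: set = field(default_factory=set)


def route_query(units, session_id, segment):
    """Prefer a unit that already served this session and caches the segment."""
    for unit in units:
        if session_id in unit.sessions and segment in unit.cached_segments:
            return unit
    # Assumed fallback (not in the summary): pick the unit with the fewest sessions
    # and record that it now serves this session and caches this segment.
    unit = min(units, key=lambda u: len(u.sessions))
    unit.sessions.add(session_id)
    unit.cached_segments.add(segment)
    return unit


def target_unit_count(running_queries, degree_of_parallelism,
                      slots_per_unit, threshold=0.8):
    """Assumed formula: units needed to keep utilization at or below threshold."""
    demand = running_queries * degree_of_parallelism
    return max(1, math.ceil(demand / (slots_per_unit * threshold)))


# Two units placed in different (hypothetical) availability zones.
units = [ProcessingUnit("unit-a", "az-1"), ProcessingUnit("unit-b", "az-2")]
first = route_query(units, "session-1", "segment-1")
repeat = route_query(units, "session-1", "segment-1")  # same unit as `first`
```

In this sketch, a repeated query with the same session identifier and data segment lands on the unit that already caches that segment, and `target_unit_count` grows the warehouse once the monitored demand exceeds the assumed per-unit capacity.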
Bibliography: Application Number: US202318139809