COSHARDING AND RANDOMIZED COSHARDING

Bibliographic Details
Main Authors: KANTHAK SEBASTIAN, LLOYD ALEXANDER, KHESIN ALEXANDER
Format: Patent
Language: Chinese, English
Published: 14.05.2021
Summary: The technology relates to cosharding tables within a distributed storage system. A data table including one or more rows may be received. Each row in the data table may include an identifier key and pieces of data. Each piece of data in the data table may be indexed into an individual row of an index table, wherein each row in the index table includes data associated with the identifier key of the data table row from which the piece of data in that index row was indexed. The index table may be sharded into splits, wherein the sharding includes assigning each row of the index table to one of the splits based on the identifier key of the data table row from which the piece of data in that index row was indexed. The splits may be stored in two or more portions of the distributed storage system.
Bibliography: Application Number: CN202080005621
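
The sharding step described in the summary can be illustrated with a minimal sketch. It is not the patented implementation: the table contents, the names `build_index`, `split_for_key`, `coshard`, and `NUM_SPLITS`, and the use of a CRC32 hash to map an identifier key to a split are all assumptions made for illustration. The summary states only that each index row is assigned to a split based on the identifier key of the data table row it was indexed from.

```python
from collections import defaultdict
from zlib import crc32

NUM_SPLITS = 4  # hypothetical number of splits


def build_index(data_table):
    """Index every piece of data into its own index row.

    Each index row is (piece, key), where key is the identifier key of the
    data table row the piece came from.
    """
    index_rows = []
    for key, pieces in data_table.items():
        for piece in pieces:
            index_rows.append((piece, key))
    return index_rows


def split_for_key(key, num_splits=NUM_SPLITS):
    """Map an identifier key to a split number.

    Assumption: a stable hash of the key; the summary does not say how the
    key determines the split.
    """
    return crc32(key.encode("utf-8")) % num_splits


def coshard(index_rows, num_splits=NUM_SPLITS):
    """Assign each index row to a split using the source row's identifier key,
    not the indexed piece of data."""
    splits = defaultdict(list)
    for piece, key in index_rows:
        splits[split_for_key(key, num_splits)].append((piece, key))
    return dict(splits)


# Hypothetical data table: identifier key -> pieces of data in that row.
data_table = {
    "doc1": ["apple", "banana"],
    "doc2": ["banana", "cherry"],
    "doc3": ["apple"],
}

index_rows = build_index(data_table)
splits = coshard(index_rows)
```

Because the split is chosen from the identifier key, all index rows derived from one data table row land in the same split, even when the same piece of data (here "banana") appears under several keys and therefore in several splits.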