Frequent itemset counting using subsets of bitmaps

A method and mechanism for performing improved frequent itemset operations is provided. A set of item groups are divided into a plurality of subsets. Each item group is composed of a set of data items. Possible combinations of data items that may frequently appear together in the same item group are...

Full description

Saved in:
Bibliographic Details
Main Authors MOZES ARI W, LI WEI, JAKOBSSON HAKAN
Format Patent
LanguageEnglish
Published 13.07.2010
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:A method and mechanism for performing improved frequent itemset operations is provided. A set of item groups are divided into a plurality of subsets. Each item group is composed of a set of data items. Possible combinations of data items that may frequently appear together in the same item group are referred to as candidate combinations. Candidate combinations comprising a first set of data items are identified, and thereafter the occurrence of each candidate combination in any item group in each subset is counted by comparing item bitmaps, associated with items in the candidate combination, in each subset in turn. The comparison of item bitmaps is performed in volatile memory. A total frequent itemset count that describes the frequency of candidate combinations in items groups across all subsets is obtained. Thereafter, the total frequent itemset count for candidate combinations having a larger number of data items may be determined.
Bibliography:Application Number: US20040927893