STREAMING JOINS IN CONSTRAINED MEMORY ENVIRONMENTS

Large amounts of memory can be consumed in streaming joins because events from one stream are held in memory while waiting for matching events from a second stream. Memory needs can be reduced by analyzing the join condition to determine the bounds on the time discrepancy between events in the two s...

Full description

Saved in:
Bibliographic Details
Main Authors Shulman Boris, Chen Zhong, Novik Lev, Szyperski Clemens A
Format Patent
LanguageEnglish
Published 08.12.2016
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:Large amounts of memory can be consumed in streaming joins because events from one stream are held in memory while waiting for matching events from a second stream. Memory needs can be reduced by analyzing the join condition to determine the bounds on the time discrepancy between events in the two streams. When it is determined that an event from one stream must occur prior to the matching event from the other stream, the later-arriving stream data can be ingested with an intentional delay. When it is determined that regardless of input received from a first stream, no output will be produced when there is no input from the second stream, pulling data from the first stream can cease. A multi-stage join plan can be employed so that a less busy stream can be scanned with increasing amounts of intentional delay. Only unmatched data is stored.
Bibliography:Application Number: US201514732374