Capturing Delays and Valid Times in Data Warehouses--Towards Timely Consistent Analyses

Real-world changes are generally discovered delayed by computer systems. The typical update patterns for traditional data warehouses on an overnight or even weekly basis increase this propagation delay until the information is available to knowledge workers. Typically, traditional data warehouses fo...

Full description

Saved in:
Bibliographic Details
Published inJournal of intelligent information systems Vol. 19; no. 2; p. 169
Main Authors Bruckner, Robert M, Tjoa, A Min
Format Journal Article
LanguageEnglish
Published New York Springer Nature B.V 01.09.2002
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:Real-world changes are generally discovered delayed by computer systems. The typical update patterns for traditional data warehouses on an overnight or even weekly basis increase this propagation delay until the information is available to knowledge workers. Typically, traditional data warehouses focus on summarized data (at some level) rather than detailed data. For active data warehouse environments, detailed data about entities is required for checking the data conditions and triggering actions to automize routine decision tasks. Hence, keeping data current (by minimizing the latency from when data is captured until it is available to knowledge workers) and consistent in that context is a difficult task. In this paper we present an approach for modeling conceptual time consistency problems and introduce a data model that deals with timely delays. It supports knowledge workers in finding out, why (or why not) an active system responded to a certain state of the data. Therefore, the model enables analytical processing of detailed data (enhanced by valid time) based on a knowledge state at a specific time. All states that were not yet known by the system at that point in time are consistently ignored. This enables timely consistent analyses by considering that the validity of detailed data and aggregates can be restricted to time intervals only, due to frequent updates and late-arriving information. [PUBLICATION ABSTRACT]
ISSN:0925-9902
1573-7675
DOI:10.1023/A:1016555410197