Database aggregation of imprecise and uncertain evidence

Information from which knowledge can be discovered is frequently distributed due to having been recorded at different times or to having arisen from different sources. Such information is often subject to both imprecision and uncertainty. The Dempster–Shafer representation of evidence offers a way o...

Full description

Saved in:
Bibliographic Details
Published inInformation sciences Vol. 155; no. 3; pp. 245 - 263
Main Authors Scotney, Bryan, McClean, Sally
Format Journal Article
LanguageEnglish
Published Elsevier Inc 15.10.2003
Subjects
Online AccessGet full text
ISSN0020-0255
1872-6291
DOI10.1016/S0020-0255(03)00172-5

Cover

More Information
Summary:Information from which knowledge can be discovered is frequently distributed due to having been recorded at different times or to having arisen from different sources. Such information is often subject to both imprecision and uncertainty. The Dempster–Shafer representation of evidence offers a way of representing uncertainty in the presence of imprecision, and may therefore be used to provide a mechanism for storing imprecise and uncertain information in databases. We consider an extended relational data model that allows the imprecision and uncertainty associated with attribute values to be quantified using a mass function distribution. When a query is executed, it may be necessary to combine imprecise and uncertain data from distributed sources in order to answer that query. A mechanism is therefore required both for combining the data and for generating measures of uncertainty to be attached to the (imprecise) combined data. In this paper we provide such a mechanism based on aggregation of evidence. We show first how this mechanism can be used to resolve inconsistencies and hence provide an essential database capability to perform the operations necessary to respond to queries on imprecise and uncertain data. We go on to exploit the aggregation operator in an attribute-driven approach to provide information on properties of and patterns in the data. This is fundamental to rule discovery, and hence such an aggregation operator provides a facility that is a central requirement in providing a distributed information system with the capability to perform the operations necessary for Knowledge Discovery.
ISSN:0020-0255
1872-6291
DOI:10.1016/S0020-0255(03)00172-5