Big data, traditional data and the tradeoffs between prediction and causality in highway-safety analysis

•Prediction and causality tradeoffs are considered.•Four distinct modeling approaches to safety data evaluated.•Concerns with using data from observed accidents and real-time safety data are discussed.•The consequences of prediction/causality tradeoffs on safety policy are considered. The analysis o...

Full description

Saved in:
Bibliographic Details
Published inAnalytic methods in accident research Vol. 25; p. 100113
Main Authors Mannering, Fred, Bhat, Chandra R., Shankar, Venky, Abdel-Aty, Mohamed
Format Journal Article
LanguageEnglish
Published Elsevier Ltd 01.03.2020
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:•Prediction and causality tradeoffs are considered.•Four distinct modeling approaches to safety data evaluated.•Concerns with using data from observed accidents and real-time safety data are discussed.•The consequences of prediction/causality tradeoffs on safety policy are considered. The analysis of highway accident data is largely dominated by traditional statistical methods (standard regression-based approaches), advanced statistical methods (such as models that account for unobserved heterogeneity), and data-driven methods (artificial intelligence, neural networks, machine learning, and so on). These methods have been applied mostly using data from observed crashes, but this can create a problem in uncovering causality since individuals that are inherently riskier than the population as a whole may be over-represented in the data. In addition, when and where individuals choose to drive could affect data analyses that use real-time data since the population of observed drivers could change over time. This issue, the nature of the data, and the implementation target of the analysis imply that analysts must often tradeoff the predictive capability of the resulting analysis and its ability to uncover the underlying causal nature of crash-contributing factors. The selection of the data-analysis method is often made without full consideration of this tradeoff, even though there are potentially important implications for the development of safety countermeasures and policies. This paper provides a discussion of the issues involved in this tradeoff with regard to specific methodological alternatives and presents researchers with a better understanding of the trade-offs often being inherently made in their analysis.
ISSN:2213-6657
2213-6657
DOI:10.1016/j.amar.2020.100113