A guide to measuring expert performance in forensic pattern matching

Decisions in forensic science are often binary. A firearms expert must decide whether a bullet was fired from a particular gun or not. A face comparison expert must decide whether a photograph matches a suspect or not. A fingerprint examiner must decide whether a crime scene fingerprint belongs to a...

Full description

Saved in:

Bibliographic Details
Published in	Behavior research methods Vol. 56; no. 6; pp. 6223 - 6247
Main Authors	Robson, Samuel G., Searston, Rachel A., Thompson, Matthew B., Tangen, Jason M.
Format	Journal Article
Language	English
Published	New York Springer US 2024 Springer Nature B.V
Subjects	Behavioral Science and Psychology Cognitive Psychology Decision Making Expert Testimony Firearms Forensic science Forensic Sciences - methods Humans Original Manuscript Psychology Signal Detection, Psychological Decision-making Forensic science Fingerprints Proficiency tests Forensic pattern matching Expertise Signal detection
Online Access	Get full text

Cover

Loading…

More Information
Summary:	Decisions in forensic science are often binary. A firearms expert must decide whether a bullet was fired from a particular gun or not. A face comparison expert must decide whether a photograph matches a suspect or not. A fingerprint examiner must decide whether a crime scene fingerprint belongs to a suspect or not. Researchers who study these decisions have therefore quantified expert performance using measurement models derived largely from signal detection theory. Here we demonstrate that the design and measurement choices researchers make can have a dramatic effect on the conclusions drawn about the performance of forensic examiners. We introduce several performance models – proportion correct, diagnosticity ratio, and parametric and non-parametric signal detection measures – and apply them to forensic decisions. We use data from expert and novice fingerprint comparison decisions along with a resampling method to demonstrate how experimental results can change as a function of the task, case materials, and measurement model chosen. We also graphically show how response bias, prevalence, inconclusive responses, floor and ceiling effects, case sampling, and number of trials might affect one’s interpretation of expert performance in forensics. Finally, we discuss several considerations for experimental and diagnostic accuracy studies: (1) include an equal number of same-source and different-source trials; (2) record inconclusive responses separately from forced choices; (3) include a control comparison group; (4) counterbalance or randomly sample trials for each participant; and (5) present as many trials to participants as is practical.
Bibliography:	ObjectType-Article-1 SourceType-Scholarly Journals-1 ObjectType-Feature-2 content type line 23
ISSN:	1554-3528 1554-351X 1554-3528
DOI:	10.3758/s13428-024-02354-y