Data Collection in a Flat World: The Strengths and Weaknesses of Mechanical Turk Samples

ABSTRACT Mechanical Turk (MTurk), an online labor system run by Amazon.com, provides quick, easy, and inexpensive access to online research participants. As use of MTurk has grown, so have questions from behavioral researchers about its participants, reliability, and low compensation. In this articl...

Full description

Saved in:
Bibliographic Details
Published inJournal of behavioral decision making Vol. 26; no. 3; pp. 213 - 224
Main Authors Goodman, Joseph K., Cryder, Cynthia E., Cheema, Amar
Format Journal Article
LanguageEnglish
Published Chichester Blackwell Publishing Ltd 01.07.2013
Wiley Periodicals Inc
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:ABSTRACT Mechanical Turk (MTurk), an online labor system run by Amazon.com, provides quick, easy, and inexpensive access to online research participants. As use of MTurk has grown, so have questions from behavioral researchers about its participants, reliability, and low compensation. In this article, we review recent research about MTurk and compare MTurk participants with community and student samples on a set of personality dimensions and classic decision‐making biases. Across two studies, we find many similarities between MTurk participants and traditional samples, but we also find important differences. For instance, MTurk participants are less likely to pay attention to experimental materials, reducing statistical power. They are more likely to use the Internet to find answers, even with no incentive for correct responses. MTurk participants have attitudes about money that are different from a community sample's attitudes but similar to students' attitudes. Finally, MTurk participants are less extraverted and have lower self‐esteem than other participants, presenting challenges for some research domains. Despite these differences, MTurk participants produce reliable results consistent with standard decision‐making biases: they are present biased, risk‐averse for gains, risk‐seeking for losses, show delay/expedite asymmetries, and show the certainty effect—with almost no significant differences in effect sizes from other samples. We conclude that MTurk offers a highly valuable opportunity for data collection and recommend that researchers using MTurk (1) include screening questions that gauge attention and language comprehension; (2) avoid questions with factual answers; and (3) consider how individual differences in financial and social domains may influence results. Copyright © 2012 John Wiley & Sons, Ltd.
Bibliography:ark:/67375/WNG-RR3VX4HR-C
istex:8B7C0461D03867E356B4D7A14E7FD3446245DACF
ArticleID:BDM1753
SourceType-Scholarly Journals-1
ObjectType-Feature-1
content type line 14
ObjectType-Article-2
content type line 23
ISSN:0894-3257
1099-0771
DOI:10.1002/bdm.1753