Crowdsourcing for Hispanic Linguistics: Amazon’s Mechanical Turk as a source of Spanish data

Within the field of Linguistics, Amazon’s Mechanical Turk, a crowdsourcing marketplace specializes in computer-based Human Intelligence Tasks, has been praised as a cost efficient source of data for English and other major languages. Spanish is a good candidate due to its presence within the US and...

Full description

Saved in:
Bibliographic Details
Published inBorealis (Tromsø) Vol. 8; no. 1; pp. 187 - 215
Main Author Ortega-Santos, Iván
Format Journal Article
LanguageEnglish
Published Tromsø Septentrio Academic Publishing 01.06.2019
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:Within the field of Linguistics, Amazon’s Mechanical Turk, a crowdsourcing marketplace specializes in computer-based Human Intelligence Tasks, has been praised as a cost efficient source of data for English and other major languages. Spanish is a good candidate due to its presence within the US and beyond. Still, detailed information concerning the linguistic and demographic profile of Spanish-speaking ‘Turkers’ is missing, thus making it difficult for researchers to evaluate whether the Mechanical Turk provides the right environment for their tasks. This paper addresses this gap in our knowledge by developing the first detailed study of the presence of Spanish-speaking workers, focusing on factors relevant for research planning, namely, (socio)linguistically relevant variables and information concerning work habits. The results show that this platform provides access to a fairly active participant pool of both L1 and L2Spanish speakers as well as bilinguals. A brief introduction to how Amazon’s Mechanical Turk works and overview of Hispanic Linguistics projects that have so far used the Mechanical Turk successfully is included.
ISSN:1893-3211
1893-3211
DOI:10.7557/1.8.1.4670