Exploring British accents: Modelling the trap–bath split with functional data analysis

The sound of our speech is influenced by the places we come from. Great Britain contains a wide variety of distinctive accents which are of interest to linguistics. In particular, the ‘a’ vowel in words like ‘class’ is pronounced differently in the North and the South. Speech recordings of this vowe...

Full description

Saved in:

Bibliographic Details
Published in	Journal of the Royal Statistical Society Series C: Applied Statistics Vol. 71; no. 4; pp. 773 - 805
Main Authors	Koshy, Aranya, Tavakoli, Shahin
Format	Journal Article
Language	English
Published	Oxford Oxford University Press 01.08.2022
Subjects	Accentuation British English Classifiers Data analysis Datasets formants functional principal component analysis Linguistics logistic regression MFCC Modelling North and South Phonetics Soap films Speech Vowels United Kingdom > UK
Online Access	Get full text

Cover

Loading…

More Information
Summary:	The sound of our speech is influenced by the places we come from. Great Britain contains a wide variety of distinctive accents which are of interest to linguistics. In particular, the ‘a’ vowel in words like ‘class’ is pronounced differently in the North and the South. Speech recordings of this vowel can be represented as formant curves or as mel‐frequency cepstral coefficient curves. Functional data analysis and generalised additive models offer techniques to model the variation in these curves. Our first aim was to model the difference between typical Northern and Southern vowels /æ/ and /ɑ/, by training two classifiers on the North‐South Class Vowels dataset collected for this paper. Our second aim is to visualise geographical variation of accents in Great Britain. For this we use speech recordings from a second dataset, the British National Corpus (BNC) audio edition. The trained models are used to predict the accent of speakers in the BNC, and then we model the geographical patterns in these predictions using a soap film smoother. This work demonstrates a flexible and interpretable approach to modelling phonetic accent variation in speech recordings.
ISSN:	0035-9254 1467-9876
DOI:	10.1111/rssc.12555