Testing the Validity of Automatic Speech Recognition for Political Text Analysis

Journal article

Sven-Oliver Proksch, Christopher Wratil, Jens Wäckerle
Political Analysis, vol. 27(3), 2019, pp. 339-359

Cite

APA
Proksch, S.-O., Wratil, C., & Wäckerle, J. (2019). Testing the Validity of Automatic Speech Recognition for Political Text Analysis. Political Analysis, 27(3), 339–359.

Chicago/Turabian
Proksch, Sven-Oliver, Christopher Wratil, and Jens Wäckerle. “Testing the Validity of Automatic Speech Recognition for Political Text Analysis.” Political Analysis 27, no. 3 (2019): 339–359.

MLA
Proksch, Sven-Oliver, et al. “Testing the Validity of Automatic Speech Recognition for Political Text Analysis.” Political Analysis, vol. 27, no. 3, 2019, pp. 339–59.

BibTeX

@article{sven-oliver2019a,
  title = {Testing the Validity of Automatic Speech Recognition for Political Text Analysis},
  year = {2019},
  issue = {3},
  journal = {Political Analysis},
  pages = {339-359},
  volume = {27},
  author = {Proksch, Sven-Oliver and Wratil, Christopher and Wäckerle, Jens}
}

Abstract

The analysis of political texts from parliamentary speeches, party manifestos, social media, or press releases forms the basis of major and growing fields in political science, not least since advances in “text-as-data” methods have rendered the analysis of large text corpora straightforward. However, a lot of sources of political speech are not regularly transcribed, and their on-demand transcription by humans is prohibitively expensive for research purposes. This class includes political speech in certain legislatures, during political party conferences as well as television interviews and talk shows. We showcase how scholars can use automatic speech recognition systems to analyze such speech with quantitative text analysis models of the “bag-of-words” variety. To probe results for robustness to transcription error, we present an original “word error rate simulation” (WERSIM) procedure implemented in R. We demonstrate the potential of automatic speech recognition to address open questions in political science with two substantive applications and discuss its limitations and practical challenges.
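
For readers unfamiliar with the metric behind the name, the word error rate is conventionally the word-level edit distance (substitutions, deletions, and insertions) between a reference transcript and an automatic transcript, divided by the number of words in the reference. The following is a minimal Python sketch of that standard definition only. It is not the authors' WERSIM procedure, which is implemented in R; the function name `word_error_rate` and the example sentences are hypothetical and chosen purely for illustration.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word-level edit distance divided by the number of reference words.

    Illustrative helper (hypothetical name), not part of WERSIM.
    Tokenization is a simple lowercase whitespace split, which is an assumption.
    """
    ref = reference.lower().split()
    hyp = hypothesis.lower().split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the reference words
    # processed so far and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i]  # dropping all i reference words costs i deletions
        for j, hyp_word in enumerate(hyp, start=1):
            substitution = prev[j - 1] + (0 if ref_word == hyp_word else 1)
            deletion = prev[j] + 1       # reference word missing from hypothesis
            insertion = curr[j - 1] + 1  # extra word in hypothesis
            curr.append(min(substitution, deletion, insertion))
        prev = curr
    return prev[-1] / len(ref)


# Made-up example: one substituted word ("passed" -> "past") and one dropped word ("bill").
reference = "the parliament passed the budget bill"
hypothesis = "the parliament past the budget"
wer = word_error_rate(reference, hypothesis)
```

Here two errors against six reference words give a word error rate of one third. Because insertions count as errors, the rate can exceed 1 when the automatic transcript is much longer than the reference.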