Testing the Validity of Automatic Speech Recognition for Political Text Analysis


Journal article


Sven-Oliver Proksch, Christopher Wratil, Jens Wäckerle
Political Analysis, vol. 27(3), 2019, pp. 339-359

Link
Cite

Cite

APA   Click to copy
Proksch, S.-O., Wratil, C., & Wäckerle, J. (2019). Testing the Validity of Automatic Speech Recognition for Political Text Analysis. Political Analysis, 27(3), 339–359.


Chicago/Turabian   Click to copy
Proksch, Sven-Oliver, Christopher Wratil, and Jens Wäckerle. “Testing the Validity of Automatic Speech Recognition for Political Text Analysis.” Political Analysis 27, no. 3 (2019): 339–359.


MLA   Click to copy
Proksch, Sven-Oliver, et al. “Testing the Validity of Automatic Speech Recognition for Political Text Analysis.” Political Analysis, vol. 27, no. 3, 2019, pp. 339–59.


BibTeX   Click to copy

@article{sven-oliver2019a,
  title = {Testing the Validity of Automatic Speech Recognition for Political Text Analysis},
  year = {2019},
  issue = {3},
  journal = {Political Analysis},
  pages = {339-359},
  volume = {27},
  author = {Proksch, Sven-Oliver and Wratil, Christopher and Wäckerle, Jens}
}

Abstract

The analysis of political texts from parliamentary speeches, party manifestos, social media, or press releases forms the basis of major and growing fields in political science, not least since advances in “text-as-data” methods have rendered the analysis of large text corpora straightforward. However, a lot of sources of political speech are not regularly transcribed, and their on-demand transcription by humans is prohibitively expensive for research purposes. This class includes political speech in certain legislatures, during political party conferences as well as television interviews and talk shows. We showcase how scholars can use automatic speech recognition systems to analyze such speech with quantitative text analysis models of the “bag-of-words” variety. To probe results for robustness to transcription error, we present an original “word error rate simulation” (WERSIM) procedure implemented in R . We demonstrate the potential of automatic speech recognition to address open questions in political science with two substantive applications and discuss its limitations and practical challenges.


Share



Follow this website


You need to create an Owlstown account to follow this website.


Sign up

Already an Owlstown member?

Log in