Text-mining international politics

Methods
MA
WS2018
One-week block seminar, MA-level, Charles University Prague, Spring 2019
Published

May 7, 2019

Outline and learning goals

Politics takes place in and through texts. Speeches, debates, position papers, press releases, traditional and social media coverage, as well as the resulting laws, agreements, or resolutions can tell us much on the priorities, preferences, and power of political actors. This holds especially for politics beyond the nation state. For studying international politics across long time periods or broad actor sets, political text is often the only consistently available information source that we have. The challenge, however, lies in extracting systematic information from largely unstructured texts in a reliable and systematic fashion. This is a promise of automated content analyses: various algorithms offer means to reveal relevant patterns in the vast amount of political language that we can nowadays access in digital formats.

The block seminar thus introduces the strengths but also the limitations of various approaches to treat text as data. Based on my own work with and on these tools, students will learn about the basic intuitions behind the most prominent text analysis methods in recent political science research. We will work along concrete examples by discussing extant and possible applications of these methods to EU and international politics. These examples are also useful to highlight the pragmatic issues involved in collecting, analysing, and visualizing large-scale digital text corpora.

The course targets advanced BA as well as MA students in the political sciences or related disciplines who wish to broaden their empirical toolkit. Prior knowledge in content analysis or quantitative methods is not required but may be an asset. The seminar pursues three related teaching goals:

  • Enable students to read and to assess studies using automated text analyses
  • Allow informed methodological choices and provide pragmatic tips/resources for conducting own text analyses
  • Create awareness for the more general promises and pitfalls of analysing human language with automated algorithms

Successful participants will be awarded with 4 ECTS. The evaluation of student performance will be based on three criteria:

  • Thorough reading of the obligatory literature marked with (O) in the syllabus below
  • Regular and active participation in the individual six sessions of the seminar
  • A short research-design paper (4,000-5,000 words) that discusses whether and which automated text analyses might be suited to address a freely chosen question on the EU, international politics, or related fields (more details during the seminar)

Ideally, this course convinces you that conducting text analyses can be fun and insightful at the same time. In any case, I am very much looking forward to work with you!

Note

All details on course organization, assignments, literature and individual sessions are provided in the syllabus below.



Materials

Syllabus (PDF)     Slides Session 1 (PDF)     Slides Session 2 (PDF)     Slides Session 3 (PDF)     Slides Session 4 (PDF)     Slides Session 5 (PDF)     Slides Session 6 (PDF)    



R code tutorials

R code to implement some of the discussed methods in climate-change related talk in the United Nations General Debate corpus.