Corpus Linguistics

Teachers: Victoria Kamasa & Katarzyna Klessa (vkamasa | klessa

Students’ page (more materials)

  • Class 01: (2017-10-04) Introduction: basic concepts: corpus, corpus linguistics, main types of corpora. View Slides.
  • Class 02: (2017-10-11) Ctd: the types of corpora, selected concepts used in everyday practice of corpus linguistics. View Slides.
  • Class 03Referential corpora: compare and contrast.  Please sign in to the Corpus linguistics course in Moodle. Use your USOS credentials to log in. More instructions about Moodle can be found in the Slides from Class 2. Your tasks for this part are in the section for week: 18 October – 24 October.
  • Class 04Specialized corpora. Please find the class materials in the Corpus linguistics course in Moodle (week: 25 October – 31 October).
  • Class 05: (2017-11-08) Corpus annotation. View Slides.
  • Class 06: Speech corpora in phonetic research. Annotation of prosodic features in the PoInt Corpus. Please find the class materials in the Corpus linguistics course in Moodle (week: 15 November – 21 November).
  • Class 07-08: Annotation practice: intonation labelling (tasks defined as workshop for week: 29 November – 5 December Moodle, deadlines are longer than 1 week).
  • Class 09: (soon)


  • Anthony, L. (2005). AntConc: A Learner and Classroom Friendly, Multi-Platform Corpus Analysis Toolkit. Proceedings of IWLeL 2004: An Interactive Workshop on Language e-Learning, pp. 7-13, see also:
  • Baker, P. (Ed.). (2009). Contemporary corpus linguistics. London, New York: Continuum.
  • Baker, P., Hardie, A., & McEnery, T. (2006). A glossary of corpus linguistics. Edinburgh: Edinburgh University Press.
  • Bird, S., E. Klein, and E. Loper, Natural Language Processing with Python. Analyzing Text with the Natural Language Toolkit, see: , and (for beginners) Python course at:
  • Boersma, P. & Weenink, D. (2013). Praat: doing phonetics by computer [Computer program]. Ver. 5.3.51, retrieved 2.06.2013 from
  • Dimitriadis, A., & Musgrave, S. (2009). Designing linguistic databases: A primer for linguists (p. 13). Berlin: Walter de Gruyter.
  • Klessa, K. (2015). Annotation Pro [Software tool]. Ver. Retrieved from:   on 2015-05-19.
  • McEnery, T., & Hardie, A. (2012). Corpus linguistics: Method, theory and practice. Cambridge textbooks in linguistics. Cambridge, New York: Cambridge University Press.
  • McEnery, T., & Wilson, A. (2001). Corpus linguistics: An introduction (2nd ed.). Edinburgh textbooks in empirical linguistics. Edinburgh: Edinburgh University Press.
  • Sloetjes, H., & Wittenburg, P. (2008). Annotation by category – ELAN and ISO DCR. In: Proceedings of the 6th International Conference on Language Resources and Evaluation (LREC 2008), on-line:
  • Viana, V., Zyngier, S., & Barnbrook, G. (Eds.). (2011). Studies in corpus linguistics: v. 48. Perspectives on corpus linguistics. Amsterdam, Philadelphia: J. Benjamins Pub.
  • Warren Tang. (2011). A Simple Guide to Using Antconc. Retrieved from