The plumbing of corpus linguistics: A guided tour of the corpus-processing pipeline (methods workshop)

Instructor: David Lukes
Contact: Christina Meuser, Dennis Dressel
Registration by email is required.
Date: 23-25 January 2019
Venue: Alter Senatssaal, Wilhelmstraße 26

While it’s not necessary to know how corpus software works in order to use it, having a high-level idea of the entire process, from raw data to what happens when you type a query into a search interface, can help you become a power user. Providing you with such a general idea is the goal of this workshop. We’ll cover the following topics:

  • technical background: how text is represented inside a computer (file formats, plain text, character sets and encodings)
  • adding annotation: metadata (author, year of publication…), morphological tagging
  • corpus query systems: what their purpose is (why not search the plain text files directly?), how they work behind the scenes, standard formats
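
As a small illustration of the first topic (character sets and encodings), here is a minimal Python sketch. The sample word and the variable names are arbitrary choices made for illustration and are not taken from the workshop materials. It shows that the same text maps to different byte sequences under different encodings, and that decoding bytes with the wrong encoding produces garbled text.

```python
# Illustrative sketch only: the sample word is an arbitrary assumption.
text = "café"

# The same four characters become different byte sequences.
utf8_bytes = text.encode("utf-8")      # "é" is stored as two bytes: 0xC3 0xA9
latin1_bytes = text.encode("latin-1")  # "é" is stored as one byte: 0xE9

print(len(text), len(utf8_bytes), len(latin1_bytes))

# Reading UTF-8 bytes as if they were Latin-1 garbles the accented letter.
garbled = utf8_bytes.decode("latin-1")
print(garbled)

# The damage is reversible here because no bytes were lost along the way.
repaired = garbled.encode("latin-1").decode("utf-8")
print(repaired == text)
```

This kind of mismatch is a common reason why a search for an accented word silently fails on a corpus file saved in an unexpected encoding.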

The concepts will be illustrated with practical examples using the corpus query systems Corpus Workbench, (No)SketchEngine and ANNIS, and other related tools. By the end of the workshop, you should have a better intuition for what can and cannot be achieved using corpora, and you should also be better equipped to deal with the technical pitfalls of conducting corpus research.


On 15 October 2018, the Corpus Salcedo by Pieter Muysken was published. It was edited in Freiburg and Basel in collaboration with an international team and can now be used via moca3 (Daniel Alcón), the corpus management tool developed in Freiburg.

The HPSL is offering 1-2 scholarships for doctoral students in Freiburg, starting 1 October 2018. Further information can be found here.

The new research group CLiMR (Cognition, language & interaction with machines Research Group) in Basel introduces itself. Interested in joining? Get in touch here!

Hermann-Paul-Preis for outstanding dissertations

We are pleased to be able to award the Hermann-Paul-Preis for outstanding dissertations annually, starting this winter.
Further information can be found here.



PhD Scholarships Hermann Paul Scholarships in Linguistics 2018

The Hermann Paul Scholarship in Linguistics 2018 in Basel went to Joelle Loew. Congratulations!

PhD Scholarships Hermann Paul Scholarships in Linguistics 2017

The Hermann Paul Scholarships in Linguistics 2017 in Basel went to Robert Reinecke and Valentina Saccone. Congratulations!

PhD Scholarship Promotionskolleg Empirische Linguistik (PEL) 2014

The PEL scholarship 2014 went to Hanna Thiele. Congratulations!

Upcoming Events

25-26 October 2018
Relevance of Knowledge for Asking and Telling in Social Interaction (block seminar for doctoral students)

25 October 2018, 18:15
La rivoluzione della punteggiatura nella Rete (talk)