This page needs JavaScript! Please enable it to continue.

This website uses JavaScripts. If you use an adblocker, content may not be displayed or may not be displayed correctly.

The plumbing of corpus linguistics: A guided tour of the corpus-processing pipeline (Methodenworkshop)

Date	Wednesday, 23rd January 2019
Location

veranstalter: David Lukes
ansprechpartner: Christina Meuser, Dennis Dressel
email: contact@hpsl.uni-freiburg.de
web:
institution: HPSL
language: Englisch
location institution: Freiburg
date_raw: 23.-25. Januar 2019
date_sort: 23.01.2019, 00:00:00

While it’s not necessary to know how corpus software works in order to use it, having a high-level idea of the entire process, from raw data to what happens when you type a query into a search interface, can help you become a power user. Providing you with such a general idea is the goal of this workshop. We’ll cover the following topics:

technical background: how text is represented inside a computer (file formats, plain text, character sets and encodings)
adding annotation: metadata (author, year of publication…), morphological tagging
corpus query systems: what’s their purpose (why not directly search the plain text files?), how they work behind the scenes, standard formats

The concepts will be illustrated with practical examples using the corpus query systems Corpus Workbench, (No)SketchEngine and ANNIS, and other related tools. By the end of the workshop, you should have a better intuition for what can and cannot be achieved using corpora, and you should also be better equipped to deal with the technical pitfalls of conducting corpus research.

HPSL PhD Student Kristina Ehrsam successfully defends her doctoral thesis on 26 March 2025: HPSL doctoral student Kristina Ehrsam successfully defended her doctoral thesis entitled ‘English...
HPSL PhD Student Ye Ji Rieser successfully defends her doctoral thesis: HPSL doctoral student Ye Ji Rieser successfully defended her doctoral thesis entitled ‘The series...
SNSF Advanced Grant für Prof. Dr. Lorenza Mondadas Projekt «Körper und Zeit in sozialen Interaktionen»: Prof. Dr. Lorenza Mondada erhält einen der hoch kompetitiven Förderpreise des Schweizerischen...
Hermann Paul Scholarship in Linguistics 2025: The Hermann Paul School of Linguistics is offering one scholarship for a PhD candidate in Basel,...

Der Hermann-Paul Preis 2022 ging an Aline Bieri und Florian Dreyer. Herzlichen Glückwunsch! /// The Hermann Paul Award 2022 went to Aline Bieri and Florian Dreyer. Congratulations!

Der Hermann-Paul Preis 2019 ging an Emiel van den Hoven. Herzlichen Glückwunsch! /// The Hermann Paul Award 2019 went to Emiel van den Hoven. Congratulations!

Der Hermann-Paul Preis 2018 ging an Verena Schröter und Hanna Svensson. Herzlichen Glückwunsch! /// The Hermann Paul Award 2018 went to Verena Schröter and Hanna Svensson. Congratulations!

This page needs JavaScript! Please enable it to continue.

Hermann Paul School of Linguistics
Basel - Freiburg (i.Br.)

The plumbing of corpus linguistics: A guided tour of the corpus-processing pipeline (Methodenworkshop)

Information for applicants

News

Upcoming Events

Search member

PhD Scholarships

Hermann-Paul-Preis für herausragende Dissertationen

Newsletter

Current Research

This page needs JavaScript! Please enable it to continue.

Hermann Paul School of Linguistics Basel - Freiburg (i.Br.)

The plumbing of corpus linguistics: A guided tour of the corpus-processing pipeline (Methodenworkshop)

Information for applicants

News

Upcoming Events

Search member

PhD Scholarships

Hermann-Paul-Preis für herausragende Dissertationen

Newsletter

Current Research

Hermann Paul School of Linguistics
Basel - Freiburg (i.Br.)