MIMS: Learner Corpora and Corpora for Learners

[Home] [Topics] [Speakers] [Workshops] [Comittees] [Call for Papers] [Photos] [Contact] [Impressum]

Multilingual Individuals and Multilingual Societies

Workshop 2: Learner Corpora and Corpora for Learners

Outline

The presentations in this workshop will deal with different aspects of the construction and use of learner corpora in linguistic analysis and language teaching or training. By “learner corpora” we understand systematic collections of spoken or written language data produced by (adult) speakers in a language other than their mother tongue. The design of such corpora is determined, among other things, by the following parameters:

Native language(s) of the speakers involved
Learner language(s)
Levels of linguistic analysis (e.g. phonology, morphology, syntax) reflected in annotation and transcription
Mode (written vs. spoken) and text/interaction type (e.g. exam texts, task-oriented communication, ad-hoc interpreting) of the data

The constellation of workshop participants aims at forming a representative cross-section with respect to these parameters.

Questions

The workshop will focus on methodological and technological aspects in the construction and use of learner corpora. Thus, each contribution will address one or several of the following questions:

Corpus design: What motivates the choice of speakers and what parameters are used to describe them? What methods of data elicitation are used? What categories are used for annotation and transcription, how do they interrelate?
Corpus construction: What kind of corpus technology (tools, formats and data models) is used? What are the procedures for annotation and transcription? What instruments are used for corpus management and quality control?
Corpus analysis and use: Which categories are used for error analysis? Which tools and methods are employed for corpus query and statistical analysis? What are methods and concepts for the application of learner corpora in teaching and training?
Corpus reuse: What policy is used to ensure the interoperability and sustainability of corpora? What are best practices for the construction and dissemination of learner corpora?

Participants and Contributions

Anke Lüdeling (Humboldt-Universität Berlin)
Morphological Productivity in Learner German: Complex Verbs
[Abstract] [Presentation]
Susanne Kämmerer and Sandra Götz (Justus Liebig University Giessen)
Error-tagging the German component of LINDSEI: Methodology and application
[Abstract] [Presentation]
Hanna Hedeland, Timm Lehmberg, Thomas Schmidt, Kai Wörner (SFB 538 Mehrsprachigkeit)
Towards a Best Practice for sharing and reusing learner corpora: the Hamburg Map Task Corpus
[Abstract] [Presentation]
Detmar Meurers, Niels Ott, Ramon Ziai (Universität Tübingen)
On the Creation and Analysis of a Reading Comprehension Exercise Corpus: Evaluating Meaning in Context
[Abstract] [Presentation]
Bernd Meyer (Universität Hamburg)
A corpus of interpreter-mediated interactions as a tool for interpreter training
[Abstract] [Presentation]
Christiane Andersen (Universität Göteborg)
Spoken language corpora in German and Swedish (L1) and acquisition of foreign language grammar
[Abstract] [Presentation]