[Home]
[Topics]
[Speakers]
[Workshops]
[Comittees]
[Call for Papers]
[Photos]
[Contact]
[Impressum]
Multilingual Individuals and Multilingual Societies
Workshop 2: Learner Corpora and Corpora for Learners
Outline
The presentations in this workshop will deal with different aspects of the construction and use of learner
corpora in linguistic analysis and language teaching or training. By “learner corpora” we understand
systematic collections of spoken or written language data produced by (adult) speakers in a language other
than their mother tongue. The design of such corpora is determined, among other things, by the following
parameters:
- Native language(s) of the speakers involved
- Learner language(s)
- Levels of linguistic analysis (e.g. phonology, morphology, syntax) reflected in annotation and
transcription
- Mode (written vs. spoken) and text/interaction type (e.g. exam texts, task-oriented communication,
ad-hoc interpreting) of the data
The constellation of workshop participants aims at forming a representative cross-section with respect to
these parameters.
Questions
The workshop will focus on methodological and technological aspects in the construction and use of learner
corpora. Thus, each contribution will address one or several of the following questions:
- Corpus design: What motivates the choice of speakers and what parameters are used to describe them? What
methods of data elicitation are used? What categories are used for annotation and transcription, how do
they interrelate?
- Corpus construction: What kind of corpus technology (tools, formats and data models) is used? What are
the procedures for annotation and transcription? What instruments are used for corpus management and
quality control?
- Corpus analysis and use: Which categories are used for error analysis? Which tools and methods are
employed for corpus query and statistical analysis? What are methods and concepts for the application of
learner corpora in teaching and training?
- Corpus reuse: What policy is used to ensure the interoperability and sustainability of corpora? What are
best practices for the construction and dissemination of learner corpora?
Participants and Contributions
- Anke Lüdeling (Humboldt-Universität Berlin)
Morphological Productivity in Learner German:
Complex Verbs
[Abstract] [Presentation]
- Susanne Kämmerer and Sandra Götz (Justus Liebig University Giessen)
Error-tagging the German
component of LINDSEI: Methodology and application
[Abstract] [Presentation]
- Hanna Hedeland, Timm Lehmberg, Thomas Schmidt, Kai Wörner (SFB 538 Mehrsprachigkeit)
Towards a Best Practice for sharing and reusing learner corpora:
the Hamburg Map Task Corpus
[Abstract] [Presentation]
- Detmar Meurers, Niels Ott, Ramon Ziai (Universität Tübingen)
On the Creation and Analysis of a Reading Comprehension Exercise Corpus:
Evaluating Meaning in Context
[Abstract] [Presentation]
- Bernd Meyer (Universität Hamburg)
A corpus of interpreter-mediated interactions as a tool for interpreter training
[Abstract] [Presentation]
- Christiane Andersen (Universität Göteborg)
Spoken language corpora in German and Swedish (L1) and acquisition of foreign language
grammar
[Abstract] [Presentation]