Completed Research Projects
- Transforming European Learner Language into Learning Opportunities (TELL-OP) [2014-2017]
How can adult learners use their own output to further acquire language skills? How can adult learners use their critical thinking, analysis and awareness skills to improve their communicative competence across different CEFR levels?
TELL-OP is an ERASMUS+ strategic partnership that seeks to bring together the knowledge and expertise of European stakeholders from Belgium, Germany, Spain, Turkey and the UK in the fields of language education, corpus and applied linguistics, e-learning and knowledge engineering in order to promote the personalized e-learning of languages in the contexts of higher and adult education, in particular, through mobile devices.
Please visit the project website.
- The Old Bailey Corpus - Spoken English in the 18th and 19th centuries [2008-2012]
The proceedings of the Old Bailey, London's central criminal court, were published from 1674 to 1913 and constitute a large body of texts from the beginning of Late Modern English. The Proceedings contain over 200,000 trials, totalling ca. 134 million words, and their verbatim passages are arguably as near as we can get to the spoken word of the period. The material thus offers the rare opportunity of analyzing everyday language in a period that has been neglected both with regard to the compilation of primary linguistic data and the description of the structure, variability, and change of English. The Old Bailey Corpus (OBC) is based on the Proceedings and documents spoken English from the 1720s onward.
- Integrating the Old Bailey Corpus into the CLARIN-D infrastructure [2015-2016]
The Old Bailey Corpus (OBC) is a corpus of 18th- and 19th-century spoken English and consists of selected Proceedings of the Old Bailey, London's central criminal court. The OBC currently has c. 750,000 words of direct speech per decade between 1720 and 1913, amounting to about 13.9 million words of spoken English. Every speaker turn is annotated for sociobiographical (gender, social class, age), pragmatic (role in the court proceeding) and textual variables (the shorthand scribe, printer and publisher of individual Proceedings). The aim of this project is to integrate the OBC into the German section of the Common Language Resources and Technology Infrastructure (CLARIN-D) to achieve sustainability of this resource (persistent storage and access). The project is funded by the Federal Ministry of Education and Research. The OBC will be hosted at the CLARIN-D Service Centre of Saarland University.
- Verb Complementation in South Asian Englishes [2008-2011]
The project aims to take into account all complementation patterns that ditransitive verbs such as to give can take, in order to evaluate whether, and to what extent, differences in verb complementation between South Asian Englishes and British English depend on various lexicogrammatical factors (e.g. variation in collocational profiles; pronominality, animacy, or syntactic complexity of the arguments; influences from indigenous languages). More information can be found in the DFG's research database.
- TransComp
TransComp is a process-oriented longitudinal study which explores the development of translation competence in 12 students of translation over a period of 3 years and compares it to that of 10 professional translators. It aims to make an important contribution to the development of the methodology and model building in process-oriented translation studies by overcoming a number of shortcomings of previous studies. The insight into the components which make up translation competence and into its development gained in the project will be utilized for translation pedagogy and the improvement of curricula for translator training.
- The Automated Similarity Judgment Program
The ASJP project aims at achieving a computerized lexicostatistical analysis of ideally all the world’s languages. The two main purposes are to provide a classification of all languages by a single, consistent and objective (if perhaps not ideal) method and to perform various statistical analyses regarding the historical and areal behavior of lexical items.
It is a computerized lexicostatistical analysis in that it uses an algorithm taken from biogenetics in order to arrive at an unbiased classification of the languages of the world. The input data are computer-readable transcriptions of the 40 most stable items of the Swadesh word list. At present, the database covers more than 5300 languages.
You can still contribute to the ASJP project: wordlists for ca. 3000 languages are still missing.
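The kind of word-list comparison described above can be illustrated with a small sketch. The text here does not name the algorithm, so the sketch assumes a length-normalized edit (Levenshtein) distance averaged over shared meanings, which is one common way to compare such lists and not necessarily the project's exact procedure. The function names and the three toy word lists are invented for illustration and are not ASJP data.

```python
def levenshtein(a: str, b: str) -> int:
    """Minimum number of insertions, deletions and substitutions turning a into b."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        curr = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # substitution or match
        prev = curr
    return prev[-1]


def normalized_distance(a: str, b: str) -> float:
    """Edit distance divided by the length of the longer word (range 0 to 1)."""
    longest = max(len(a), len(b))
    return levenshtein(a, b) / longest if longest else 0.0


def wordlist_distance(list_a: dict, list_b: dict) -> float:
    """Average normalized distance over the meanings present in both lists."""
    shared = sorted(set(list_a) & set(list_b))
    if not shared:
        raise ValueError("word lists share no meanings")
    total = sum(normalized_distance(list_a[m], list_b[m]) for m in shared)
    return total / len(shared)


# Hypothetical toy transcriptions (invented for illustration, not ASJP data).
lang_x = {"I": "ik", "you": "du", "water": "vasa", "stone": "stain"}
lang_y = {"I": "ik", "you": "tu", "water": "vata", "stone": "sten"}
lang_z = {"I": "mina", "you": "sina", "water": "vesi", "stone": "kivi"}

if __name__ == "__main__":
    print("x vs y:", wordlist_distance(lang_x, lang_y))
    print("x vs z:", wordlist_distance(lang_x, lang_z))
```

In this toy setup the lists for x and y come out closer to each other than either is to z. A matrix of such pairwise distances is the kind of input from which a tree-building method could derive a classification.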