The following research projects have received external funding of more than DKK 500,000.
- 2009-2011: Reconstruction of Speech Events in Real Time (DKK 4.7 million).
The project aims to rethink the way information is encoded in speech and decoded in hearing. Our formal models are optimized for compatibility with hearing aids, speech technology, and mobile telephony. We use methods from machine learning, numerical analysis, deductive systems, acoustic analysis, and natural language parsing, based on data from real-world and laboratory tasks. The project is funded by the Danish Research Council for the Humanities. Members: Peter Juel Henrichsen (principal investigator), Maya Borges, Ruben Schactenhaufen, Thomas Ulrich Christiansen (Technical University of Denmark), Andrea Corradini (University of Southern Denmark).
- 2008-2011: The Copenhagen Dependency Treebank (DKK 3.0 million). This project seeks to build a set of parallel treebanks for Danish, English, German, Italian, and Spanish with 40,000 words in each language, based on a unified annotation scheme for syntax, morphology and discourse. The treebanks will then be used to explore a wide range of research problems in theoretical and computational linguistics. The project is funded by the Danish Research Council for the Humanities. Members: Matthias Buch-Kromann (principal investigator), Iørn Korzen, Henrik Høeg Müller, Martin Haulrich.
- 2007-2010: The Discontinuous Grammar System. This project seeks to develop dependency-based models of human parsing and translation, exploring a new serial repair-based processing architecture based on local search. The project is funded by the Danish Research Council for the Humanities. Members: Matthias Buch-Kromann.
- 2007-2009 Comprehension and Text Production in Translation and Interpreting Hybrids (FKK Project): Using eyetracking, keylogging, voice recognition, and audio recording, this project aims at exploring the interaction of comprehension and production processes in hybrid forms such as sight translation, which is hybrid in the sense that it involves production of oral target text from a written source and therefore demonstrates a cross of features from traditional (written) translation and (oral) interpreting. The theoretical aim is to explore the different forms of interaction in closely related translation genres, and the practical aim is to demonstrate the positive and/or negative effects of convergence of the written and spoken modes.Members: Arnt Lykke Jakobsen (principal investigator), Barbara Dragsted, Inge Gorm Hansen.
- 2006-2009 Eye-to-IT. Development of Human-Computer Monitoring and Feedback Tools and Technologies for the Purposes of Studying Cognition and Translation (EU-FP6 project): This project seeks to build an application that can be used at once for studying human translational behaviour and to support practical translation. The application interprets information about a translator's eye and finger movements, provides insight into comprehension processes and text production processes, and also finds out when a translator needs help and provides that help instantly. Partners: University of Tampere, New Bulgarian University, Tobii Technology AB, Technical University Graz, University of Oslo, Copenhagen Business School. Members from CBS: Arnt Lykke Jakobsen (principal investigator), Barbara Dragsted, Michael Carl, Christian Jensen, Kristian Tangsgaard Hvelplund Jensen, Annette Camilla Sjørup.
Other collaborative projects within the environment:
- Translog (development variously funded): The first version of Translog was created in 1995 by Arnt Lykke Jakobsen. It was programmed by Lasse Schou in TurboPascal 5.5 under the DOS operating system. In 1999, a version for Windows 95 programmed in Delphi 6.0 was published under the label Translog2000, also by Lasse Schou. In 2006, a .NET version, introducing Unicode 8, XML and many aditional features was programmed in C#. This version was also programmed by Lasse Schou. In 2009, following attempts in the Eye-to-IT project to integrate keylogging with eyetracking, computer scientist Morten Lemvigh restructured the program. Programming changes were also introduced by Michael Carl and implemented by Jakob Elming. At the time of writing, this work is still in progress.
The following PhD projects are conducted within the environment:
- Martin Haulrich: Building parallel dependency treebanks
The main goal of the project is to create a statistical synchronous parser that can reduce the work involved in creating parallel treebanks.
If each language in a parallel corpus is parsed separately and afterwards aligned to create a parallel treebank the result is not always satisfactory because the two analysis differ to much to make a good alignment. In order to reduce the magnitude of this problem the projects aims at implementing a system that parses both languages at the same time, and lets the two parses interact in order to obtain parses that are more alike.
- Morten Gylling-Jørgensen: The Structure of Discourse: A Corpus-based Cross-language Study. The project will determine and describe the upper level for language, i.e. discourse, which involves text relations beyond the syntactic structure. The traditional linguistic research has mainly been concentrated inside syntax, but new development in computational linguistics, e.g. linguistic annotation of corpora, has facilitated studies in larger text units, such as discourse units. A better understanding of discourse does not only lead to a better ability to communicate and to strengthen the cohesion in a text, but it also contributes to a more precise understanding of the structure of texts in various languages. The primary aim of the project is to prove and explain language typological differences in the discourse structure between the two well-known language families Germanic and Romance. This will help improving translation processes and foreign language pedagogy.
- Kristian Tangsgaard Hvelplund Jensen: Eye movements, cognition and translation
Project description: Using a combination of eye tracking and keylogging technologies, this project aims at exploring the distribution of attention during translation from English to Danish. The experimental texts used in the study differ with respect to their levels of complexity, and a time constraint was introduced in a number of tasks. More specifically, it aims at investigating (1) the frequency with which attention shifts between source text and target text during translation, and (2) the duration of the attentional segments based on segment type.
- Annette Camilla Sjørup: The process of translating metaphors: an analysis of the processes and behaviours of the translator when translating metaphors.
Project description: The project uses eye-tracking and keylogging technologies to seek a pattern in gaze and typing activity for the translation of metaphorical expressions. The project hypothesises that gaze times will be longer for metaphorical expressions than for non-metaphorical expressions. The baseline measure used is total gaze time/character to enable comparisons between segments of varying length.
Sidst opdateret af Lotte Jelsbech Knudsen 14.09.2010