Statistical Methods in Linguistic Research (29 - 30 September and 27 - 28 October 2011)
Faculty
Laura Winther Balling, Assistant Professor in Experimental Psycholinguistics, Søren Feodor Nielsen and/or Peter Dalgaard (both professors in Statistics, CBS)
Course Coordinator
Assistant Professor Laura Winter Balling
Prerequisite/progression of the course
The course builds to some extent on concepts and methods introduced in “Statistical methods in linguistic research. Module 1: Elementary methods". Participants who have not attended that course should read the course book (S. Rasinger,
Quantitative Research in Linguistics. An Introduction, Continuum, 2008) for that module and do the exercises in the book. Further details about Module I may be found on
http://gradeast.dk/kurser/statistik2010/
. Some time will be devoted at the beginning of the course to brush up the main points of the first module.
The course does not assume any prior knowledge about mathematics or statistics. The course builds mainly on examples from linguistics, but a background in linguistics is not a prerequisite.
Aim of the course
The purpose of the course is to introduce the participants to statistical tools and methodologies that will enable them to conduct statistical analyses in their PhD dissertations and further linguistic research. The course focuses on the practical applications of statistics and includes a substantial amount of hands-on examples and exercises. The course is of interest to all students who are doing or plan to do quantitative research.
Course content, structure and teaching
The participants should bring a laptop with the R statistics package installed; the R package can be downloaded for free from www.r-project.org and runs under Windows, Mac and Linux. Participants who do not have access to a laptop should contact the organisers. For the first of the two modules (September 29-30), please read chapters 1 to 4 of Baayen’s book and try all the examples in the first two chapters in the R statistics package. The goal of the preparation is to make the participant familiar with R, and to allow the participant to formulate questions to the first two chapters. The remaining chapters will be discussed in the second of the two modules, more detail will be provided later.
Learning Objectives
The participants will learn how to conduct statistical analyses on real-world data sets from a range of linguistic domains, using the R statistics software package. In particular, they will learn how to utilize graphical data exploration, statistical hypothesis testing, and regression analysis with a focus on mixed-effect models that are well suited for addressing many questions in linguistics. Datasets from different subfields will be provided, but participants who have their own data will also get the chance to work on them.
Participants who want full credit for the course must write a final report in which they use the methods from the course to address a research problem within their own research area. The final report must describe the linguistic problem the participant wants to examine, the type of conclusions the participant would like to draw, and the data collection and statistical analysis that is needed in order to support the conclusions.
Teaching methods
The course will consist of a mixture of lectures, discussions and practical exercises. The course will be taught in English.
Course literature
R.H. Baayen, Analyzing Linguistic Data. A Practical Introduction to Statistics using R, Cambridge University Press, 2008.
Sidst opdateret af Katja Høeg Tingleff 05.09.2011