Search Results

Abstract

In order to adjust observed frequencies of occurrence, previous studies have suggested a variety of measures of dispersion and adjusted frequencies. In a previous study, I reviewed many of these measures and suggested an alternative measure, DP (for ‘deviation of proportions’), which I argued to be conceptually simpler and more versatile than many competing measures. However, despite the relevance of dispersion for virtually all corpus-linguistic work, it is still a very much under-researched topic: to the best of my knowledge, there is not a single study investigating how different measures compare to each other when applied to large datasets, nor is there any work that attempts to determine how different measures match up with the kind of psycholinguistic data that dispersions and adjusted frequencies are supposed to represent. This article takes exploratory steps in both of these directions.

In: Corpus-linguistic applications

Abstract

In order to adjust observed frequencies of occurrence, previous studies have suggested a variety of measures of dispersion and adjusted frequencies. In a previous study, I reviewed many of these measures and suggested an alternative measure, DP (for ‘deviation of proportions’), which I argued to be conceptually simpler and more versatile than many competing measures. However, despite the relevance of dispersion for virtually all corpus-linguistic work, it is still a very much under-researched topic: to the best of my knowledge, there is not a single study investigating how different measures compare to each other when applied to large datasets, nor is there any work that attempts to determine how different measures match up with the kind of psycholinguistic data that dispersions and adjusted frequencies are supposed to represent. This article takes exploratory steps in both of these directions.

In: Corpus-linguistic applications
This series of lectures provides an overview of the author's work on quantitative applications in cognitive linguistics by discussing a wide range of studies involving corpus-linguistic as well as experimental work. After a discussion of how corpus linguistics, cognitive linguistics, and psycholinguistics relate to each other, the author discusses empirical and statistical studies of a wide variety of phenomena including morphophonology (morphological blends and alliteration effects), corpus-based cognitive semantics, frequency and association at the syntax-lexis interface. The book concludes with chapters exemplifying the role that bottom-up approaches can take, the role of statistical methods more generally, and the role of converging evidence from corpus and experimental data.The lectures for this book were given at The China International Forum on Cognitive Linguistics in May 2013.

In the e-book version all handouts have been made available at the back. All audio of the lectures as well as the handouts are available for free, in Open Access, here.

Applications for Usage-Based and Psycholinguistic Research
In this book, Stefan Th. Gries provides an overview on how quantitative corpus methods can provide insights to cognitive/usage-based linguistics and selected psycholinguistic questions. Topics include the corpus linguistics in general, its most important methodological tools, its statistical nature, and the relation of all these topics to past and current usage-based theorizing. Central notions discussed in detail include frequency, dispersion, context, and others in a variety of applications and case studies; four practice sessions offer short introductions of how to compute various corpus statistics with the open source programming language and environment R.
In: Ten Lectures on Corpus Linguistics with R
In: Ten Lectures on Corpus Linguistics with R
In: Ten Lectures on Corpus Linguistics with R
In: Ten Lectures on Corpus Linguistics with R
In: Ten Lectures on Corpus Linguistics with R
In: Ten Lectures on Corpus Linguistics with R