This volume is witness to a spirited and fruitful period in the evolution of corpus linguistics. In twenty-two articles written by established corpus linguists, members of the ICAME (International Computer Archive of Modern and Mediaeval English) association, this new volume brings the reader up to date with the cycle of activities which make up this field of study as it is today, dealing with corpus creation, language varieties, diachronic corpus study from the past to present, present-day synchronic corpus study, the web as corpus, and corpus linguistics and grammatical theory. It thus serves as a valuable guide to the state of the art for linguistic researchers, teachers and language learners of all persuasions.
After over twenty years of evolution, corpus linguistics has matured, incorporating nowadays not just small, medium and large primary corpus building but also specialised and multi-dimensional secondary corpus building; not just corpus analysis, but also corpus evaluation; not just an initial application of theory, but self-reflection and a new concern with theory in the light of experience.
The volume also highlights the growing emphasis on language as a changing phenomenon, both in terms of established historical study and the newer short-range diachronic study of 20th century and current English; and the growing area of overlap between these two.
Another section of the volume illustrates the recent changes in the definition of ‘corpus’ which have come about through the emergence of new technologies, in particular the availability of texts on the world wide web.
The volume culminates in the contributions by a group of corpus grammarians to a timely and novel discussion panel on the relationship between corpus linguistics and grammatical theory.
Analyses and Techniques in Describing English
Editor: John M. Kirk
Aspects of Theory and Description
Author: Joe Trotta
This study provides the first description-oriented, theoretically-unaligned account of wh-clauses in Modern English. The author employs a data-based approach to examine aspects of both generative and non-generative work as regards their relative strengths and weaknesses.
Wh-clauses in English: Aspects of Theory and Description is a unique combination of statistical findings and qualitative analysis. It is underpinned by a systematic investigation of the Brown University corpus and also includes attested material from other sources, such as the British National Corpus and the CobuildDirect corpus, as well as material gleaned from the internet.
The qualitative and quantitative analyses are combined to approach a wide range of theoretical and descriptive issues, including wh-movement, landing-sites for moved wh-XPs, vacuous movement and island constraints. Not insignificantly, many questions of indeterminacy are addressed, such as the interface of conjunctions and relative words, the problems of demarcation between interrogatives and free relatives, and structural ambiguities between interrogatives and exclamatives.
The Goddess in Indo-Caribbean Ritual and Fiction
Translating Kali's Feast is an interdisciplinary study of the Goddess Kali bringing together ethnography and literature within the theoretical framework of translation studies. The idea for the book grew out of the experience and fieldwork of the authors, who lived with Indo-Caribbean devotees of the Hindu Goddess in Guyana. Using a variety of discursive forms including oral history and testimony, field notes, songs, stories, poems, literary essays, photographic illustrations, and personal and theoretical reflections, it explores the cultural, aesthetic and spiritual aspects of the Goddess in a diasporic and cross-cultural context. With reference to critical and cultural theorists including Walter Benjamin and Julia Kristeva, the possibilities offered by Kali (and other manifestations of the Goddess) as the site of translation are discussed in the works of such writers as Wilson Harris, V.S. Naipaul and R.K. Narayan. The book articulates perspectives on the experience of living through displacement and change while probing the processes of translation involved in literature and ethnography and postulating links between ‘rite’ and ‘write,’ Hindu ‘leela’ and creole ‘play.’
Studies in Honour of Stig Johansson
In recent years there has been increasing interest in the development and use of bilingual and multilingual corpora. As Karin Aijmer writes in this book, 'The contrastive or comparative perspective ... makes it possible to dig deeper and to ask new questions about the relationship between languages with the aim of sharpening our conceptions of cross-linguistic correspondences and adding to our knowledge of the languages compared.'
The papers in this volume are a showcase of the great variety of purposes to which bilingual and multilingual corpora can be put. They lend themselves not only to descriptive and applied approaches but also to theory-oriented studies. The range of linguistic phenomena covered by the various approaches is very wide; the papers focus on fields of research like syntax, discourse, semantics, information structure, lexis, and translation studies. The range of languages studied comprises English, Norwegian, Swedish, German, Dutch, and Portuguese. In addition to purely linguistic papers, there are contributions on computer programs developed for the compilation and use of bilingual and multilingual corpora.
Explorations in Corpus Linguistics contains selected papers from the eighteenth International Conference on English Language Research on Computerised Corpora (ICAME 18). The papers give a broad overview of the latest activities in corpus linguistics. Issues associated with the creation of corpora are raised, topics ranging from corpus design, to problems of rare data acquisition and data protection, to the relative merits of corpora and free text collections. The main body of the volume is devoted to reports on the analysis of corpora. Several papers offer synchronic descriptions of aspects of modern language usage, in both spoken and written corpora. Some corpora are 'general' in content; those deriving from specialised textual domains include parallel corpora of international varieties of English and of learner language. The diachronic dimension of corpus-based study is also represented, in the examination of some modern-day grammatical features from a historical perspective, and by socio-pragmatic and sociolinguistic studies of diachronic corpus data. The principal aim of English corpus linguistics as reflected here is to describe language in use; there are also cases where such description forms a basis for the development of resources and tools, including specialised taggers, an Internet-based grammar, a glossary, and software to identify semantic relations and diachronic change within corpora.
Papers from the seventeenth International Conference on English Language Research on Computerized Corpora (ICAME 17)
Editor: Magnus Ljung
Corpus-based Studies in English contains selected papers from the seventeenth International Conference on English Language Research on Computerized Corpora (ICAME 17). The topics include parsing and annotation of corpora, discourse studies, lexicography, translation studies, parallel corpora, language variation and change, national varieties, methodology and English language teaching. The papers on parsing and annotation include discussions of the treatment of irregular forms, semantic/pragmatic labels in air traffic control, a comparison of tagging systems and a presentation of T-tag lexicon construction.
The papers on discourse and lexicography include a study of like as a discourse marker, thesaural relations and the lexicalisation of NPs. In translation studies one paper discusses explicitness as a universal feature of translation and the paper on parallel corpora contrasts English and Norwegian. Many papers deal with variation and change; here we find a discussion of dialogue vs. non-dialogue in modern English fiction and an account of verbal disputes in adolescent English; the historical studies deal with e.g. text type evolution, multi-verb words, normalization in Middle English prose and modalities in Early Modern English. The methodology papers discuss the use in corpus analysis of inferential statistics, probabilistic approaches to anaphora resolution and multi-method approaches to data. The ELT paper compares the use of the progressive in native and non-native compositions.
This book is about syntactic databases (a.k.a. treebanks), collections of text material in which the syntactic relations have been made visible. It starts off with a general introduction to the subject and then continues with three in-depth investigations of more specialized aspects.
In the introduction, syntactic databases are first placed in the larger context of linguistic databases, text collections with a broader linguistic annotation than just a syntactic one. Then some examples of syntactic databases are given, illustrating the range of annotation actually encountered. The introduction is completed with an investigation of database management systems for syntactic databases.
The first in-depth investigation concerns the treatment of ambiguous structures in syntactic analysis trees, focussing on a very efficient representation for such structures and the means to create this representation. Next, classroom use of syntactic databases is examined. A computer program for this purpose, CLUES, is discussed, along with a suggested series of syntax exercises. The final subject is the importance of including function and attribute information in the annotation of texts. The central line of investigation here is a probabilistic parsing experiment in which the use of function and attribute information is the main variable.
Studies in English Corpus Linguistics
The twenty papers of this volume - published to honour Gunnel Tottie - are of interest to everyone concerned with the study of the English language. The collection is a convincing argument for an approach to language studies based on the analysis of computerized corpora.
Though this is not an introduction to the field but a series of highly specialized studies, readers get a good overview of the work being done at present in English computer corpus studies. English corpus linguistics, though basically concerned with the study of varieties of English, goes far beyond the simple ordering and counting of large numbers of examples and is deeply concerned with linguistic theory based on real language data.
The volume includes sections on corpora of written and spoken present-day English, historical corpora, contrastive corpora, and on the application of corpus studies to teaching purposes.