Where did we Go Wrong? A Retrospective Look at the British National Corpus

in Teaching and Learning by Doing Corpus Analysis
Restricted Access
Get Access to Full Text

Subject Highlights


The British National Corpus (BNC) has been a major influence on the construction of language corpora during the last decade, if only as a significant reference point. This corpus may be seen as the culmination of a research tradition going back to the one-million word Brown corpus of 1964, but its constitution and its industrial-scale production techniques look forward to a new world in which language-focussed engineering and software development are at the heart of the information society instead of lurking on its academic fringes.

This paper attempts to review the design and management issues and decisions taken during the construction of the BNC and to suggest what lessons have been learned over the last five years about how such corpus building exercises can most usefully be extended into the new century.

I will also describe the new World Edition of the BNC and its associated SARA retrieval package, which has been enhanced in response to user feedback to facilitate creation of a searchable version of any large-scale XML-marked-up corpus.

Teaching and Learning by Doing Corpus Analysis

Proceedings of the Fourth International Conference on Teaching and Language Corpora, Graz 19-24 July, 2000


Table of Contents




All Time Past Year Past 30 Days
Abstract Views 20 20 4
Full Text Views 11 11 3
PDF Downloads 8 8 3
EPUB Downloads 0 0 0

Related Content