Chapter 11 An Information-Theoretic Approach to Modeling Diachronic Change in Scientific English

In: From Data to Evidence in English Language Research
Authors:
Stefania Degaetano-Ortlieb
Search for other papers by Stefania Degaetano-Ortlieb in
Current site
Google Scholar
PubMed
Close
,
Hannah Kermes
Search for other papers by Hannah Kermes in
Current site
Google Scholar
PubMed
Close
,
Ashraf Khamis
Search for other papers by Ashraf Khamis in
Current site
Google Scholar
PubMed
Close
, and
Elke Teich
Search for other papers by Elke Teich in
Current site
Google Scholar
PubMed
Close

Purchase instant access (PDF download and unlimited online access):

$40.00

Abstract

We present an information-theoretic approach to investigate diachronic change in scientific English. Our main assumption is that over time scientific English has become increasingly dense, i.e. linguistic constructions allowing dense packing of information are progressively used.

So far, diachronic change in scientific writing has been investigated by means of frequency-based approaches (see e.g. Halliday (1988); Atkinson (1998); Biber (2006b, c); Biber and Gray (2016); Banks (2008); Taavitsainen and Pahta (2010)). We use information-theoretic measures (entropy, surprisal; Shannon (1949)) to assess features previously stated to change over time and to discover new, latent features from the data itself that are involved in diachronic change.

For this, we use the Royal Society Corpus (rsc) (Kermes et al. (2016)), which spans over the time period 1665 to 1869. We present three kinds of analyses: nominal compounding (typical of academic writing), modal verbs (shown to have changed in frequency over time), and an analysis based on part-of-speech trigrams to detect new features that change diachronically. We show how information-theoretic measures help to investigate, evaluate and detect features involved in diachronic change.

  • Collapse
  • Expand

Metrics

All Time Past Year Past 30 Days
Abstract Views 750 96 7
Full Text Views 56 12 0
PDF Views & Downloads 56 12 0