Skip to contents

Rules for fusing historical two-word (and select three-word) spellings into their modern compound or hyphenated forms (e.g., every where -> everywhere, men of war -> men-of-war). Frequency from Google Ngrams data where available; supplement entries sourced from textnorm have mixed attribution.

Format

A data frame with 259 rows and 5 variables.

Source

Ted Underwood / DataMunging (CC-BY); curated supplement

Variables

  • word1. first word of the bigram (or first word of a 3-word phrase)

  • word2. second word of the bigram (may contain a space for 3-word phrases, e.g., "of war")

  • fused. modern fused or hyphenated form

  • frequency. frequency count from Google Ngrams (NA for supplement entries without frequency data)

  • source. data source attribution ("Ted Underwood / DataMunging" or "curated")