Skip to contents

A dataset mapping English words to phonaesthemes — sound patterns (onset clusters and rimes) that carry associative meaning. Each row pairs a word with a phonaestheme, its type (onset or rime), the semantic domain the sound evokes, and the scholarly source.

Format

A data frame with 640 rows and 5 variables.

Source

Bergen, B. K. (2004). The psychological reality of phonaesthemes. Language, 80(2), 290-311; Bolinger, D. L. (1950). Rime, assonance, and morpheme analogy. Word, 6, 137-144; Marchand, H. (1969). The categories and types of present-day English word-formation (2nd ed.). Beck; Waugh, L. R. (1994). Degrees of iconicity in the lexicon. Journal of Pragmatics, 22, 55-71; Hutchins, S. (1998). The psychological reality, variability, and compositionality of English phonaesthemes. Proceedings of the 20th Annual Meeting of the Cognitive Science Society.

Details

Phonaesthemes are submorphemic sound clusters that recur across words sharing a semantic field. For example, the onset cluster gl- recurs in words relating to light and vision (glow, gleam, glint), while the rime -ash recurs in words of violent impact (crash, smash, dash).

This dictionary covers 25 phonaesthemes (19 onset clusters and 6 rime patterns) drawn from published inventories by Bergen (2004), Bolinger (1950), Marchand (1969), Waugh (1994), Hutchins (1998), and Wikipedia (CC-BY-SA 4.0). When multiple sources cite the same word-phonaestheme pairing, sources are merged with semicolons.

Variables

  • term. the English word (lowercase)

  • phonaestheme. the sound pattern, either an onset cluster (e.g., "gl-") or a rime (e.g., "-ash")

  • type. whether the phonaestheme is an onset cluster ("onset") or a rime ("rime")

  • semantic_domain. the associative meaning domain the sound pattern evokes (e.g., "light_vision", "violent_impact")

  • source. scholarly attribution, semicolon-separated when multiple (e.g., "bolinger_1950;marchand_1969")