Skip to contents

A dataset of 20,137 English words with their syllable counts, sourced from the Nettalk Corpus (Sejnowski & Rosenberg, 1987) via qdapDictionaries.

Format

A data frame with 20,137 rows and 3 variables.

Source

Sejnowski, F. J. & Rosenberg, C. R. (1987). Parallel networks that learn to pronounce English text. Complex Systems, 2, 145-168. Via qdapDictionaries (GPL-2)

Variables

  • word. English word

  • syllables. number of syllables

  • source. data source attribution