A dataset of subjective age-of-acquisition (AoA) ratings and word familiarity measures for over 31,000 English content words (nouns, verbs, and adjectives). Age of acquisition captures when a word is typically learned, which is a strong predictor of lexical processing speed beyond word frequency alone.
Source
Kuperman, V., Stadthagen-Gonzalez, H., & Brysbaert, M. (2012). Age-of-acquisition ratings for 30,000 English words. Behavior Research Methods, 44, 978-990. doi:10.3758/s13428-012-0210-4
Details
Ratings were collected via Amazon Mechanical Turk from nearly 2,000
US-based participants. Participants estimated the age (in years) at
which they learned each word. Words unknown to a participant were
recorded as "don't know" responses, providing both an AoA estimate
and a measure of how widely known each word is (pct_known).
Variables
term. the English lemma (lowercase)
aoa_mean. mean age-of-acquisition rating in years (NA for words where no rater provided a numeric estimate)
aoa_sd. standard deviation of AoA ratings
n_total. total number of valid responses (numeric + "don't know")
n_numeric. number of numeric AoA responses
pct_known. percentage of raters who knew the word (0-100)
freq_pm. SUBTLEX-US frequency per million words
source. data source attribution (
"kuperman_2012")
