This is an R Package with several datasets for English-language dictionaries useful for text analysis. See also text2map.

Installation

This is primarily a dataset package and therefore we will not be sending it to CRAN. You can install the latest version from GitLab:

library(remotes)
install_gitlab("culturalcartography/text2map.dictionaries")

library(text2map.dictionaries)

Dictionaries

The package currently includes 20+ dictionaries, primarily English, containing hand-ranked and inferred word “norms,” as well as frequency and rank information from various corpora or administrative data. Additionally, there are a few domain-specific dictionaries documenting rare words.

There are four related packages hosted on GitLab:

The above packages can be installed using the following:

install.packages("text2map")

library(remotes)
install_gitlab("culturalcartography/text2map.theme")
install_gitlab("culturalcartography/text2map.corpora")
install_gitlab("culturalcartography/text2map.pretrained")

Contributions and Support

We welcome new dictionaries – especially old or rare dictionaries! If you have a dictionary you would like to be easily available to other researchers, send us an email (maintainers [at] textmapping.com) or submit pull requests.

Please report any issues or bugs here: https://gitlab.com/culturalcartography/text2map.dictionaries/-/issues