This is an R package to download and load pretrained text analysis models. Some models are quite large and must be downloaded separately first. See also
A few smaller topic models are included when the package is installed:
These can be loaded directly with `data()`.
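As a minimal sketch, the bundled models can be listed and loaded with base R's `data()`. This assumes the package registers its models as standard R datasets, which the `data()` call used for the embedding models suggests but which is not confirmed here. The variable name `available` is only illustrative.

```r
library(text2map.pretrained)

# list the datasets registered by the package
# (assumption: bundled models appear in this listing)
available <- data(package = "text2map.pretrained")$results[, "Item"]
print(available)

# load the first listed model by name;
# `list =` is needed because the name is held in a variable
data(list = available[1], package = "text2map.pretrained")
```

Passing the name through `list =` matters because `data()` treats a bare argument as the literal dataset name rather than evaluating it.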
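Word embedding models are much larger and must first be downloaded to your machine. Then they can be loaded with `data()`:

```r
# ~1 million fastText word vectors
mod <- "vecs_fasttext300_wiki_news"

# download the model once per machine
download_pretrained(mod)

# load the model each session
# (data() does not evaluate a bare variable name, so pass it via `list =`)
data(list = mod)

# assumption: the loaded object is named after the model
wv <- get(mod)
dim(wv)
```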
Below are the currently available word embedding models.
| MODEL | N TERMS | N DIMS | METHOD |
|-------|---------|--------|--------|
There are four related packages hosted on GitLab:
- text2map: text analysis functions
- text2map.corpora: 13+ text datasets
- text2map.dictionaries: norm dictionaries and word frequency lists
- text2map.theme: changes ggplot2 aesthetics and loads the viridis color scheme as default
The above packages can be installed using the following:
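A minimal sketch of one way to install them from GitLab with the `remotes` package. The `culturalcartography` namespace is taken from the issue tracker URL of this package and is assumed to apply to the related repositories as well. The name `text2map.theme` for the ggplot2 theme package is likewise an assumption.

```r
# install.packages("remotes")  # if not already installed
library(remotes)

# related packages; "text2map.theme" is an assumed repository name
pkgs <- c(
  "text2map",
  "text2map.corpora",
  "text2map.dictionaries",
  "text2map.theme"
)

# assumption: all repositories live under the "culturalcartography" namespace
for (pkg in pkgs) {
  install_gitlab(paste0("culturalcartography/", pkg))
}
```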
We welcome new models. If you have an embedding model or topic model that you would like to make easily available to other researchers, send us an email (maintainers [at] textmapping.com) or submit a pull request.
Please report any issues or bugs here: https://gitlab.com/culturalcartography/text2map.pretrained/-/issues