Embeddings functionsFunctions for word embeddings |
|
---|---|
Calculate Concept Mover's Distance |
|
Performs Concept Class Analysis (CoCA) |
|
Word embedding semantic centroid extractor |
|
Word embedding semantic direction extractor |
|
Word embedding semantic region extractor |
|
Gets anchor terms from precompiled anchor lists |
|
Find the 'projection matrix' to a semantic vector |
|
Find the 'rejection matrix' from a semantic vector |
|
Find a specified matrix transformation |
|
Evaluate anchor sets in defining semantic directions |
|
DTM FunctionsFunctions for document-term matrices |
|
A fast unigram DTM builder |
|
Removes terms from a DTM based on rules |
|
Gets DTM summary statistics |
|
Resamples an input DTM to generate new DTMs |
|
Melt a DTM into a triplet data frame |
|
Inference FunctionsFunctions for inference and robustness checks |
|
Monte Carlo Permutation Tests for Model P-Values |
|
Build a Random Corpus |
|
Build Multiple Random Corpora |
|
General FunctionsGeneral functions for text analysis |
|
A fast unigram vocabulary builder |
|
Represent Documents as Token-Integer Sequences |
|
Gets stoplist from precompiled lists |
|
A very tiny "gender" tagger |
|
Textnet FunctionsFunctions for textual networks |
|
Find a specified document centrality metric |
|
Find a similarities between documents |
|
DatasetsIncluded datesets |
|
A dataset of anchor lists |
|
A dataset of stoplists |
|
Sample of fastText embeddings |
|
Full Text of JFK's Rice Speech |
|
Metadata for Shakespeare's First Folio |
|
MethodsMethods for specific classes |
|
Plot CoCA |
|
Prints CoCA class information |