Corpora (included)Corpora installed with the package |
|
---|---|
Subset of 6 Corpora for the SentiStrength Benchmark |
|
Abstracts from the Annual Review of Sociology, 2020 |
|
Balanced Sample of Immigration related articles from All the News Corpus |
|
Lyrics of Beyonce's Songs |
|
Sample of 100 Blogposts from the CMU 2008 Political Blog Corpus |
|
Environmental Sociology Article Abstracts, 1990-2014 |
|
Sample from European Parliament Proceedings Parallel Corpus |
|
Subset of Amazon Fine Food Reviews Corpus, 2011-2012 |
|
Sample of 2,000 ISOT Fake News Dataset |
|
Immigration Think Tank Press Release (ITTPR) Corpus, 1998-2020 |
|
U.S. Presidential Speeches, 1952-1996 |
|
Subset of Community Ethical Judgements on Real-Life Anecdotes Corpus |
|
Lyrics of Taylor Swift's Songs |
|
Lines from Star Trek: The Next Generation, Season 5 |
|
National Security Strategy of the United States, 1987-2017 |
|
Corpora (downloaded)Corpora which must be downloaded first |
|
6 Corpora for the SentiStrength Benchmark |
|
Figure Eight Disaster Tweets |
|
Internal Emails from Enron Email Corpus |
|
New York Times Articles about COVID-19, 2020 |
|
Lines from three books by W.E.B DuBois |
|
ISOT Fake News Dataset |
|
DJS VOX Articles Corpus, 2014-2017 |
|
Pitckfork Reviews, 1999-2019 |
|
All The News (ATN) Corpus 1.0, 2015-2017 |
|
All The News (ATN) Corpus 2.0, 2016-2020 |
|
Amazon Fine Food Reviews Corpus, 2011-2012 |
|
Community Ethical Judgements on Real-Life Anecdotes Corpus |
|
Lines from Black Mirror |
|
FunctionsHelper functions |
|
Download specified corpus |
|
Tweet IDsTweet IDs which can be “rehydrated” |
|
Tweet IDs for 1,922 tweets using #Covid19 collected in 2021 |
|
Tweet IDs for 1,999 geo-tagged tweets #Covid19 collected in 2021 |
|
Tweet IDs of 15,594 tweets using the $GME (GameStop Ticker) |
|
Tweet IDs for 23,737 tweets using #StayHome collected in 2021 |