A dataset containing the lines from three books by the sociologist W.E.B. DuBois, acquired from Project Gutenberg: - The Souls of Black Folk (1903) - Darkwater (1920) - The Conservation of Races (1897)

data("corpus_web_dubois")

Format

A data frame with 12,757 rows and 5 variables.

Source

`gutenbergr`

Variables

  • gutenberg_id. ID for the work on Project Gutenberg

  • text. Line from a book

  • title. Full title of the book

  • short_title. Shortened title for the book

  • pub_year. Year the book was originally published