A corpus containing lines from every episode of Star Trek: The Next Generation, Season 5. Scripts obtained from the `rtrek` package.

data(corpus_tng_season5)

Format

A data frame with 10,834 rows and 5 variables.

Source

`rtrek`

Variables

  • number. Episode number

  • title. Episode title

  • airdate. Date the episode aired

  • character. Character speaking the line

  • line. Text of the line spoken