Skip to contents

A dataset containing 982 New York times articles about COVID-19, published between November 1, 2020 and November 26, 2020 Originally collected using the `rtimes` package on May 26, 2021. The dataset does not contain the full text for the articles, but does have several short texts: headline, abstract, snippet, and lead paragraph.

Usage

data("corpus_nytimes_covid")

Format

A data frame with 982 rows and 28 variables.

Source

https://developer.nytimes.com/docs/articlesearch-product/1/overview

Details

  • abstract. character vectors

  • web_url. character vectors

  • snippet. character vectors

  • lead_paragraph. character vectors

  • source. character vectors

  • multimedia. nested lists of character vectors

  • keywords. nested lists of character vectors

  • pub_date. date/time vectors

  • document_type. character vectors

  • news_desk. character vectors

  • section_name. character vectors

  • type_of_material. character vectors

  • word_count. integer vectors

  • uri. character vectors

  • print_section. character vectors

  • print_page. integer vectors

  • subsection_name. character vectors

  • headline_main. character vectors

  • headline_kicker. character vectors

  • headline_print_headline. character vectors

  • byline_original. character vectors

  • byline_person. nest lists of character vectors

  • byline_organization. character vectors