Skip to contents

A dataset of 3,230 articles, 190 sampled from each news organization. Articles were first subset to only those including "immigration" or "immigrants," then 190 articles were sampled from each news organization. Organizations were labeled by their approximate political lean.

Usage

data(corpus_atn_immigr)

Format

A data frame with 3230 rows and 8 variables.

Variables

  • doc_id. Unique ID for each article

  • title. Title of the article

  • author. Author of the article (if provided)

  • date. Date of publication

  • content. Full text of the article

  • year. Year of publication

  • publication. News organization publishing the article

  • lean. Approximate political lean (Right, Left, Neither) of news organization