Skip to contents

A merged dataset of 952 abbreviations, acronyms, initialisms, honorifics, military ranks, titles, months, measurements, place abbreviations, publication abbreviations, organizational abbreviations, legal terms, Latin abbreviations, scientific terms, stage directions, fictional character names, time abbreviations, slang, and firearm caliber designations. Combines the former english_acronyms and english_honorifics datasets with additional abbreviation categories from the textnorm/ECHNAE project into a single unified dictionary with consistent schema and full_form expansions for all entries.

Format

A data frame with 952 rows and 5 variables.

Source

ECHNAE Project (MIT), textnorm/ECHNAE

Variables

  • form. the abbreviation, acronym, or initialism

  • full_form. full expanded form (e.g., "Doctor" for "dr.", "F.Y.I." for "FYI", "National Aeronautics and Space Administration" for "NASA")

  • category. type of abbreviation: honorific, military, title, month, measurement, measurement_time, publication, place, organization, versus, abbreviation, initialism, acronym, academic, economic, education, fictional, finance, firearm, latin, legal, medical, misc, scientific, slang, stage_direction, technology, time

  • description. brief description of the entry type or context (e.g., "honorific or title prefix", "initialism", "government agency", "unit of measurement"). For initialisms, provides classification such as "government agency", "company", "organization", "technology", "medical condition", etc.

  • source. data source attribution