A merged dataset of 952 abbreviations, acronyms, initialisms, honorifics, military ranks, titles, months, measurements, place abbreviations, publication abbreviations, organizational abbreviations, legal terms, Latin abbreviations, scientific terms, stage directions, fictional character names, time abbreviations, slang, and firearm caliber designations. It combines the former english_acronyms and english_honorifics datasets with additional abbreviation categories from the textnorm/ECHNAE project into a single dictionary with a consistent schema and a full_form expansion for every entry.
Variables
form. the abbreviation, acronym, or initialism
full_form. full expanded form (e.g., "Doctor" for "dr.", "F.Y.I." for "FYI", "National Aeronautics and Space Administration" for "NASA")
category. type of abbreviation: honorific, military, title, month, measurement, measurement_time, publication, place, organization, versus, abbreviation, initialism, acronym, academic, economic, education, fictional, finance, firearm, latin, legal, medical, misc, scientific, slang, stage_direction, technology, time
description. brief description of the entry type or context (e.g., "honorific or title prefix", "initialism", "government agency", "unit of measurement"). For initialisms, it gives a classification such as "government agency", "company", "organization", "technology", or "medical condition".
source. data source attribution
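
The five variables above lend themselves to a simple lookup table keyed on form. The following is a minimal Python sketch of that idea, not part of the dataset itself. The two sample rows reuse the "dr." and "NASA" examples from the variable descriptions; their category, description, and source values, the helper names build_index and expand, and the case-insensitive matching are all assumptions made for illustration.

```python
from collections import defaultdict

# Illustrative rows only. The form/full_form pairs come from the examples in
# the variable descriptions; category, description, and source are assumed.
SAMPLE_ROWS = [
    {"form": "dr.", "full_form": "Doctor", "category": "honorific",
     "description": "honorific or title prefix", "source": "example"},
    {"form": "NASA",
     "full_form": "National Aeronautics and Space Administration",
     "category": "acronym", "description": "government agency",
     "source": "example"},
]


def build_index(rows):
    """Group rows by lowercased form (hypothetical helper).

    A list is kept per key because the same form could, in principle,
    appear under more than one category.
    """
    index = defaultdict(list)
    for row in rows:
        index[row["form"].lower()].append(row)
    return index


def expand(token, index, category=None):
    """Return the full_form for token, or token unchanged if not found.

    Matching is case-insensitive (an assumption); category optionally
    narrows the candidates to one value of the category variable.
    """
    matches = index.get(token.lower(), [])
    if category is not None:
        matches = [r for r in matches if r["category"] == category]
    return matches[0]["full_form"] if matches else token


INDEX = build_index(SAMPLE_ROWS)
print(expand("Dr.", INDEX))                       # Doctor
print(expand("NASA", INDEX, category="acronym"))  # National Aeronautics and Space Administration
print(expand("xyz", INDEX))                       # xyz
```

Returning the token unchanged on a miss keeps the function safe to apply to every token in a text. Filtering on category is one way to disambiguate a form that has several expansions.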
