Every facility in the US that emits at least 25,000 metric tons CO2-equivalent per year and must report annually to EPA under 40 CFR Part 98 (the Greenhouse Gas Reporting Program). ~9k facilities spanning 41 industry subparts: Subpart C = stationary combustion (power plants — biggest emitters), Subpart D = petroleum refineries, Subpart H = cement, Subpart Q = iron + steel, Subpart S = lime, Subpart P = pulp + paper, Subpart W = oil + gas systems, Subpart X = petrochemicals, etc. Each facility has: name, parent company (corporate accountability), full address + geocode (lat/lng), NAICS code, FRS ID (joins to EPA ECHO compliance), reported subpart codes, most-recent year's CO2e emissions in metric tons, lifetime cumulative emissions, count of years reported. Source: EPA Envirofacts pub_dim_facility + pub_facts_sector_ghg_emission XML APIs (public, no auth, monthly refresh). Underlying authority: Clean Air Act § 114.
Every record in this dataset can be traced back to its primary source at https://www.epa.gov/ghgreporting. Underlying content is a US federal government work (public domain under 17 USC §105); our derived data is licensed CC0 1.0.