A data module to estimate global powerplant capacities for any region in the world at any resolution.
This is a modular snakemake workflow created as part of the Modelblocks project. It can be imported directly into any snakemake workflow.
For more information, please consult the Modelblocks documentation,
the integration example,
and the snakemake documentation.
Data processing steps:
- Stable version-controlled global datasets are downloaded, including:
- Disaggregated powerplant statistics from GEM, Transition-Zero, and GloHydroRES.
- National-level statistics from the EIA.
- Individual powerplants are prepared into point-source categories: bioenergy, fossil, geothermal, hydropower, nuclear, large_solar, and wind.
- Fuel-burning powerplants (fossil, bioenergy) are assigned unique fuel-classes depending on the combination of fuels they utilise.
- For utility-scale solar projects, satellite detected TZ-Solar Asset Mapper facilities are matched to GEM-Global Solar Power Tracker data to obtain a highly complete dataset of large-scale solar facilities.
- Powerplants are selected according to the shapes file provided by the user. Depending on the configuration, their placement may be adjusted per technology and country.
- Powerplant start and end dates are imputed per category/technology using the configuration.
lifetime_yearsdetermines overall technology lifetime.retirement_delay_yearsdetermines the remaining years of powerplants currently operating beyond their expected lifetime.
Note
Powerplant start/end dates are only imputed if they are not provided in the original dataset.
- Optionally, powerplant capacities are adjusted evenly per category and country to match EIA statistics.
Important
This stage may significantly inflate/deflate individual powerplants. We encourage users to carefully assess if this adjustment is merited by their use-case.
- Powerplant capacity is aggregated to the provided shapes, for either adjusted or unadjusted powerplants.
- Solar is processed as a special case because rooftop PV panels are not covered in GEM or Transition-Zero data.
- Per country:
$solar_{rooftopPV} = solar_{nationalStatistics} - solar_{largeScale}$ . - A user-provided proxy raster is used to determine how to disaggregate
$solar_{rooftopPV}$ . - This proxy is used to determine the aggregated rooftop PV capacity per-shape.
- The final aggregated
solaroutput combineslarge_solarfacilities with proxied rooftop PV capacity.
- Per country:
Note
Due to this assumption, the lifetime of rooftop PV capacity is left undetermined.
- The current reference year for operating capacity, national statistics adjustment, and status imputation is
2024. - User-provided shapes should add up to whole countries. This is required for national statistics and rooftop PV proxying to remain meaningful.
- Adjusted outputs rescale operating powerplants to match EIA national category totals. Future and retired plants are kept unchanged.
large_solaris available as a point-source category for utility PV and CSP.solaris only available for aggregated outputs because rooftop PV is represented through a proxy raster rather than individual plant points.
Please consult the configuration README and the configuration example for a general overview on the configuration options of this module.
Required user inputs:
<shapes>: GeoParquet file with the target regional disaggregation. It must containshape_id,country_id(ISO-3),shape_class(landormaritime), and valid polygon geometry.<proxy_rooftop_pv>: GeoTIFF proxy raster to use for adjusted aggregatedsolaroutputs.
Optional user inputs:
<imputed_powerplants>: category-specific GeoParquet files with additional point-source powerplants. These can add missing facilities or replace source records when combined with theexcluded_idsconfiguration value.<wemi>: Wind Energy Market Intelligence.xlsfile, required only whencategory.wind.sourceis set towemi.
Main outputs:
<powerplants>: disaggregated point-source powerplants.unadjustedoutputs are available forbioenergy,fossil,geothermal,hydropower,nuclear,large_solar, andwind;adjustedoutputs are available for the same categories exceptlarge_solar.<aggregated_capacity>: capacity aggregated to the user-provided shapes.bioenergy,fossil,geothermal,hydropower,nuclear, andwindare available asadjustedorunadjusted;large_solaris available asunadjusted;solaris available asadjusted.
Please consult the interface file for exact path variables and wildcards.
We use pixi as our package manager for development.
Once installed, run the following to clone this repository and install all dependencies.
git clone git@github.com:modelblocks-org/module_powerplants.git
cd module_powerplants
pixi install --allPlease be aware that this is a multi-environment project (see pixi.toml for details).
default: used for development and integration testing. Because it containsSnakemake,condaandpytestas dependencies it should not be used inSnakemakerules.module: contains minimal dependencies used inSnakemakerules. If modified, be sure to export it toSnakemakeso it can be recreated by module users:
# create module.yaml and conda-spec pin files in workflow/envs/
pixi run export-snakemake-env moduleFor testing, simply run:
pixi run test-integrationTo test a minimal example of a workflow using this module:
pixi shell # activate this project's environment
cd tests/integration/ # navigate to the integration example
snakemake --use-conda --cores 2 # run the workflow!This module is based on the following research and datasets: For specific versions please consult our stable dataset repository.
- Global Energy Monitor datasets. https://globalenergymonitor.org/. License: CC BY 4.0.
- Global Bioenergy Power Tracker
- Global Coal Plant Tracker
- Global Geothermal Power Tracker
- Global Nuclear Power Tracker
- Global Oil and Gas Plant Tracker
- Global Solar Power Tracker
- Global Wind Power Tracker
- Global Hydropower powerplants. Shah, J., Hu, J., Edelenbosch, O., & van Vliet, M. T. H. (2024). GloHydroRes - a global dataset combining open-source hydropower plant and reservoir data [Data set]. Zenodo. https://doi.org/10.5281/zenodo.14526360. License: CC BY 4.0.
- National capacity dataset. U.S. Energy Information Administration (Oct 2008). https://www.eia.gov/international/overview/world. License: Public domain.
- Satellite Utility-scale PV dataset. TransitionZero Solar Asset Mapper, TransitionZero. https://www.transitionzero.org/products/solar-asset-mapper. License: CC BY-NC 4.0.
Thanks goes to these wonderful people, sorted alphabetically (emoji key):
This project follows the all-contributors specification. Contributions of any kind welcome!





