Skip to content

JimengShi/DL-Foundation-Models-Weather

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

135 Commits
 
 
 
 

Repository files navigation

DL-Foundation-Models-for-Weather-Prediction

taxonomy

Contents

Taxonomy and Review

Taxonomy:

taxonomy

Representative review:

review

👉👉 Predictive Models

General-Purpose Large Models

CNN

  • (FourCastNet 3) FourCastNet 3: A geometric approach to probabilistic machine-learning weather forecasting at scale arXiv 2025
    [Paper] [Code]

Transformer

  • (FourCastNet) FourCastNet: A Global Data-driven High-resolution Weather Model using Adaptive Fourier Neural Operators arXiv 2022
    [Paper] [Code]

  • (FuXi) FuXi: a cascade machine learning forecasting system for 15-day global weather forecast npj Climate and Atmospheric Science 2023
    [Paper] [Code]

  • (FengWu) FengWu: Pushing the Skillful Global Medium-range Weather Forecast beyond 10 Days Lead arXiv 2023
    [Paper] [Code]

  • (FengWu-4DVar) FengWu-4DVar: Coupling the Data-driven Weather Forecasting Model with 4D Variational Assimilation arXiv 2023
    [Paper] [Code]

  • (XiChen) XiChen: An observation-scalable fully AI-driven global weather forecasting system with 4D variational knowledge arXiv 2025
    [Paper] [Code]

  • (SwinVRNN) SwinVRNN: A Data-Driven Ensemble Forecasting Model via Learned Distribution Perturbation JAMES 2023
    [Paper] [Code]

  • (SwinRDM) SwinRDM: Integrate SwinRNN with Diffusion Model towards High-Resolution and High-Quality Weather Forecasting AAAI 2023
    [Paper]

  • (Pangu-Weather) Accurate medium-range global weather forecasting with 3D neural networks Nature 2023
    [Paper] [Code]

  • (Stormer) Scaling transformer neural networks for skillful and reliable medium-range weather forecasting arXiv 2024
    [Paper] [Code]

  • (HEAL-ViT) HEAL-ViT: Vision Transformers on a spherical mesh for medium-range weather forecasting arXiv 2024
    [Paper]

  • (TianXing) linear complexity transformer model with explicit attention decay for global weather forecasting Advances in Atmospheric Sciences
    [Paper] [Code]

GNN

  • (GraphCast) Graphcast: Learning skillful medium-range global weather forecasting Science 2023
    [Paper] [Code]

  • (OneForecast) OneForecast: A Universal Framework for Global and Regional Weather Forecasting ICML 2025
    [Paper] [Code]

  • (GnnWeather) Forecasting Global Weather with Graph Neural Networks arXiv 2022
    [Paper] [Code]

  • (AIFS) AIFS -- ECMWF's data-driven forecasting system arXiv 2024
    [Paper] [Code]

  • (GraphDOP) GraphDOP: Towards skilful data-driven medium-range weather forecasts learnt and initialised directly from observations arXiv 2024
    [Paper] [Code]

PhysicsAI

  • (Conformer) Searth Transformer: A Transformer Architecture Incorporating Earth's Geospheric Physical Priors for Global Mid-Range Weather Forecasting arXiv 2026
    [Paper] [Code]

  • (ClimODE) ClimODE: Climate and Weather Forecasting with Physics-informed Neural ODEs arXiv 2024
    [Paper] [Code]

  • (WeatherODE) Mitigating Time Discretization Challenges with WeatherODE: A Sandwich Physics-Driven Neural ODE for Weather Forecasting arXiv 2024
    [Paper] [Code]

  • (NeuralGCM) Neural general circulation models for weather and climate Nature 2024
    [Paper] [Code]

  • (Conformer) STC-ViT: Spatio Temporal Continuous Vision Transformer for Weather Forecasting arXiv 2024
    [Paper] [Code]

Domain-Specific Models

Transformer

  • (SwinUnet) Spatiotemporal vision transformer for short time weather forecasting IEEE BigData 2022
    [Paper] [Code]

  • (Earthformer) Earthformer: Exploring Space-Time Transformers for Earth System Forecasting NeurIPS 2022
    [Paper] [Code]

  • (Rainformer) Rainformer: Features Extraction Balanced Network for Radar-Based Precipitation Nowcasting IEEE Geoscience and Remote Sensing Letters 2022
    [Paper] [Code]

  • (PFformer) PFformer: A Time-Series Forecasting Model for Short-Term Precipitation Forecasting IEEE Access 2024
    [Paper]

  • (OMG-HD) OMG-HD: A high-resolution ai weather model for end-to-end forecasts from observations arXiv 2024
    [Paper]

  • (U-STN) Towards physics-inspired data-driven weather forecasting: integrating data assimilation with a deep spatial-transformer-based u-net in a case study with era5 Geoscientific Model Development 2024
    [Paper] [Code]

GNN

  • (HiSTGNN) HiSTGNN: Hierarchical spatio-temporal graph neural network for weather forecasting Information Sciences 2023
    [Paper] [Code]

  • (w-GNN) Coupling Physical Factors for Precipitation Forecast in China With Graph Neural Network AGU 2024
    [Paper]

  • (WeatherGNN) WeatherGNN: Exploiting Meteo- and Spatial-Dependencies for Local Numerical Weather Prediction Bias-Correction IJCAI 2024
    [Paper]

  • (MPNNs) Multi-modal graph neural networks for localized off-grid weather forecasting arXiv 2024
    [Paper] [Code]

RNN&CNN

  • (Samudra) Samudra: An AI Global Ocean Emulator for Climate Geophysical Reseach Letters 2025
    [Paper] [Code]

  • (MetNet) MetNet: A Neural Weather Model for Precipitation Forecasting arXiv 2020
    [Paper] [Code]

  • (MetNet-3) Deep Learning for Day Forecasts from Sparse Observations arXiv 2023
    [Paper]

  • (PredRNN) PredRNN: A Recurrent Neural Network for Spatiotemporal Predictive Learning IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE 2023
    [Paper] [Code]

  • (MM-RNN) MM-RNN: A Multimodal RNN for Precipitation Nowcasting IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING 2023
    [Paper]

  • (ConvLSTM) Convolutional LSTM Network: A Machine Learning Approach for Precipitation Nowcasting NeurIPS 2015
    [Paper] [Code]

Mamba

  • (MetMamba) MetMamba: Regional Weather Forecasting with Spatial-Temporal Mamba Model arXiv 2024 [Paper]

  • (MambaDS) MambaDS: Near-Surface Meteorological Field Downscaling With Topography Constrained Selective State-Space Modeling IEEE Transactions on Geoscience and Remote Sensing 2024
    [Paper]

PhysicsAI

  • (NowcastNet) Skilful nowcasting of extreme precipitation with NowcastNet Nature 2023
    [Paper] [Code]

  • (PhysDL) Deep learning for physical processes: incorporating prior scientific knowledge IOPScience 2019
    [Paper]

  • (PhyDNet) Disentangling Physical Dynamics From Unknown Factors for Unsupervised Video Prediction CVPR 2020
    [Paper] [Code]

  • (DeepPhysiNet) DeepPhysiNet: Bridging Deep Learning and Atmospheric Physics for Accurate and Continuous Weather Modeling arXiv 2024
    [Paper] [Code]

👉👉 Generative Models

General-Purpose Large Models

Diffusion Models

  • (OmniCast) OmniCast: A Masked Latent Diffusion Model for Weather Forecasting Across Time Scales NeurIPS 2025
    [Paper] [Code]

  • (GenCast) GenCast: Diffusion-based ensemble forecasting for medium-range weather Nature 2024
    [Paper] [Code]

  • (CoDiCast) CoDiCast: Conditional Diffusion Model for Weather Prediction with Uncertainty Quantification IJCAI 2025
    [Paper] [Code]

  • (SEEDs) SEEDS: Emulation of Weather Forecast Ensembles with Diffusion Models Science Advances 2024
    [Paper] [Code]

  • (ContinuousEnsCast) Continuous Ensemble Weather Forecasting with Diffusion models ICLR 2025
    [Paper] [Code]

Domain-Specific Models

Diffusion Models

  • (LDMRain) Latent diffusion models for generative precipitation nowcast-ing with accurate uncertainty quantification arXiv 2023
    [Paper] [Code]

  • (PreDiff) PreDiff: Precipitation Nowcasting with Latent Diffusion Models NeurIPS 2023
    [Paper] [Code]

  • (CasCast) CasCast: Skillful High-resolution Precipitation Nowcasting via Cascaded Modelling ICML 2024
    [Paper] [Code]

  • (SRNDiff) SRNDiff: Short-term Rainfall Nowcasting with Condition Diffusion Model arXiv 2024
    [Paper] [Code]

  • (DiffCast) DiffCast: A Unified Framework via Residual Diffusion for Precipitation Nowcasting CVPR 2024
    [Paper] [Code]

  • (GEDRain) Precipitation nowcasting with generative diffusion models arXiv 2023
    [Paper] [Code]

GANs

  • (GANRain) Skillful precipitation nowcasting using deep generative models of radar Nature 2021
    [Paper] [Code]

  • (MultiScaleGAN) Experimental Study on Generative Adversarial Network for Precipitation Nowcasting IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING 2022
    [Paper] [Code]

  • (STGM) Physical-Dynamic-Driven AI-Synthetic Precipitation Nowcasting Using Task-Segmented Generative Model AGU 2023
    [Paper] [Code]

  • (PCT-CycleGAN) PCT-CycleGAN: Paired Complementary Temporal Cycle-Consistent Adversarial Networks for Radar-Based Precipitation Nowcasting ACM 2023
    [Paper]

👉👉 Foundation Models

  • (ClimaX) ClimaX: A foundation model for weather and climate ICML 2023
    [Paper] [Code]

  • (W-MAE) W-MAE: Pre-trained weather model with masked autoencoder for multi-variable weather forecasting arXiv 2023
    [Paper] [Code]

  • (Aurora) Aurora: A foundation model of the atmosphere. arXiv 2024
    [Paper] [Code]

  • (Prithvi WxC) Prithvi WxC: Foundation Model for Weather and Climate arXiv 2024
    [Paper] [Code]

  • (AtmosArena) AtmosArena: Benchmarking Foundation Models for Atmospheric Sciences NeurIPS 2024 Workshop FM4Science
    [Paper] [Code]

👉👉 Multi-modal Weather Models

  • (ClimateBench-M) ClimateBench-M: A Multi-Modal Climate Data Benchmark with a Simple Generative Method CIKM 2025
    [Paper] [Code]

  • (TyphoFormer) TyphoFormer: Language-Augmented Transformer for Accurate Typhoon Track Forecasting arXiv 2025
    [Paper] [Code]

  • (Omni-Weather) Omni-Weather: Unified Multimodal Foundation Model for Weather Generation and Understanding arXiv 2025
    [Paper] [Code]

👉👉 Distillation for Climate & Weather

  • (DLESyM) Long-Range Distillation: Distilling 10,000 Years of Simulated Climate into Long Timestep AI Weather Models arXiv 2025
    [Paper] [Code]

👉👉 Agentic Models

  • (ClimateBench-M) Zephyrus: An Agentic Framework for Weather Science arXiv 2025
    [Paper] [Code]

Applications

Precipitation

  • Neural general circulation models for modeling precipitation. [Paper]
  • Deep Learning and the Weather Forecasting Problem: Precipitation Nowcasting. [Paper]
  • PRISMA: A systematic quantitative review on the performance of some of the recent short-term rainfall forecasting techniques. [Paper]
  • Deep Learning Techniques in Extreme Weather Events: A Review [Paper]
  • Analysis, characterization, prediction, and attribution of extreme atmospheric events with machine learning and deep learning techniques: a review [Paper]
  • Deep learning for precipitation nowcasting: A survey from the perspective of time series forecasting. [Paper]
  • Precipitation Nowcasting with Satellite Imagery. [Paper]
  • RainNet v1.0: a convolutional neural network for radar-based precipitation nowcasting. [Paper]
  • Convective Precipitation Nowcasting Using U-Net Model. [Paper]
  • NowCasting-Nets: Representation Learning to Mitigate Latency Gap of Satellite Precipitation Products Using Convolutional and Recurrent Neural Networks. [Paper]
  • Domain Generalization Strategy to Train Classifiers Robust to Spatial-Temporal Shift. [Paper]
  • Region-Conditioned Orthogonal 3D U-Net for Weather4Cast Competition. [Paper]
  • Skilful nowcasting of extreme precipitation with NowcastNet. [Paper]
  • Convolutional LSTM Network: A Machine Learning Approach for Precipitation Nowcasting. [Paper]
  • PredRNN: Recurrent Neural Networks for Predictive Learning using Spatiotemporal LSTMs. [Paper]
  • Nowformer: A Locally Enhanced Temporal Learner for Precipitation Nowcasting. [Paper]
  • Earthformer: Exploring Space-Time Transformers for Earth System Forecasting. [Paper]
  • Rainformer: Features Extraction Balanced Network for Radar-Based Precipitation Nowcasting. [Paper]
  • The MS-RadarFormer: A Transformer-Based Multi-Scale Deep Learning Model for Radar Echo Extrapolation. [Paper]
  • A Foundation Model for the Earth System. [Paper]
  • WeatherGFM: Learning A Weather Generalist Foundation Model via In-context Learning. [Paper]
  • AENN: A GENERATIVE ADVERSARIAL NEURAL NETWORK FOR WEATHER RADAR ECHO EXTRAPOLATION. [Paper]
  • MPL-GAN: Toward Realistic Meteorological Predictive Learning Using Conditional GAN. [Paper]
  • Skillful Radar-Based Heavy Rainfall Nowcasting Using Task-Segmented Generative Adversarial Network. [Paper]
  • A Self-Attention Causal LSTM Model for Precipitation Nowcasting. [Paper]
  • PCT-CycleGAN: Paired Complementary Temporal Cycle-Consistent Adversarial Networks for Radar-Based Precipitation Nowcasting. [Paper]
  • Precipitation Nowcasting Using Physics Informed Discriminator Generative Models. [Paper]
  • GPTCast: a weather language model for precipitation nowcasting. [Paper]
  • PreDiff: Precipitation Nowcasting with Latent Diffusion Models [Paper]
  • Latent diffusion models for generative precipitation nowcasting with accurate uncertainty quantification. [Paper]
  • DiffCast: A Unified Framework via Residual Diffusion for Precipitation Nowcasting. [Paper]
  • CasCast: Skillful High-resolution Precipitation Nowcasting via Cascaded Modelling. [Paper]

Air Quality

  • U-Air: when urban air quality inference meets big data. [Paper]
  • Forecasting of Air Quality Using an Optimized Recurrent Neural Network. [Paper]
  • Deep Distributed Fusion Network for Air Quality Prediction. [Paper]
  • Time Series Forecasting (TSF) Using Various Deep Learning Models. [Paper]
  • Group-Aware Graph Neural Network for Nationwide City Air Quality Forecasting. [Paper]
  • PM2.5-GNN: A Domain Knowledge Enhanced Graph Neural Network For PM2.5 Forecasting. [Paper]
  • AirFormer: Predicting Nationwide Air Quality in China with Transformers. [Paper]
  • MGSFformer: A Multi-Granularity Spatiotemporal Fusion Transformer for air quality prediction. [Paper]
  • Air Quality Prediction Using the Fractional Gradient-Based Recurrent Neural Network. [Paper]
  • Air quality prediction using CNN+LSTM-based hybrid deep learning architecture. [Paper]

Sea Surface Temperature

  • OptFormer: Optical Flow-Guided Attention and Phase Space Reconstruction for SST Forecasting. [Paper]
  • ENSO analysis and prediction using deep learning: A review. [Paper]
  • Analyzing El Niño–Southern Oscillation Predictability Using Long-Short-Term-Memory Models. [Paper]
  • Spatiotemporal Model Based on Deep Learning for ENSO Forecasts. [Paper]
  • Deep learning for multi-year ENSO forecasts. [Paper]
  • Forecasting the Indian Ocean Dipole With Deep Learning Techniques. [Paper]
  • Deep Residual Convolutional Neural Network Combining Dropout and Transfer Learning for ENSO Forecasting. [Paper]
  • DLENSO: A Deep Learning ENSO Forecasting Model. [Paper]
  • Graph Neural Networks for Improved El Niño Forecasting. [Paper]
  • Transformer for EI Niño-Southern Oscillation Prediction. [Paper]
  • A self-attention–based neural network for three-dimensional multivariate modeling and its skillful ENSO predictions. [Paper]
  • Spatial-temporal transformer network for multi-year ENSO prediction. [Paper]
  • Adaptive Graph Spatial-Temporal Attention Networks for long lead ENSO prediction. [Paper]
  • ENSO dataset & comparison of deep learning models for ENSO forecasting. [Paper]
  • Global Spatiotemporal Graph Attention Network for Sea Surface Temperature Prediction. [Paper]
  • Physical Knowledge-Enhanced Deep Neural Network for Sea Surface Temperature Prediction. [Paper]
  • Explainable deep learning for insights in El Niño and river flows. [Paper]
  • Data-driven multi-step prediction and analysis of monthly rainfall using explainable deep learning. [Paper]
  • An Interpretable Deep Learning Approach for Detecting Marine Heatwaves Patterns. [Paper]
  • 3D-Geoformer: A self-attention–based neural network for three-dimensional multivariate modeling and its skillful ENSO predictions. [Paper]

Flood

  • Evaluation of artificial intelligence models for flood and drought forecasting in arid and tropical regions. [Paper]
  • Flood forecasting with machine learning models in an operational framework. [Paper]
  • Particle swarm optimization based LSTM networks for water level forecasting: A case study on Bangladesh river network. [Paper]
  • Prediction of Flow Based on a CNN-LSTM Combined Deep Learning Approach. [Paper]
  • Designing Deep-Based Learning Flood Forecast Model With ConvLSTM Hybrid Algorithm. [Paper]
  • Improving urban flood prediction using LSTM-DeepLabv3+ and Bayesian optimization with spatiotemporal feature fusion. [Paper]
  • The Merit of River Network Topology for Neural Flood Forecasting. [Paper]
  • FloodGNN-GRU: a spatio-temporal graph neural network for flood prediction. [Paper]
  • Graph Transformer Network for Flood Forecasting with Heterogeneous Covariates. [Paper]
  • FIDLAR: Forecast-Informed Deep Learning Architecture for Flood Mitigation. [Paper]
  • Data-driven and knowledge-guided denoising diffusion model for flood forecasting. [Paper]
  • DRUM: Diffusion-based runoff model for probabilistic flood forecasting. [Paper]
  • Generalizing rapid flood predictions to unseen urban catchments with conditional generative adversarial networks. [Paper]

Drought

  • Drought as a natural hazard: concepts and definitions. [Paper]
  • Drought predic- tion based on spi and spei with varying timescales using lstm recurrent neural network. [Paper]
  • Explain- able ai in drought forecasting. [Paper]
  • Drought prediction based on feature-based transfer learning and time series imaging. [Paper]
  • Application of a hybrid arima-lstm model based on the spei for drought forecasting. [Paper]
  • Evaluation of artificial intelligence models for flood and drought forecasting. [Paper]
  • Deep learning oriented satellite remote sensing for drought and prediction in agriculture. [Paper]
  • Forecasting the propagation from meteorolog- ical to hydrological and agricultural drought in the huaihe river basin with machine learning methods. [Paper]
  • Multivariate time series convo- lutional neural networks for long-term agricultural drought prediction under global warming. [Paper]
  • A novel intelligent deep learning predictive model for meteorological drought forecasting. [Paper]
  • Harnessing deep learning for meteorological drought forecasts in the northern cape, south africa. [Paper]
  • Construction of an integrated drought monitoring model based on deep learning algorithms. [Paper]
  • Drought prediction using artificial intelligence models based on climate data and soil moisture. [Paper]
  • Advanced stacked integration method for forecasting long-term drought severity: Cnn with machine learning models. [Paper]
  • Multiscale spatiotemporal meteorological drought prediction: A deep learning approach. [Paper]
  • Deep learning-oriented c-gan models for vegetative drought prediction on peninsular india. [Paper]

Tropical Storms/Cyclones and Hurricanes

  • Predicting tropical cyclogenesis us- ing a deep learning method from gridded satellite and era5 reanalysis data in the western north pacific basin. [Paper]
  • Predicting tropical cyclone formation with deep learning. [Paper]
  • Improvement in forecasting short-term tropical cyclone intensity change and their rapid intensification using deep learning. [Paper]
  • Tropical cyclone track forecasting using fused deep learning from aligned reanalysis data. [Paper]
  • A novel data-driven tropical cyclone track prediction model based on cnn and gru with multi-dimensional feature selection. [Paper]
  • Near real-time hurricane rainfall forecasting using convolutional neural network models with integrated multi-satellite retrievals for gpm (imerg) product. [Paper]
  • Advanced hybrid cnn-bilstm model augmented with ga and ffo for enhanced cyclone intensity forecasting. [Paper]
  • Forecasting formation of a tropical cyclone using reanalysis data. [Paper]
  • Predicting landfall’s location and time of a tropical cyclone using reanalysis data. [Paper]
  • Deep learning for down-scaling tropical cyclone rainfall to hazard-relevant spatial scales. [Paper]
  • Tropical cyclone forecast using multitask deep learning framework. [Paper]
  • Forecasting tropical cyclones with cascaded diffusion models. [Paper]
  • Advancing storm surge forecasting from scarce observation data: A causal-inference based spatio-temporal graph neural network approach. [Paper]

Wildfire

  • Wildfire spreading prediction using multimodal data and deep neural network approach. [Paper]
  • Deep learning provides substantial improvements to county-level fire weather forecasting over the western united states. [Paper]
  • Next-day wildfire spread: A machine learning dataset to predict wildfire spreading from remote-sensing data. [Paper]
  • The large-scale wildfire spread prediction using a multi-kernel convolutional neural network. [Paper]
  • Comparison of recurrent neural network architectures for wildfire spread modeling. [Paper]
  • Comparative analysis of hybrid long short-term memory models for fire danger index forecasting with weather data. [Paper]
  • A comparative study on forest fire prediction using arima, sarima, lstm, and gru methods. [Paper]
  • Cnn-bilstm: A novel deep learning model for near-real-time daily wildfire spread prediction. [Paper]
  • Firepred: A hybrid multi-temporal convolutional neural network model for wildfire spread prediction. [Paper]
  • Recurrent convolutional deep neural networks for modeling time-resolved wildfire spread behavior. [Paper]
  • Spatiotemporal attention convlstm networks for predicting and physically interpreting wildfire spread. [Paper]
  • Capturing and interpreting wildfire spread dynamics: attention-based spatiotemporal models using convlstm networks. [Paper]
  • Mitigating greenhouse gas emissions through generative adversarial networks based wildfire prediction. [Paper]
  • Modeling wildfire spread with an irregular graph network. [Paper]
  • Forest fire prediction based on time series networks and remote sensing images. [Paper]

Renewable Energy

  • SolarSeer: Ultrafast and accurate 24-hour solar irradiance forecasts outperforming numerical weather prediction across the USA.[Paper][Code]
  • Data-driven solar forecasting enables near-optimal economic decisions. [Paper]
  • Data-driven Surface Solar Irradiance Estimation using Neural Operators at Global Scale. [Paper]
  • A machine learning model for hub-height short-term wind speed prediction. [Paper]

Others

  • ChatClimate: Grounding conversational AI in climate science, in Nature Communications Earth & Environment 2023. [Paper]
  • spateGAN: Spatio-Temporal Downscaling of Rainfall Fields Using a cGAN Approach, Earth and Space Science 2023. [Paper]
  • OceanGPT: A Large Language Model for Ocean Science Tasks, in arXiv 2023. [paper] [official code]
  • IBMWeatherGen: Stochastic Weather Generator Tool, [Code]

Citation

@article{shi2025deep,
  title={Deep learning and foundation models for weather prediction: A survey},
  author={Shi, Jimeng and Shirali, Azam and Jin, Bowen and Zhou, Sizhe and Hu, Wei and Rangaraj, Rahuul and Wang, Shaowen and Han, Jiawei and Wang, Zhaonan and Lall, Upmanu and others},
  journal={arXiv preprint arXiv:2501.06907},
  year={2025}
}

About

Curated papers on survey paper "deep learning and foundation models for weather prediction".

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors