Skip to content

saiga143/google-v3-buildings-downloader

Repository files navigation

🏗️ Efficient Download and Extraction of Google Buildings V3 Data

Detailed blogpost explaining this GitHub repo and scripts can be read here: https://sola.kau.se/deprimap/2025/07/23/google-v3-download/

This repository provides a clean two-step pipeline to download and extract building footprint data from Google Open Buildings V3, using only the tiles and geometries that intersect your region of interest (ROI).


🌍 Why This Workflow?

Google’s V3 building dataset is powerful — but downloading and processing all ~178 GB is unnecessary for most use cases. This repo solves that by:

✅ Downloading only the tiles that intersect your ROI
✅ Processing those tiles in chunks, extracting only relevant buildings
✅ Saving output as clean .gpkg files ready for spatial analysis


🧭 Workflow Overview

Step 1: Identify and Download Tiles

  • Input: ROI polygons (can be single or multiple disjoint features)
  • Output: Only those .geojson.gz tiles that intersect your ROI
  • 📎 Notebook: notebook1_download_tiles.ipynb

step1 selecting tiles

The example shows Algeria as an example with multiple polygons spread across the country as an ROI


Step 2: Filter and Save Buildings

⚠️ Although the tile files have a .geojson.gz extension, they are actually CSVs with WKT geometries — not true GeoJSON. This notebook handles that for you.

step2 filtering buildings


🗺️ Example Use Case

You can use this pipeline for:

  • Extracting buildings in disjoint polygon boundaries
  • Urban change studies (morphometric studies)
  • Lightweight local modelling
  • Avoiding unnecessary storage/processing overhead

🙏 Acknowledgements

This code was developed as part of the DEPRIMAP project for large-scale urban deprivation analysis.

image

About

Efficient pipeline to download and extract building footprints from Google’s Open Buildings V3 dataset, using only tiles and geometries that intersect with your region of interest (ROI). Includes two Jupyter notebooks. Saves time, storage, and processing by avoiding the full 178 GB global dataset.

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors