SnowGalileo: A Pre-trained Transformer for Snow Cover Mapping

This repository contains the code for pre-training, fine-tuning, and evaluating the ESA AI4Snow model "SnowGalileo". SnowGalileo is a transformer model for daily fractional snow cover (FSC) mapping at 100 m resolution, based on multi-sensor Earth observation data.

To reproduce the figures in the accompanying paper, please visit paper_visualizations/.

Python Version

This project uses Python 3.11 and relies on a Makefile for standardized, reproducible commands.

You can read more about the makefile here.

Package & Environment Management

Environment & Dependency Management: uv is the recommended default tool for fast, reliable dependency installation and virtual environment creation. It can be configured to use Poetry or conda via Makefile.variables.
- When we mention conda in this project, we generally mean mamba or micromamba See Mamba documentation
Configuration: Review the project-level configurations in Makefile.variables or set individual preferences in Makefile.private.

Quick Start

First, make sure that either the project's Makefile.variables or Makefile.private your choice of configuration.

You can review your current active configurations using this command:

make info

You can list the available targets using this command:

make targets

Tool-Specific Setup

Select your preferred development stack below. Ensure your Makefile.variables are configured to match your choice.

1. Configure Your Stack

Adjust the variables in Makefile.private to match your desired setup if they differ from the project's default configuration found in Makefile.variables (do this with care and only if necessary):

Desired Stack	`DEFAULT_BUILD_TOOL`	`DEFAULT_INSTALL_ENV`
uv (Default/Recommended)	`uv`	`uv`
Poetry (Standard)	`poetry`	`poetry`
Poetry + Conda	`poetry`	`conda`
Poetry + Venv	`poetry`	`venv`

2. Install System Tools

If needed, run the command corresponding to your chosen stack to install the necessary system tools (e.g., uv, poetry, or mamba).

Stack: uv

make uv-install

Stack: Poetry

make poetry-install

Stack: Poetry + Conda

# Install both the package manager and environment manager
make mamba-install
make poetry-install

Installing the Project

Once your tools are configured and installed, run the universal install command. This will create the environment and install all dependencies defined in pyproject.toml.

make install

Activating the Environment

# Works for uv, poetry, and conda configurations
eval $(make <tool>-activate)

Examples:

uv: eval $(make uv-activate)
poetry: eval $(make poetry-activate)
conda: eval $(make conda-activate)

Note: You can also view environment details (path, python version, etc.) by running make <tool>-env-info (poetry and conda only - uv does not provide this functionality).

Project Usage

How to Run Pre-training

For pre-training SnowGalileo,

Download pretrain_inputs_h5pys.tar.xz from here. Extract the files and place them into data/h5pys_pretrain/.
Run python -m scripts.pretrain --h5pys_only. Set --output_folder to where the pre-training checkpoint should be stored.

How to Run Fine-Tuning

For fine-tuning SnowGalileo on clear-sky data,

Download finetune_inputs_h5pys.tar.xz from here. Extract the files and place them into data/fsc_train_balanced_h5pys/. From the same Zenodo repository, download the FSC labels used as ground truth (finetune_labels_tifs.tar.xz) and place them into data/fsc_train_100m_masks_balanced/.
Run python -m scripts.finetune --checkpointing --h5pys_only. Set --pretraining_checkpoint_folder to where the pre-training checkpoint is stored (can be downloaded from the folder checkpoints_snowgalileo_pretrain/ from here). Set --exclude_prediction_high_res if you want to fine-tune the model without high-resolution satellite data (Landsat and Sentinel-2) on the prediction day. After fine-tuning, the final checkpoint will stored in logging_checkpoints/.

For fine-tuning SnowGalileo on cloudy data,

Download finetune_inputs_with_clouds_h5pys.tar.xz from here. Extract the files and place them into data/fsc_more_clouds_timeseries_h5pys/. If not done already, from the same Zenodo repository, download the FSC labels used as ground truth (finetune_labels_tifs.tar.xz) and place them into data/fsc_train_100m_masks_balanced/.
Run python -m scripts.finetune_with_clouds --checkpointing --h5pys_only. Set --pretraining_checkpoint_folder to where the pre-training checkpoint is stored (can be downloaded from the folder checkpoints_snowgalileo_pretrain/ from here). Set --exclude_prediction_high_res if you want to fine-tune the model without high-resolution satellite data (Landsat and Sentinel-2) on the prediction day. After fine-tuning, the final checkpoint will stored in logging_checkpoints/.

How to Run Evaluation Experiments

SnowGalileo can be evaluated using data from either the Canadian Rockies or the Swiss Alps.

For evaluating SnowGalileo on clear-sky data,

Download evaluate_[region]_inputs_h5pys.tar.xz from here. Extract the files and place them into data/fsc_test_[region]_h5pys/. From the same Zenodo repository, download the FSC labels used as ground truth (evaluate_[region]_labels_tifs.tar.xz) and place them into data/fsc_test_[region]_100m_masks/.
Run python -m scripts.eval_only --eval_config_name "fsc_test_[region]_tiny.json" --h5pys_only. Set --checkpoint_name to the name of the SnowGalileo checkpoint that should be evaluated (options can be downloaded from the folder checkpoints_snowgalileo_finetune/ from here and should be stored in logging_checkpoints/). Set --exclude_prediction_high_res if you want to evaluate the model without high-resolution satellite data (Landsat and Sentinel-2) on the prediction day. Evaluation results will be stored in results/.

Replace [region] with either rockies for the Canadian Rockies, or switzerland for the Swiss Alps.

For evaluating SnowGalileo on cloudy data,

Download evaluate_[region]_inputs_with_clouds_h5pys.tar.xz from here. Extract the files and place them into data/fsc_test_[region]_full_clouds_h5pys/. If not done already, from the same Zenodo repository, download the FSC labels used as ground truth (evaluate_[region]_labels_tifs.tar.xz) and place them into data/fsc_test_[region]_100m_masks/.
Run python -m scripts.eval_with_clouds --eval_config_name "fsc_test_[region]_full_clouds_tiny.json" --h5pys_only. Set --checkpoint_name to the name of the SnowGalileo checkpoint that should be evaluated (options can be downloaded from the folder checkpoints_snowgalileo_finetune/ from here and should be stored in logging_checkpoints/). Set --exclude_prediction_high_res if you want to evaluate the model without high-resolution satellite data (Landsat and Sentinel-2) on the prediction day. Evaluation results will be stored in results/.

Replace [region] with either rockies for the Canadian Rockies, or switzerland for the Swiss Alps.

How to Run Inference on your own Points

This repository can be used to run FSC inference on your own points. More detailed documentation will be available in the future.

Further Usage

Information about exporting input data using Google Earth Engine can be found in the file data/README.md. This repository also contains code for training and evaluating baseline models (random forest, MLP and support vector regressor), as well as for running machine learning ablation experiments. More detailed documentation will be available in the future.

Environment & Portability Note

This template is designed for reproducibility using the uv.lock file.

Funding

We are greatful to the ESA AI4Science 4000143295/23/I-DT grant that made this project possible.

Acknowledgements

This repository builds upon the codebase of Galileo, which is licensed under the MIT license. If you use this repository, please also cite the Galileo paper:

@misc{tseng2025galileolearninggloballocal,
      title={Galileo: Learning Global and Local Features in Pretrained Remote Sensing Models},
      author={Gabriel Tseng and Anthony Fuller and Marlena Reil and Henry Herzog and Patrick Beukema and Favyen Bastani and James R. Green and Evan Shelhamer and Hannah Kerner and David Rolnick},
      year={2025},
      eprint={2502.09356},
      archivePrefix={arXiv},
      primaryClass={cs.CV},
      url={https://arxiv.org/abs/2502.09356},
}

This repository also uses the SatelliteCloudGenerator. If you use functionality based on this package, please cite:

@Article{rs15174138,
  author = {Czerkawski, Mikolaj and Atkinson, Robert and Michie, Craig and Tachtatzis, Christos},
  title = {SatelliteCloudGenerator: Controllable Cloud and Shadow Synthesis for Multi-Spectral Optical Satellite Images},
  journal = {Remote Sensing},
  volume = {15},
  year = {2023},
  number = {17},
  article-number = {4138},
  url = {https://www.mdpi.com/2072-4292/15/17/4138},
  issn = {2072-4292},
  doi = {10.3390/rs15174138}
}

This README is using resources, templates, and documentation by RolnickLab.

We gratefully acknowledge all original authors and contributors for making their code openly available.

Name		Name	Last commit message	Last commit date
Latest commit History 4,293 Commits
.github/workflows		.github/workflows
.make		.make
configs		configs
data		data
docs		docs
logging_checkpoints		logging_checkpoints
notebooks		notebooks
paper_visualizations		paper_visualizations
results		results
scripts		scripts
src		src
tests		tests
.env.example		.env.example
.gitignore		.gitignore
.mdformat.toml		.mdformat.toml
.pre-commit-config.yaml		.pre-commit-config.yaml
.python-version		.python-version
Dockerfile		Dockerfile
Makefile		Makefile
Makefile.private.example		Makefile.private.example
Makefile.targets		Makefile.targets
Makefile.variables		Makefile.variables
README.md		README.md
noxfile.py		noxfile.py
pyproject.toml		pyproject.toml
uv.lock		uv.lock

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

SnowGalileo: A Pre-trained Transformer for Snow Cover Mapping

Python Version

Package & Environment Management

Quick Start

Tool-Specific Setup

1. Configure Your Stack

2. Install System Tools

Installing the Project

Activating the Environment

Project Usage

How to Run Pre-training

How to Run Fine-Tuning

How to Run Evaluation Experiments

How to Run Inference on your own Points

Further Usage

Environment & Portability Note

Funding

Acknowledgements

About

Uh oh!

Releases

Packages

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Uh oh!

Folders and files

Latest commit

History

Repository files navigation

SnowGalileo: A Pre-trained Transformer for Snow Cover Mapping

Python Version

Package & Environment Management

Quick Start

Tool-Specific Setup

1. Configure Your Stack

2. Install System Tools

Installing the Project

Activating the Environment

Project Usage

How to Run Pre-training

How to Run Fine-Tuning

How to Run Evaluation Experiments

How to Run Inference on your own Points

Further Usage

Environment & Portability Note

Funding

Acknowledgements

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Packages