Skip to content

geo-smart/GeoSmart_ML_Tutorial

 
 

Repository files navigation

mountain

Geosmart Use Case Jupyter Book

Deploy Jupyter Book Badge Binder GeoSMART Use Case GitHub GitHub top language GitHub repo size

Deep Learning national scale 1 km resolution SWE prediction model

Snow-derived water is a critical hydrological component for characterizing the quantity of water available for domestic, recreation, agriculture, and power generation in the western United States. Advancing the efficiency and optimization of these aspects of water resources management requires an enhanced characterization of the snow state variable, particularly the essential global inputs of snow-water-equivalent (SWE), peak SWE, and snowmelt onset for hydrological models. While physically-based models that characterize the feedbacks and interactions between influencing factors predict SWE values well in homogeneous settings, these models exhibit limitations for CONUS-scale deployment due to challenges attributed to spatial resolution, landscape heterogeneity, and computational intensity. Leveraging a collaborative partnership between the Alabama Water Institute (AWI) at the University of Alabama (UA) and the University of Utah (UU), we address these limitations through the National Snow Model (NSM) as a full stack data-driven machine learning (ML) platform with a modular structure to account for the heterogeneity of climate and topographical influences on SWE across the western United States. This model consists of twenty-three regionally specific sub-models tailored to the unique topography and hydroclimate phenomena in the Western U.S., exhibiting an RMSE less than 8 cm and a coefficient of determination approaching 0.99 on predictions spanning the 2013-2017 training period. The NSM pipeline assimilates nearly 700 snow telemetry (SNOTEL) and California Data Exchange Center (CDEC) sites and combines with processed lidar-derived terrain features for the prediction of a 1 km x 1 km SWE inference in critical snowsheds under 20 minutes on personal computer. We complete the full stack product by leveraging the Tethys interface (TBA), supporting the interactive use of the results (i.e., individual to HUC-scale SWE estimates). With preliminary regional testing performance ranging between 2.5 cm to 8 cm (i.e., RMSE), this model has the potential to advance the snow state variable in hydrological models such as the National Water Model to improve the estimates of peak flow for flood management and low-flows for supply operations. This readme describes the necessary Python dependencies, training sources, and instructions to get the ML model running for near-real-time SWE inference.

ML_SWE-2.jpg

About the GeoSMART Snow ML Use Case Library

GeoSMART Use Case

General Overview

This use case library contains a summary of the motivation behind the project, applicable ML methods and tools, the methods for data preparation, model development including the model training framework and evaluation, workflow management demonstrated using GeoWeaver, links to the complete model on GitHub as the model is data-intensive the tutorial is for a subset of the entire model, a discussion/conclusion, and a solitication to qestions. Below are the respective chapters addressing these items:

  1. Motivation
  2. Machine Learning Methods and tools
  3. Data Preparation
  4. Model Development and Parameter Tuning
  5. Model Training
  6. Evaluation of Model Performance
  7. Workflow Management and Cloud Computing
  8. Reproducibility
  9. Discussion/Conclusion
  10. Solicitation to Questions

Contributing Content

Tutorial content supports the data processing, model training, and evaluation of the National Snow model for a subregion of Colorado. The goal is to provide get new users running executable code on our AWS platform.

Project support through the University of Alabama and the University of Utah

UU UA

Releases

No releases published

Packages

No packages published

Languages

  • Jupyter Notebook 84.2%
  • HTML 15.8%