News@Unidata

Running Pretrained AI-NWP Models, Our Experience at NSF Unidata on Jetstream2

2025-05-07T18:31:09+00:00

By Thomas Martin, Ana Espinoza, Julien Chastang, and Drew Camron

Wind output from AI-NWP. (Click to enlarge.)

At NSF Unidata, we have successfully implemented and re-used weights from several global AI-NWP (Artificial Intelligence-Numerical Weather Prediction) models (FourCastNet, Pangu) using the NVIDIA earth2mip package. We can confirm that these models are open source and can be reused on high-end, but increasingly standard, HPC hardware. While traditional numerical weather prediction requires massive supercomputing resources, these AI models can potentially deliver similar or better results using standard GPU hardware for inference. The training of these AI-NWP models still requires immense GPU resources. The process demanded careful consideration of computational resources, model architecture adaptations, and optimization strategies, as detailed below, and has opened up new possibilities for research institutions to run state-of-the-art weather predictions without access to supercomputing facilities. Below, we lay out some of the roadblocks we encountered and how to address them, hoping to smooth the path for others looking to implement these pre-trained models.

Need a Large-ish GPU!

A standard desktop GPU often won't suffice for loading and running these complex models. Jetstream2 is a U.S. National Science Foundation (NSF) funded cloud computing system designed to provide on-demand, interactive, and programmatic cyberinfrastructure for research and education, featuring advanced AI capabilities, virtual GPUs, and high-performance storage to support a broad range of scientific and engineering applications. We quickly ran into memory limitations when using GPUs with under 10GB of VRAM. Our successful implementation relied on a Jetstream2 g3.large instance equipped with a 20GB GPU (see Instance Flavors in the Jetstream2 Documentation), and even then encountered limits with specific models. Cloud-based serverless options like Modal are also viable, allowing you to access the necessary resources on demand.

State-of-the-art AI-NWP models can contain billions of parameters, with some requiring upwards of 16GB of GPU memory just to load the weights before computation begins. Additionally, at least 100GB of hard drive space is recommended for storing model weights and associated data. The initial loading process can be surprisingly time-consuming, often taking several minutes as the weights are transferred from storage to the GPU's memory. We also discovered that having sufficient memory isn't the whole story — efficient memory management is crucial. Strategies like clearing the CUDA cache, setting memory limits, and using mixed precision (FP16) are essential for preventing out-of-memory errors and ensuring smooth execution. At the end of this blog, there is a link to an NSF Unidata repository that has some helpful files and scripts for your own implementation of the earth2mip project.

AI/ML Python Packaging is Still a Mess

These products represent the cutting edge of Earth Systems Science, where APIs and datasets evolve rapidly to incorporate new research findings and methodological advances. The AI/ML Python packaging ecosystem is improving significantly week by week — a notable example being the recent ability to package PyTorch and CUDA on Windows — but the implementation process remains intricate and heavily dependent on specific hardware configurations, firmware versions, and package compatibilities in ways that exceed the typical requirements of scientific software. The emergence of modern Python package installers like uv (from Astral) offers promising solutions to these dependency challenges, as it provides dramatically faster installations and more reliable dependency resolution compared to traditional tools like pip. While we share our implementation approach below, we anticipate that these instructions may need frequent updates to remain current with the fast-paced developments in both AI/ML technologies and earth systems modeling. This rapid evolution means that successful implementation often requires staying closely connected with the broader AI/ML and Earth Systems communities to track breaking changes and emerging best practices.

Input Data Sourcing and Pre-processing

Access to specialized meteorological data sources is required for many AI-NWP frameworks. Specifically, the ECMWF ai-models repository requires direct MARS (Meteorological Archival and Retrieval System) access, which is typically only available to ECMWF member states and licensed institutions. While other data access has improved through tools like the CDS API for ERA5 and the earth2mip package's data preprocessing pipeline, significant challenges remain. This differs significantly from simpler machine learning approaches like random forest models that can work directly with local CSV or NetCDF files. Organizations must obtain appropriate credentials (ECMWF MARS access for ai-models, or CDS API credentials for ERA5) and become familiar with meteorological file formats like GRIB2 and their associated libraries (ecCodes, cfgrib) to properly load and preprocess the data. Additionally, downloading and storing the required ERA5 variables can demand several terabytes of storage space, depending on the temporal and spatial resolution needed.

Hope for the Future

This research and workflow is at the frontier of large-scale weather prediction, where artificial intelligence meets traditional numerical methods. While today's packages and workflows require careful handling and specific expertise, we're seeing small signs of maturation in the field. The community's growing adoption of these AI-driven approaches is steadily leading to more robust implementations and better documentation. We've already witnessed significant improvements in model packaging, data access, and installation processes — suggesting that what seems complex today may become routine practice tomorrow. This research is the frontier of large scale weather prediction.

While these packages and workflows are delicate today, we predict that models and methods that are used more in the community will get more support to smooth out the rough edges. Already we have seen improvement on this front with recent work from our CIRA colleagues using ai-models package package supported by ECMWF. Jacob Radford (https://github.com/jacob-radford) of CIRA created a great google colab notebook titled Running AI Weather Prediction (AIWP) models.

If You Just Want the Data

Our colleagues at CSU Fort Collins serve up these AI model outputs, in an accessible S3 bucket. Their work is described in Accelerating Community-Wide Evaluation of AI Models for Global Weather Prediction by Facilitating Access to Model Output.
and the data are available at https://noaa-oar-mlwp-data.s3.amazonaws.com/index.html.

Map display created by MetPy.

We want to thank them for doing this service for the community. At the 2025 American Meteorological Society Annual Meeting we saw more than a few presentations using this data archive for interesting research. Much more to do in this space!

Use it in MetPy

With the release of MetPy 1.7, we can access this data using the code below:

from datetime import datetime

from metpy.plots import MapPanel, PanelContainer, RasterPlot
from metpy.remote import MLWPArchive

###################
# Access the GraphCast forecast closest to the desired date/time
dt = datetime(2025, 3, 19, 00)  # Target datetime in UTC
# MLWPArchive accesses Machine Learning Weather Prediction models from NOAA's data archive
# get_product retrieves GraphCast data for the specified datetime
ds = MLWPArchive().get_product('graphcast', dt).access()

###################
# Plot the data using MetPy's simplified plotting interface.
raster = RasterPlot()
raster.data = ds  # Assign xarray Dataset
raster.field = 't2'  # Plot 2-meter temperature field
raster.time = dt  # Set valid time
raster.colorbar = 'horizontal'  # Position colorbar
raster.colormap = 'RdBu_r'  # Red-Blue reversed colormap (blue=cold, red=warm)

panel = MapPanel()
panel.area = 'co'  # Set geographic area to Colorado
panel.projection = 'lcc'  # Lambert Conformal Conic projection
panel.layers = ['coastline', 'borders', 'states']  # Add map features
panel.plots = [raster]  # Add raster plot to panel
panel.title = f"{ds[raster.field].attrs['long_name']} @ {dt}"  # Title with field name and time

pc = PanelContainer()
pc.size = (8, 8)  # Figure size in inches
pc.panels = [panel]  # Add panel to container
pc.draw()  # Render the figure
pc.show()  # Display the figure

Parting Thoughts

The dramatic computational efficiency gains demonstrated by AI-based weather prediction models like GraphCast and FourCastNet — showing orders of magnitude speedup over traditional physics-based models — make a compelling case for their operational integration. The hydrology community is doing something similar with GPU-based inference for complex simulations (Bennett et al. 2022); these learnings will only expand across the Earth Systems Sciences.

However, AI-NWP models remain dependent on traditional numerical models for training data, creating an interesting symbiotic relationship. While computationally expensive to run, physics-based models provide the essential foundation for training more efficient AI emulators. The path forward likely involves a hybrid approach where AI models handle routine operational forecasting on GPUs, while traditional models advance our understanding and generate training data.

The key challenges ahead involve thorough validation across different weather regimes and scales to build trust in the meteorological community. This requires continued collaboration between AI and weather prediction experts to ensure these models maintain physical consistency while capitalizing on their computational advantages. Finding this balance between efficiency and reliability will be crucial for successfully integrating AI-NWP into operational systems.

NSF Unidata-built Resources

In the Unidata/MLscratchpad GitHub repository, we have some of the resources that we used for this initial test. While it’s not a complete end-to-end implementation, it might help you get over the line. As always, feel free to get in touch with us via our support channels for more customized assistance.

References

Bauer, Peter. "What if? Numerical weather prediction at the crossroads." Journal of the European Meteorological Society 1 (2024): 100002.

Bennett, Andrew, Hoang Tran, Luis De la Fuente, Amanda Triplett, Yueling Ma, Peter Melchior, Reed M. Maxwell, and Laura E. Condon. "Spatio‐temporal machine learning for regional to continental scale terrestrial hydrology." Journal of Advances in Modeling Earth Systems 16, no. 6 (2024): e2023MS004095.

Bi, Kaifeng, Lingxi Xie, Hengheng Zhang, Xin Chen, Xiaotao Gu, and Qi Tian. "Pangu-weather: A 3d high-resolution model for fast and accurate global weather forecast." arXiv preprint arXiv:2211.02556 (2022).

Kurth, Thorsten, Shashank Subramanian, Peter Harrington, Jaideep Pathak, Morteza Mardani, David Hall, Andrea Miele, Karthik Kashinath, and Anima Anandkumar. "Fourcastnet: Accelerating global high-resolution weather forecasting using adaptive fourier neural operators." In Proceedings of the platform for advanced scientific computing conference, pp. 1-11. 2023.

Radford, Jacob T., Imme Ebert-Uphoff, Jebb Q. Stewart, Kate D. Musgrave, Robert DeMaria, Natalie Tourville, and Kyle Hilburn. "Accelerating Community-Wide Evaluation of AI Models for Global Weather Prediction by Facilitating Access to Model Output." Bulletin of the American Meteorological Society 106, no. 1 (2025): E68-E76.

Thomas Martin is an AI/ML Software Engineer at the NSF Unidata Program Center. Have questions? Contact support-ml@unidata.ucar.edu or book an office hours meeting with Thomas on his Calendar.

Book Review - StatQuest Guide to Neural Networks

2025-02-10T20:24:42+00:00

By Thomas Martin

The StatQuest Illustrated Guide to Neural Networks and AI: With hands-on examples in PyTorch!!! (available from Amazon through the StatQuest store) strikes an excellent balance between accessibility and technical depth. Josh Starmer, PhD, builds on his previous work while making neural networks approachable for both students and practitioners. This book has a similar feel and vibe to the previous book, The StatQuest guide to Machine Learning.

What makes this book particularly valuable is its comprehensive coverage of architectures relevant to real-world Earth Science applications. The sections on Long Short-Term Memory networks (LSTMs) are essential for anyone working with time series data — like forecasting streamflows in hydrology. Similarly, the coverage of Convolutional Neural Networks (CNNs) provides fundamental knowledge for working with image data, from satellite imagery classification to computer vision applications.

Each chapter comes with practical, modern PyTorch exercises that bridge theory and implementation. The book progresses logically from fundamental concepts through to advanced topics like transformers, making it accessible for those transitioning from traditional statistical methods to deep learning approaches. Starmer's explanations of concepts like backpropagation, cross-entropy, and attention mechanisms are clear and practical, enhanced by his fun illustration style that helps build intuition.

For readers seeking to deepen their mathematical understanding, two excellent free online resources complement Starmer's work. Mathematics for Machine Learning by Marc Peter Deisenroth, A. Aldo Faisal, and Cheng Soon Ong provides rigorous coverage of the underlying mathematical concepts, while An Introduction to Statistical Learning with Python by Gareth James, Daniela Witten, Trevor Hastie, Robert Tibshirani, and Jonathan Taylor offers a comprehensive foundation in statistical and machine learning methods. These resources are particularly valuable for students and professionals who want to further understand the mathematical principles behind neural network architectures and optimization techniques.

Thomas Martin is an AI/ML Software Engineer at the NSF Unidata Program Center. Have questions? Contact support-ml@unidata.ucar.edu or book an office hours meeting with Thomas on his Calendar.

GOES Rebroadcast User Survey

2024-09-16T09:10:00+00:00

In a message from NOAA's Satellite Products and Services Division, the Direct Readout Program Manager has asked users of the GOES Rebroadcast system (GRB) for feedback on their use of the system, saying “NOAA has begun the preliminary stages of defining the requirements for the GEOXO mission. Your input is needed in creating a broadcast that meets and/or exceeds user expectations. Please take time to complete the GRB User Survey.”

Those operating GRB receiving stations are encouraged to answer the GRB User Survey 2024 (a Google form). From the survey introduction:

The purpose of this survey is to get a more accurate picture of the GOES Re-Broadcast Users' capabilities, footprint, and what sources are being utilized to retrieve their environmental data. A key benefit of collecting this information will be NESDIS/ OSPO's increased ability to assist in protecting our User's uninterrupted access to critical weather information.

What is GOES Rebroadcast?

Raw environmental sensing information collected by GOES-R Series satellites is transmitted to Earth as a digital data stream. The ground system software at Wallops Command and Data Acquisition Station (WCDAS) processes this Level 0 data and creates Level 1b products. For example, the measurements from the Advanced Baseline Imager (ABI) instrument are converted to units of radiance and calibrated, navigated, and remapped to a fixed grid. The antenna at WCDAS transmits the products to the GOES-R Series satellite for relay through the satellite's GOES Rebroadcast (GRB) transponder and L-Band antenna to GRB receiving stations, including GRB receive stations at the NOAA Satellite Operations Facility (NSOF) and a variety of non-NOAA stations. For more information, see GOES Rebroadcast (GRB).

What is GEOXO?

GeoXO is the next generation of NOAA Geostationary Observation Satellites for Weather, Climate, Atmospheric, and Oceanographic observations, following the GOES-R series with a first launch in 2032. By 2034/2035 NOAA will replace the GRB service with an equivalent service for GeoXO using a new dissemination architecture.

Does this Concern Me?

If you operate a GRB receiving station or otherwise rely on GRB data, or are just a general GOES data user, you may want to provide your feedback to NOAA so that they can develop a clear picture of the needs of the GRB and GOES data user community.

netCDF vs Zarr, an Incomplete Comparison

2024-09-09T14:54:57+00:00

By Thomas Martin and Ward Fisher

Visualization created efficiently from netCDF data using kerchunk.

At NSF Unidata, we have been supporting and developing netCDF standards and packages since the original release of netCDF in 1990. We strongly believe in the usefulness of netCDF Common Data Model for Earth Systems Science data, and for other types of data! NetCDF files can be used efficiently in machine learning modeling applications (see Loading NetCDFs in TensorFlow by Noah Brenowitz) and can be used as a virtual Zarr dataset using the python package kerchunk: check out Using Kerchunk with uncompressed NetCDF 64-bit offset files: Cloud-optimized access to HYCOM Ocean Model output on AWS Open Data, which provides a nice oceanographic demo by Rich Signell, and Fake it until you make it — Using Kerchunk to read NetCDF4 data on AWS S3 as Zarr for rapid data access by Lucas Sterzinger.

Zarr is an emergent data standard first introduced in 2016, and has implemented some nice features around efficient subsetting and chunking, cloud optimization, and flexible metadata handling. Zarr was born out of the need for scientific data formats optimized for object storage, instead of the traditional file-/block-based storage. This was driven by the explosion of cloud-hosted scientific data across the last decade. Zarr naturally has some distinct cloud optimization features not found in the file formats previously supported by netCDF.

netCDF and Zarr

In 2016, NSF Unidata was urged by our community to investigate options to allow netCDF to work more easily with modern cloud-based infrastructure. At that time, Zarr was identified as one of several possible initial avenues of interest. Based on the strong interest and rapid adoption of Zarr by the community, the netCDF team decided to begin working with the Zarr community to leverage the good work and contributions being made. Since this time, NSF Unidata has been an active participant in Zarr community meetings. Since 2022, NSF Unidata has had a voting seat on the Zarr Implementation Committee (ZIC), giving our community a formal voice in the technical development process adopted by the Zarr project.

At NSF Unidata we are interested and invested in the success of Zarr, and see it as a compliment to our netCDF efforts. Since our initial introduction to the Zarr community, netCDF has implemented ncZarr, a data storage format largely compatible with netCDF-4 enhanced data model. This new format has been integrated into netCDF so that users can leverage the advantages of cloud-based object storage without having to overhaul their existing code, or move away from netCDF software. A side effect of this adoption has been the ability to convert compatible files between ncZarr-based storage and more traditional netCDF files stored in block-based storage.

Unexpected Interactions with HPC

By design, Zarr's primary focus is on object storage. As the scientific community has investigated use of Zarr in research activities, situations where Zarr is not an appropriate choice have come to light. There have been surprising observations, particularly in High Performance Computing systems, as the community moves beyond sample datasets and begins exploring real-world data.

As part of its approach to object storage, Zarr generates a large number of 'files' which represent the corresponding dataset. An unintended consequence of this can be observed when we then consider the case where Zarr is not operating in an object store environment, but is instead being used within traditional block-storage filesystem (such as ext3/ext4, HFS+, or NTFS). The proliferation of files and directories generated can be a tremendous problem for large HPC systems, which by design serve many different types of users, filetypes, and software systems.

While this issue is not present for object storage, which is common for cloud systems like Amazon S3 or Azure Blob Storage (the use of which is becoming more and more common for Earth Systems Science data), it illustrates that there is seldom a one-size-fits-all solution for scientific data management. While object storage hosted scientific data is becoming more common, the bulk of scientific data used for data analysis, machine learning, and historic data archival still exist in traditional computing ecosystems.

We have put together a short and an extremely (perfectly?) imperfect Jupyter Notebook that illustrates this: netCDF vs zarr, an imperfect comparsion

While the Zarr files were faster to write, the example test case we used did create more than 2000 files, compared to just 2 netCDF files. With some effort, this notebook could probably be optimized for both netCDF and Zarr generation (we are hoping to get pull requests and comments from you about this!), but it serves to illustrate the situation.

As more and more HPC centers move to object storage, this potential downside might fade away in the future.

Thomas Martin is an AI/ML Software Engineer at the NSF Unidata Program Center. Have questions? Contact support-ml@unidata.ucar.edu or book an office hours meeting with Thomas on his Calendar.

Ward Fisher is the lead developer for NSF Unidata's netCDF efforts.

Convolutional Neural Networks (CNNs) for Earth Systems Science

2024-06-06T11:41:25+00:00

By Thomas Martin

Convolutional Neural Networks (CNNs) are a powerful class of deep learning models widely applied in Earth science for image analysis, classification, and regression problems. Leveraging the Keras framework in python, CNNs can efficiently process and extract spatial features from 2D and 3D remote sensing, model output, and other Earth Systems Science (ESS) data types.

An important feature of CNNs for ESS is their relative scale invariance (i.e. where specific features are on a 2/3D array), a characteristic that emerges from their architectural design. Scale invariance is facilitated by the use of local receptive fields, allowing the network to analyze specific regions of input data. By operating on local regions rather than the entire image, CNNs effectively capture features at different scales within the data. Additionally, CNNs employ weight sharing, where the same set of weights is applied across various spatial locations. This weight sharing mechanism enables the network to detect features regardless of their position in the input image, contributing to overall scale invariance. In addition, CNN architectures typically incorporate pooling layers, which downsample feature maps, reducing spatial dimensions while retaining essential information. This downsampling process enhances the network's focus on salient features while diminishing sensitivity to small spatial variations, further reinforcing its scale invariance.

(click to enlarge)

The image at right, from Visual Guide to Applied Convolution Neural Networks, shows how the filtering process works for a CNN. After a filter (or kernel) size is chosen, the filter array is populated with random values, then multiplied with each sub-array of matching size in the image (the convolution step) to create a feature map. This process is then repeated with a new set of filter values. After a large number of convolutions have been completed and the feature maps constructed, the algorithm chooses the filter array that results in the best match with existing training data. In this case, the algorithm identifies features that match those in a training set picturing dogs, and makes a prediction about whether the image being processed also represents a dog.

CNNs bear resemblance to standard filtering analysis, primarily through their shared use of convolutional operations. In both approaches, the convolution operation serves as a core mechanism for feature extraction. Moreover, both CNNs and standard filtering analysis operate hierarchically, capturing spatial hierarchies of features within the input. Through multiple convolutional layers, CNNs progressively extract higher-level features by amalgamating information from lower-level features, mirroring the hierarchical processing seen in standard filtering analysis.

While CNNs are well loved, they do have downsides:

Data Requirements: CNNs need large labeled datasets for training
Overfitting Risk: CNNs are prone to overfitting, where they memorize training data rather than generalize to new examples.
Interpretability Challenges: While some XAI (explainable AI) techniques exist to interpret input and outputs to CNNs, these tools are not perfect.
Generally not an appropriate model choice for tabular datasets.

These are not the only downsides, but things to keep in mind for your specific project.

Quick Code Block

CNN's in Keras 3.0 can be defined in around 10 lines of code:

  # Define a sequential model  model = keras.Sequential([  # First convolutional layer with 32 filters of size 5x5 and same padding  Conv2D(32, (5, 5), padding='same', strides=(1, 1)),  # Exponential Linear Unit (ELU) activation function for non-linearity  ELU(),  # Second convolutional layer with 32 filters of size 5x5 and same padding  Conv2D(32, (5, 5), padding='same'),  # ELU activation function  ELU(),  # Third convolutional layer with 1 filter of size 5x5 and same padding  Conv2D(1, (5, 5), padding='same'),  # No activation since we are solving a regression problem  ])

If you want to explore more on how this CNN was used to predict future pressure levels, take a look at the WeatherBench notebook:

ESS Research that uses CNNs

Figure from article #1 at left.

Additional Resources for Learning about CNNs

Thomas Martin is an AI/ML Software Engineer at the NSF Unidata Program Center. Have questions? Contact support-ml@unidata.ucar.edu or book an office hours meeting with Thomas on his Calendar.

SSEC Unidata Server Has Been Shut Down!

2024-05-22T10:20:47+00:00

The NSF Unidata server hosted at the Space Science and Engineering Center (SSEC) at the University of Wisconsin, Madison (unidata3.ssec.wisc.edu) has been permanently decommissioned.

Those who used the services provided by unidata3.ssec.wisc.edu should switch to the following alternate servers:

Service	If you used	Switch to
McIDAS ADDE	adde.ssec.wisc.edu	adde.ucar.edu
IDD/LDM Upstream	idd.ssec.wisc.edu	idd.unidata.ucar.edu
HTTP access to IDD Data		atm.ucar.edu

Please reach out to us at support@unidata.ucar.edu if you have any questions!

Why is the Keras 3 Release a Big Deal for the Deep Learning Community?

2024-04-18T09:13:00+00:00

By Thomas Martin, Julien Chastang, and Ana Espinoza

The Keras package is an open-source library that provides a Python interface for deep learning. Keras is intended to be a user-friendly, modular, and extensible way to enable fast experimentation with deep neural networks.

Keras has an approachable API that originally supported multiple backends including TensorFlow, Microsoft Cognitive Toolkit, Theano, and PlaidML. With the introduction of Keras version 2.4, however, only the TensorFlow backend was supported.

Use of different ML frameworks (click to enlarge)

With Keras version 3, however, the package provides APIs for using three backends: TensorFlow, JAX, and PyTorch. This is a nice change in our view, as PyTorch has gained traction for deep learning research and general use in many labs.

Technical Note: as of the date of publication of this article, the current version of Keras is 3.2.1. We recommend using the latest releases, as they are still working out some small bugs and API improvements.

Why would you want to use a different backend?

Some machine learning backend frameworks can be more optimized for specific tasks or hardware. For smaller, fully connected models that are common in Earth Systems Science research, this might not matter as much. But some of the speed-ups reported with using JAX might be worth looking into for larger problems. This article about JAX covers some of the advantages: Why You Should (or Shouldn't) be Using Google's JAX in 2023.

One reason to use the PyTorch ecosystem is the Fully Sharded Data Parallel API which allows for massive scaling on the largest of clusters.

While there are pros and cons to each backend, there is not one “best” option — especially when dealing with large, complex problems. Keras 3 benchmarks provides some additional insights into the suitability of different backend frameworks for different situations.

Is TensorFlow Dead?

Just like Fortran and Matlab, TensorFlow will be around for a long time. But with JAX having official support from Google, TensorFlow may not get the development support it has in the past. If you have a current project that uses TensorFlow, I would not worry about it today, but this evolving landscape is something to keep in mind for the future. (That future may be a long way off!) If problems with support for TensorFlow do start to emerge, with Keras 3 it should not be a major change to switch backends.

Why Does Unidata Care?

For deep learning training I (Thomas) will be using a Keras 3 API exclusively. It more closely resembles the scikit-learn api and I find it to be easier to explain. Other people might feel differently, but I currently do not see the downsides to teaching the new Keras api. This API is also used for some JupyterHubs already at Universities.

Unidata's Science Gateway Now Offers Access to Jupyter Hubs with Keras 3 loaded!

Please get in touch with our Science Gateway team at support-gateway@unidata.ucar.edu to inquire about access to a GPU enabled JupyterHub instance with Keras 3! While we offer this access free of charge to our community, access to these resources may be limited depending on demand.

An example notebook illustrating use of the Keras 3 API is here: Modified Weather Bench Code

You can find more information about Keras 3 in Introducing Keras 3.0.

Thomas Martin is an AI/ML Software Engineer at the NSF Unidata Program Center. Have questions? Contact support-ml@unidata.ucar.edu or book an office hours meeting with Thomas on his Calendar.

SSEC Unidata Server Shutting Down in April 2024

2024-04-02T10:43:23+00:00

The NSF Unidata server hosted at the Space Science and Engineering Center (SSEC) at the University of Wisconsin, Madison (unidata3.ssec.wisc.edu) will be permanently decommissioned on April 26, 2024.

Those using services provided by unidata3.ssec.wisc.edu can switch to using the following alternate servers:

Service	If you use	Switch to
McIDAS ADDE	adde.ssec.wisc.edu	adde.ucar.edu
IDD/LDM Upstream	idd.ssec.wisc.edu	idd.unidata.ucar.edu
HTTP access to IDD Data		atm.ucar.edu

Please reach out to us at support@unidata.ucar.edu if you have any questions!

Shutdown of Two Special-Purpose THREDDS Data Servers

2024-03-29T11:32:58+00:00

NSF Unidata will be shutting down two existing special-purpose THREDDS Data Servers on April 15, 2024:

https://threddsrc.ucar.edu/
https://thredds-jumbo.unidata.ucar.edu/

These servers were created for specialized reasons, and are no longer needed. All functionality of the two servers that will be decommisioned has been incorporated into NSF Unidata's main THREDDS Data Server:

https://thredds.ucar.edu

Please update any scripts, IDV bundles, or other local resources to use the https://thredds.ucar.edu host.

Please reach out to the THREDDS Development team at support-thredds@unidata.ucar.edu if you have any questions.

K Nearest Neighbors

2024-03-04T08:24:00+00:00

By Thomas Martin, AI/ML Software Engineer

Fred Rogers, famous for asking people to be his neighbor
(Click to enlarge)

K Nearest Neighbors (KNN) is a supervised machine learning method that 'memorizes' (stores) an entire dataset, then relies on the concepts of proximity and similarity to make predictions about new data. The basic idea is that if a new data point is in some sense 'close' to existing data points, its value is likely to be similar to the values of its neighbors. In the Earth Systems Sciences, such techniques can be useful for small- to moderate-scale classification and regression problems; one example uses KNN techniques to derive local-scale information about precipitation and temperature from regional- or global-scale numerical weather prediction model output.

When using a KNN algorithm, you select the number of 'neighbors' to consider (K), and potentially a way of calculating the 'distance' between data points. KNN algorithms can be used for both classification and regression problems. For regression problems, KNN predicts the target variable by using an averaging scheme. For classification problems it takes the mode of the nearest neighbors; as a result, it is generally recommended that the value of K be an odd number. Effective use of KNN often requires some experimentation to determine the best value for K.

.' href='https://assets.unidata.ucar.edu/blog_content/images/2024/20240219_ml_k_neighbors.png'>

Comparing the decision boundary between using 1 neighbor vs 20, from Kevin Zakka's blog.

KNN is sometimes called a 'lazy learning' method. This is because it does not generate a new explicit model, but rather memorizes the dataset in its entirety. While the scikit-learn API uses a .fit() method, this is largely to match the rest of the scikit-learn API.

Why you might use KNN for your ML project

It's simple. Because KNN is a lazy learner, there is no complex model and only limited math is needed to understand the inner workings.
It's adaptable to different data distributions. KNN works well with odd distributions of data.
It's good for smaller datasets. Because no model is being constructed, KNNs can be a good choice for smaller datasets.

Some Downsides to KNN

It's sensitive to outliers and poor feature selection. KNN does not do any automatic feature selection like decision tree models. These types of models can struggle in high dimensional space, both with a large number of input features and outliers within those features.
It has a relatively high computational cost. While the analog/sample matching behavior of KNNs are great from an explainability point of view (model-free ML is great!), for large datasets the cost of memorizing the entire dataset can be enormous.
It needs a complete dataset. Like many other ML models, KNNs do not handle missing data or NaN (Not a Number) values. If your dataset is not complete, you'll need to impute the missing values before using a KNN.

KNNs have been discussed previously on MetPy Mondays here: MetPy Mondays #183 - Predicting Rain with Machine Learning - Using KNN

KNNs are a great supervised ML model to try out if your dataset is on the smaller side. Happy modeling! What ML model should I cover in an upcoming blog?

News@Unidata

Running Pretrained AI-NWP Models, Our Experience at NSF Unidata on Jetstream2

Need a Large-ish GPU!

AI/ML Python Packaging is Still a Mess

Input Data Sourcing and Pre-processing

Hope for the Future

If You Just Want the Data

Use it in MetPy

Parting Thoughts

NSF Unidata-built Resources

References

Book Review - StatQuest Guide to Neural Networks

GOES Rebroadcast User Survey

What is GOES Rebroadcast?

What is GEOXO?

Does this Concern Me?

netCDF vs Zarr, an Incomplete Comparison

netCDF and Zarr

Unexpected Interactions with HPC

Convolutional Neural Networks (CNNs) for Earth Systems Science

Quick Code Block

ESS Research that uses CNNs

Additional Resources for Learning about CNNs

SSEC Unidata Server Has Been Shut Down!

Why is the Keras 3 Release a Big Deal for the Deep Learning Community?

Why would you want to use a different backend?

Is TensorFlow Dead?

Why Does Unidata Care?

Unidata's Science Gateway Now Offers Access to Jupyter Hubs with Keras 3 loaded!

SSEC Unidata Server Shutting Down in April 2024

Shutdown of Two Special-Purpose THREDDS Data Servers

K Nearest Neighbors

Why you might use KNN for your ML project

Some Downsides to KNN

More reading and resources