From f493ef46ec17066aa56b00c028e99fe0873335d5 Mon Sep 17 00:00:00 2001 From: tmanik <tmanik@internet2.edu> Date: Mon, 9 Sep 2024 14:59:19 -0400 Subject: [PATCH] Updated with additional info --- 03-data-processing/README.md | 4 ++++ 1 file changed, 4 insertions(+) diff --git a/03-data-processing/README.md b/03-data-processing/README.md index b99badd..5a43c48 100644 --- a/03-data-processing/README.md +++ b/03-data-processing/README.md @@ -17,6 +17,10 @@ By the end of this section, you will have: These scripts will form the backbone of our data processing pipeline, allowing us to efficiently collect, store, and prepare weather data for analysis. +### Data Set Description + +This project utilizes the [NOAA Global Surface Summary of the Day (GSOD) dataset](https://registry.opendata.aws/noaa-gsod/), which is available through the AWS Open Data Registry. The GSOD dataset is derived from the Integrated Surface Hourly (ISH) dataset, containing global weather data from over 9,000 stations worldwide. It provides daily summaries of weather observations, including metrics like temperature, wind, and precipitation, and is updated 1-2 days after the original observation. The data collection starts from 1929 and is currently at Version 8, making it a comprehensive resource for analyzing long-term weather trends and patterns. + ## Prerequisites Before starting this lesson, please ensure that you have: