On the previous page, I have added USA climate data from a data set found at Kaggle. OK, but this data stops at 2021. The documentation on the Kaggle data set informs us that it was created from the Global Historical Climatology Network - Daily (GHCN-Daily) dataset, So I decided to have a look at this data set, The information can be found at www.ncei.noaa.gov/cdo-web/search?datasetid=GHCND .Â
I will be using API calls to get data from this website. The format of the URL is like in the following example:
This will return me the TMAX (maximum temperature) value for the dates between 1-Nov-2025 and 30-Nov-2025 for the weather station US10adam025
To automate this process, I have developed the workflow below:
It reads all the USA weather stations from the usa_stations_with_state table and , in a loop, requests the weather data from the NOAA API.
The API call URL is constructed in the String Manipulation node, like this.
join("https://www.ncei.noaa.gov/access/services/data/v1?dataset=daily-summaries&dataTypes=TMAX&startDate=2025-11-01&endDate=2025-11-30&format=csv&units=metric&stations=",$station_id$)
The results are written to the USAdata_2025 table:
Below you can see an improved version of the previous workflow. It incorporates a Try / Catch block around the CSV reader, which prevents the workflow from failing in case the API does not respond properly.
Let's now check for the highest temperatures recorded in the US during 2025. Below is the workflow which does this :
Below is the output of this workflow. Several values reached or exceeded 50 degrees Centigrade.
The workflow below shows the maximum values recorded per state for all the data we have loaded so far (2018, 2019, 2020, 2021 and 2025) :
And below its output. Note that the top six maximum temperature values recorded correspond to the year 2025. The values are in tenths of degrees Centigrade.
I have installed the KNIME batch execution extension and created a CRON entry to execute the USA data workflow once every day.... TODO modify it to use current date in the API call.