GeoSciAI2026 - 気象課題 / Meteorological Challenge

GeoSciAI2026

1.はじめに --Introduction

2.データについて --Data

3.評価について --Evaluation Metrics

4.チュートリアルを兼ねた例 --Baseline example scripts

5.予測結果の自動採点システムアップロード --How to upload prediction to the scoring system

気象課題

「気候データを高解像度化しよう」

Meteorological AI Challenge:
Climate Data Downscaling

1.はじめに --Introduction

皆さんは、天気予報をどのくらい活用していますか。天気予報は、日々の生活だけでなく、災害への備えや社会活動を支える重要な情報です。

現在の天気予報は、主に数値モデルと呼ばれる計算手法によって作られています。これは、地球や地域を格子状に区切り、気温や降水量などの変化を物理法則に基づいて計算するものです。まず全球規模のモデルで大まかな気象の流れを計算し、その結果をもとに、より細かい格子を用いた領域モデルで詳細な予測を行います。

このように、解像度を段階的に高めていく手法を「ダウンスケーリング」と呼びます。気象予報だけでなく、より長期的な気候予測とその応用分野でも、ダウンスケーリングが活用されています。将来の気候変動によってどのような影響があるかは地域によって異なるため、それを調べるために、地域詳細な気候予測データが必要になるのです。ただし、領域モデルを使ったダウンスケーリングは高解像度になるほど計算コストが大きくなるため、近年では機械学習を用いてこの過程を効率化する研究が進められています。

本コンペティションでは、四国周辺を対象とした0.25度解像度（約25km四国周辺）の複数の気象データから、同じ範囲を対象とした0.05度解像度（約5km）の降水量分布を予測する課題に取り組んでいただきます。気象やデータ分析に興味のある方の、積極的な挑戦をお待ちしています。

How often do you check weather forecasts? Weather forecasts provide valuable information not only for our daily lives but also for disaster preparedness and many social activities.

Today’s weather forecasts are mainly created using computational methods called numerical models. These models divide the Earth or a specific region into grid cells and simulate changes in weather variables such as temperature and precipitation based on physical laws. Typically, large-scale atmospheric patterns are first calculated using global models. These results are then used in regional models with finer grid spacing to produce more detailed predictions.

This process of gradually increasing spatial resolution is known as downscaling. Downscaling is used not only in weather forecasting but also in long-term climate projections and their applications. Since the impacts of climate change vary from region to region, high-resolution climate data are essential for understanding local risks and supporting adaptation planning. However, producing high-resolution data using traditional numerical models requires substantial computational resources. Recently, machine learning approaches have attracted increasing attention as a promising way to make this process more efficient.

In this competition, participants will develop models to predict precipitation distributions at 0.05-degree resolution using multiple meteorological datasets provided at 0.25-degree resolution. We encourage students, researchers, and anyone interested in meteorology, climate science, or data science to take on this challenge and explore new approaches to climate data downscaling.

2.データについて --Data

予測対象は0.05度格子の降水量です。予測に用いることができるのは、0.25度格子の様々な気象変数です。0.25度/0.05度の標高データも使うことができます。（参考までに0.05度格子の気温も置いておりますが、予測対象期間の0.05度格子の気温はありません）

0.25度格子と0.05度格子で提供する気象変数が異なります。対象地域は四国周辺です。1つの0.25度格子の中に、0.05度格子が5x5でぴったりはまる構造になっています（50x50を5x5の格子にまとめると、中央の格子の座標が10x10の格子の座標と一致します；下図参照）。ファイル名は特徴量名_年_粒度.npzです。1994～1998年の5年分のデータで学習し、2003年夏季のデータ（JJA2003）に適用してください。

0.25度格子（10×10）
- rain (降水量；mm/hr)
- temp (地上気温；K)
- slp (地上気圧；hPa)
- ua850 (850hPa東西風；m/s)
- va850 (850hPa南北風；m/s)
- ta850 (850hPa気温；K)
- hus850 (850hPa比湿；kg/kg)
- zs (標高データ；固定値；m)
0.05度格子（50×50）
- rain (降水量；予測する対象です；mm/hr)
- t (地上気温；K)
- zs (標高データ；固定値；m)

The meteorological variables provided differ between the 0.25-degree grid and the 0.05-degree grid. The target domain covers the region around Shikoku, Japan. The dataset is structured such that each 0.25-degree grid cell contains a corresponding 5 × 5 set of 0.05-degree grid cells. File names follow the format featureName_year_resolution.npz.

Participants should train their models using the five years of data from 1994–1998 and apply the trained model to the summer 2003 dataset (JJA2003).

0.25-degree Grid (10 × 10)

rain – Precipitation (mm/hr)
temp – Surface air temperature (K)
slp – Sea level pressure (hPa)
ua850 – Zonal wind at 850 hPa (m/s)
va850 – Meridional wind at 850 hPa (m/s)
ta850 – Air temperature at 850 hPa (K)
hus850 – Specific humidity at 850 hPa (kg/kg)
zs – Surface elevation (static field; m)

0.05-degree Grid (50 × 50)

rain – Precipitation (mm/hr) (target variable)
t – Surface air temperature (K)
zs – Surface elevation (static field; m)

図：対象範囲の南西端部における50x50格子と10x10格子の関係
The relationship between the 50×50 grid and the 10×10 grid at the southwestern part of the target domain.

3.評価について --Evaluation Metrics

評価は降水量の二乗平均平方根誤差（Root mean square error）で行います。0mmの降水量も含みます。6時間ごと5年分の0.25度格子（10格子×10格子）と、0.05度格子（50格子×50格子）の情報を使ってモデルを構築し、評価期間（時間ステップ数368）の0.05度格子（50格子×50格子）の降水量を推定してください。降水量は、6時間ごとの、過去1時間降水量のことです。
学習データは通年のデータですが、テストは夏のみを対象としています。評価データ（テストデータ）の内容について、統計解析、可視化、または目視による確認を含め、いかなる方法による事前確認も禁止します。

Model performance will be evaluated using the Root Mean Square Error (RMSE) of precipitation. Precipitation values, including zero precipitation, will be directly used in the RMSE calculation without any transformation.

Participants are required to develop their models using 6-hourly data over a 5-year period, provided at both 0.25-degree resolution (10 × 10 grid cells) and 0.05-degree resolution (50 × 50 grid cells). Precipitation refers to the hourly accumulated precipitation for the preceding hour, provided at 6-hour intervals.

The RMSE will be calculated by aggregating prediction errors over all grid cells and all time steps in the evaluation dataset.

The training dataset includes data from all seasons, whereas the test dataset is limited to the summer season. Any attempt to inspect, analyze, or infer properties of the evaluation dataset is strictly prohibited.

4.チュートリアルを兼ねた例 --Baseline example scripts

粗い格子の情報と細かい格子の情報を対応させるためにはいくつかの手法がありますが、ここではいったん粗い格子を細かい方の格子に合わせて内挿し、推定値を作成する例を示します。なお、ファイル名は特徴量名_年_粒度.npzです。ここでは降水量のみを使って推定をしています。

① データを読みこんで図化するスクリプト（いちばんはじめの時間ステップのみ；以下に示すのは50x50の標高データを図化した例）

使い方：python plot_npz_50x50.py zs_50x50.npz --out fig.png

② 内挿後に線形回帰して、テストデータに適用して推定値を出力するスクリプト

使い方：python interp_regress.py \

　　 --train_coarse_npz coarse_10x10.npz \

--train_fine_npz fine_50x50.npz \

--test_coarse_npz coarse_test_10x10.npz

--out_npz pred_from_regression.npz

There are several approaches to aligning coarse- and fine-resolution information. Here, as a simple baseline example, we first interpolate the coarse-resolution data onto the fine-resolution grid and then produce estimated values on that grid.

Script to load and visualize the data (first time step only)
Usage: python plot_npz_50x50.py zs_50x50.npz --out fig.png
Script that interpolates coarse-grid data to the fine grid, applies linear regression, generates estimated values for the test dataset
Usage: python interp_regress.py \
　　 --train_coarse_npz coarse_10x10.npz \
--train_fine_npz fine_50x50.npz \
--test_coarse_npz coarse_test_10x10.npz
--out_npz pred_from_regression.npz

5.予測結果の自動採点システムアップロード --How to upload prediction to the scoring system

予測結果のアップロード方法

本ページ上部の投稿方法ページに従って、自動採点システムへの参加登録が完了しましたら、トップ画面上部の「コンペティション一覧」から、参加したいコンペティションのページに移動してください。
「データ」カラムをクリックし、評価データのURLを記載したファイルと提出ファイルのサンプルをダウンロードします。
ダウンロードした評価データを用いて、予測を行います。予測結果を、予測結果のndarrayのみを含むnpzファイルとして保存します。この時、配列名（キー名）は問いません。
「投稿」カラムをクリックし、投稿ページから、予測結果のnpzファイルを提出します。
提出に成功すると、スコアが計算され、表示されます。ランキングは最後に提出したスコアに基づいて計算されます。

Please follow the instructions on the “Submission Guideline” page at the top of this page to complete your registration for the scoring system. Once registration is complete, navigate to the competition you wish to join from the “Competition List” at the top of the main page.
Click the “Data” column to download a file containing the URL of the evaluation dataset (same to the dataset provided on this page) as well as a sample submission file.
Use the downloaded evaluation data to generate your predictions. Save your predictions as an npz file that contains only the prediction ndarray. The array name (key) can be arbitrary.
Click the “Submission” column and upload your npz file from the submission page.
Once the submission is successful, your score will be calculated and displayed. The ranking will be determined based on your most recent submission.

注意：スコア・ランキングについて

スコアは、コンペティション開催中は、テストデータの50%のみを用いて計算されます。このスコアを「中間スコア」と呼びます。コンペティション開催中のランキングは、中間スコアによって算出されます。

最終的なスコア・ランキングは、最後に提出した予測結果について、中間スコアに用いられなかった残りの50%を用いて計算されますため、中間スコアと異なることにご注意くさい。最終的なスコア・ランキングは、コンペティション終了後、JpGU-AGU Meeting 2026にて行われる表彰式にて発表されます。

Note: Regarding Scores and Rankings

During the competition, scores are calculated using only 50% of the test data. This score is referred to as the intermediate score. The ranking during the competition is determined based on this intermediate score.

The final score and ranking will be calculated using the remaining 50% of the test data that was not used for the intermediate score, based on your most recent submission. Therefore, please note that the final results may differ from the intermediate results.

The final score and ranking will be announced at the award ceremony held during the JpGU-AGU Meeting 2026 after the competition concludes.

Page updated

Report abuse