Add fine-tuning code and scripts #1

Merged
llewelld merged 9 commits into main from baskerville on Jun 3, 2025

Conversation

llewelld (Collaborator) commented:

Adds the initial code for fine-tuning.

Only the small model works on an 80 GiB A100. The standard model gives an out-of-memory error on the GPU.

@llewelld force-pushed the baskerville branch 5 times, most recently from 5779493 to f9c710c on May 22, 2025 at 13:10
llewelld added 9 commits on June 3, 2025 at 11:56
Adds code for fine-tuning the models on Baskerville.
The directory contains both prediction and fine-tuning code, so the name no longer
makes sense.

Keeping both tasks in the same directory turned out to be convenient because
the downloads directory can then be shared between them more easily.
Wraps the model in FSDP, but training still runs out of memory during the
backward pass.
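
For reference, a minimal sketch of this kind of FSDP wrapping in PyTorch; the auto-wrap policy, dtypes and the `build_model` helper are illustrative assumptions, not the code in this PR:

```python
# Sketch only: wrap a model in PyTorch FSDP with an assumed size-based
# auto-wrap policy and bf16 mixed precision.
import functools

import torch
import torch.distributed as dist
from torch.distributed.fsdp import FullyShardedDataParallel as FSDP
from torch.distributed.fsdp import MixedPrecision
from torch.distributed.fsdp.wrap import size_based_auto_wrap_policy

dist.init_process_group("nccl")
local_rank = dist.get_rank() % torch.cuda.device_count()
torch.cuda.set_device(local_rank)

model = build_model()  # hypothetical helper returning the nn.Module to fine-tune

model = FSDP(
    model,
    auto_wrap_policy=functools.partial(
        size_based_auto_wrap_policy, min_num_params=1_000_000
    ),
    mixed_precision=MixedPrecision(
        param_dtype=torch.bfloat16, reduce_dtype=torch.bfloat16
    ),
    device_id=torch.cuda.current_device(),
)

# Sharding parameters lowers per-GPU memory for the forward pass, but
# activations and optimiser state can still exhaust memory during the
# backward pass, which matches the behaviour described above.
```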
Adds headers to various files (a sketch of such a header follows this list):
1. Shebang interpreter directive.
2. vim modeline configuration.
3. SPDX licence identifier.
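
For illustration, a header combining the three items above might look like this; the modeline options and the licence identifier are assumptions, not necessarily the values used in this PR:

```python
#!/usr/bin/env python3
# vim: et:ts=4:sts=4:sw=4
# SPDX-License-Identifier: MIT
#
# Note: the modeline settings and the MIT identifier above are illustrative
# assumptions; the repository's actual headers may differ.
```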
Renames the scripts that most closely match those working on DAWN to use the
suffix "aligned", avoiding confusion over their purpose.
Adds a loss function to align with the formula in the paper.
Fixes the loss function implementation for the aligned and FSDP cases.
Adds code for comparing DAWN and Baskerville results and generating comparison
graphs.
Adds a README with instructions for running the comparisons and generating the
comparison graphs.
@llewelld merged commit 0830e37 into main on Jun 3, 2025