Skip to content

Conversation

@gabrieltseng
Copy link
Contributor

Depends on allenai/rslearn#272

@robmarkcole
Copy link

robmarkcole commented Sep 30, 2025

Is this just using the dataset as is from https://zenodo.org/records/5012942 ? I guess not, in which case it would be nice to know how it is generated, presumably with rslearn

@favyen2
Copy link
Collaborator

favyen2 commented Sep 30, 2025

Is this just using the dataset as is from https://zenodo.org/records/5012942 ? I guess not, in which case it would be nice to know how it is generated, presumably with rslearn

Here is the script used to convert the data:

https://github.com/allenai/rslearn_projects/blob/master/one_off_projects/2025_05_13_pastis/convert.py

It is just converted to rslearn dataset format, but the images still come from the original dataset (not generated by rslearn).

@robmarkcole
Copy link

robmarkcole commented Oct 1, 2025

@gabrieltseng it is also necessary to include the required config.json, might be worth documenting these.

What machine did you use for training? On an H100 I get Tried to allocate 112.50 GiB. GPU 0 has a total capacity of 79.44 GiB of which 50.23 GiB is free with batch_size 1

@gabrieltseng
Copy link
Contributor Author

@robmarkcole I have been using GPUs with less memory than a H100 for testing. It may be worth increasing the patch size or decreasing the model size if you are running into memory issues

@favyen2 favyen2 merged commit ea968a4 into master Oct 6, 2025
3 checks passed
@favyen2 favyen2 deleted the galileo-config branch October 6, 2025 15:09
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants