Import scripts for Feb 2025 E&O Open Data Release #258
base: master
These are the scripts and metadata needed for the 2025 Feb ATLAS Education and Outreach Open Data Release. These are based on the recent proton-proton Research Open Data release, but with several simplified formats for various educational projects and skims that satisfy a variety of use-cases. We'll need 25 new DOIs and record IDs for this release, and we should plan for a news item as usual. Please let me know if anything seems amiss, or if I've missed anything in here!
The record IDs could start from 93910, with the following DOIs:
Thanks @tiborsimko! I've pushed an update that includes those record IDs and DOIs, and it seems to run locally. I think this is ready for you to try out to see if a test deployment works. Let me know if I've screwed anything up, or missed anything! Best,
atlas-2025feb-odeo/make_odeo_json.py
Outdated
},
{
  "description": "More about this ntuple format",
  "url": "https://opendata.atlas.cern/docs/documentation/data_format/FEB2025_ntuple/"
Note that this URL does not resolve: https://opendata.atlas.cern/docs/documentation/data_format/FEB2025_ntuple/
Yep, we are just waiting on an MR to create the page (one issue there left to resolve). If this could go ahead in the test deployment, that'd be great — we'd then want to make sure all the links are 100% working before the final deployment.
Thanks, I have left some minor observations about a broken link and a placeholder record ID.
Looking at the ODEO records, they have at most 373 files attached. This means that we do not have to attach them via file indexes (as for record 80000), but could attach them directly (as for record 15010). If we do this, then we can simply amend the produced JSON snippets for files from the current format:
{
  "checksum": "adler3204d90a6c",
  "events": 1471843,
  "filename": "opendata:ODEO_FEB2025_v0_1LMET30_data15_periodD.1LMET30.root",
  "size": 364145700,
  "type": "root",
  "uri_root": "root://eospublic.cern.ch:1094//eos/opendata/atlas/rucio/opendata/ODEO_FEB2025_v0_1LMET30_data15_periodD.1LMET30.root"
},
to the following format:
{
  "checksum": "adler32:04d90a6c",
  "size": 364145700,
  "uri": "root://eospublic.cern.ch//eos/opendata/atlas/rucio/opendata/ODEO_FEB2025_v0_1LMET30_data15_periodD.1LMET30.root"
},
After that the record JSON files are directly uploadable. Here is an example of how it looks:
Would this be OK?
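For illustration, the per-file change proposed above could be sketched roughly as follows. This is not the code in `make_odeo_json.py`; the function names `normalize_checksum` and `to_direct_file_entry` are hypothetical, and the checksum handling assumes that a value without a colon after `adler32` should gain one, as the two snippets above suggest.

```python
def normalize_checksum(checksum: str) -> str:
    """Return the checksum as 'adler32:<hex>' (assumed target form)."""
    prefix = "adler32"
    if checksum.startswith(prefix + ":"):
        return checksum  # already in the target form
    if checksum.startswith(prefix):
        # Assumption: the remainder after 'adler32' is the hex digest
        return prefix + ":" + checksum[len(prefix):]
    return checksum  # unknown scheme, leave untouched


def to_direct_file_entry(entry: dict) -> dict:
    """Map a file-index style entry to the directly attachable form.

    Keeps only checksum, size and uri; drops events, filename and type,
    and removes the explicit ':1094' port from the XRootD URI.
    """
    uri = entry["uri_root"].replace(
        "eospublic.cern.ch:1094/", "eospublic.cern.ch/", 1
    )
    return {
        "checksum": normalize_checksum(entry["checksum"]),
        "size": entry["size"],
        "uri": uri,
    }
```

Applying `to_direct_file_entry` to each element of a record's file list would give entries in the second format shown above; how those entries are then placed into the record JSON is left to the existing script.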
Thanks @tiborsimko! That proposed change is fine with me, and I've gone ahead and applied it in the latest update. Cheers,
Assigning @tiborsimko as he's helped us with the last imports, but please feel free to re-assign if appropriate.
Cheers,
Zach