For science-quality runs, we should improve traceability and reproducibility. Currently, FEDS runs are not fully self-documented and depend on someone remembering some combination of: which input data was used, which version of the codebase was used, the environment the run was created in (which can be mismatched between DPS, ADE, and local), what settings were used, and which region bounds were used. Some of this information exists, but it is spread across various logs and .env files, or left implicit in directory naming. This worked fine at first, but as we continue to add new sensors (NOAA21), new types of runs (NRT vs. "archive" vs. "constrained archive," and more), and more collaborators, having a systematic record will only become more important.
As a goal, I'd like to get to a "single source of truth": one metadata record for each run that contains information such as:

- the input data used,
- the codebase version (e.g., git commit),
- the environment the run was created in (DPS, ADE, or local),
- the settings used,
- the region bounds, and
- the run type (NRT, archive, constrained archive) and sensors involved.
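As a rough sketch of what this could look like (the field names below are hypothetical, not an existing FEDS schema), the record could be a small dataclass serialized to JSON alongside each run's outputs:

```python
# Hypothetical sketch of a per-run metadata record. Field names are
# illustrative, not an existing FEDS schema.
import json
import subprocess
from dataclasses import asdict, dataclass, field
from datetime import datetime, timezone


def _git_commit() -> str:
    """Current commit of the checked-out codebase (assumes a git checkout)."""
    return subprocess.check_output(["git", "rev-parse", "HEAD"], text=True).strip()


@dataclass
class RunRecord:
    run_type: str       # "NRT", "archive", or "constrained archive"
    sensors: list[str]  # e.g. ["VIIRS-SNPP", "VIIRS-NOAA21"]
    input_data: str     # identifier or path of the input data used
    region_bounds: tuple[float, float, float, float]  # (min lon, min lat, max lon, max lat)
    settings: dict      # resolved run settings (e.g. parsed from the .env)
    environment: str    # "DPS", "ADE", or "local"
    code_version: str = field(default_factory=_git_commit)
    created_at: str = field(default_factory=lambda: datetime.now(timezone.utc).isoformat())


def write_record(record: RunRecord, path: str) -> None:
    """Serialize the record alongside the run's outputs."""
    with open(path, "w") as f:
        json.dump(asdict(record), f, indent=2)
```

Populating this from the resolved settings at launch time, rather than reconstructing it afterward, would keep the record in sync with what actually ran.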
@eorland @mccabete are there other bits of information you would like to see in this record?