3 March 2025 1800 UTC #73
Replies: 2 comments
-
Notes(5 | 1800) Opening
(5 | 1805) Check insName and affiliation. Why you are here? How are you showing up? https://feelingswheel.com/ (20 | 1820) Scoping: Data rescueContext: Data rescue is the transformation of information from a non-machine readable format to a machine readable one. This can include scanning paper copies of reports or records, transcribing scanned records into flat text documents, or extracting x-y values from a scanned graph. Generally this stage should be as close to the original information as practical for specific purpose, and should be extendable for future purpose. This stage must maintain provenance of the data including links to the original source, how the data was made machine readable, what data was not rescued, and who did this work.
(20 | 1840) Scoping: Data harmonizationContext: Data harmonization is the process of transforming data from one data model to a second. In this context, a data model is a digital set of relational data tables and/or graphical (list-based) data including both primary data and metadata objects. In this project, data harmonization focuses on ensuring complete annotations that link the data objects and fill in any missing gaps needed for the data curation phase including units, control vocabulary, methodology details, and definitions. Any text extractions from non-machine readable sources should build off of a Data Rescue.
|
Beta Was this translation helpful? Give feedback.
-
RecapWe had a scoping discussion on data rescue and harmonization today and established the context for each of these phases. Revisions of this context based on the conversation today are below. Our next call will be March 31 and focus on scoping the data curation phase as well as brainstorming next steps for the data rescue phase. Data rescueData rescue is the transformation of information from a non-machine readable format to a machine readable one. This can include scanning paper copies of reports or records, transcribing scanned records into flat text documents, or extracting x-y values from a scanned graph. Generally this stage should be as close to the original information as practical for specific purpose, and should be extendable for future purpose. Information here may be primary or meta data. This stage must maintain provenance of the data including links to the original source, how the data was made machine readable, what data was not rescued, and who did this work. Data harmonizationData harmonization is the process of transforming data from the original collection of machine readable information to a standardize format. This original data collection may be composed of a machine-readable set of relational data tables and/or graphical (list-based) data including both primary data and metadata objects. The transformation is done via a set of scripts and manually created annotation file. This final format is a single data table with the following columns: id(s) - of variable - is type - with entry. Provenance in this stage is maintained by a |
Beta Was this translation helpful? Give feedback.
Uh oh!
There was an error while loading. Please reload this page.
-
Agenda
These calls are lead by @ktoddbrown and are generally open calls. Please contact them for call in information. See #72 for overall strategic direction of meetings.
(5 | 1800) Opening
(5 | 1805) Check ins
(20 | 1820) Scoping: Data rescue
Context: Data rescue is the transformation of information from a non-machine readable format to a machine readable one. This can include scanning paper copies of reports or records, transcribing scanned records into flat text documents, or extracting x-y values from a scanned graph. Generally this stage should be as close to the original information as practical for specific purpose, and should be extendable for future purpose. This stage must maintain provenance of the data including links to the original source, how the data was made machine readable, what data was not rescued, and who did this work.
Status: Currently this stage is being prototyped and documentation is under development in the wiki.
(20 | 1840) Scoping: Data harmonization
Context: Data harmonization is the process of transforming data from one data model to a second. In this context, a data model is a digital set of relational data tables and/or graphical (list-based) data including both primary data and metadata objects. In this project, data harmonization focuses on ensuring complete annotations that link the data objects and fill in any missing gaps needed for the data curation phase including units, control vocabulary, methodology details, and definitions. Any text extractions from non-machine readable sources should build off of a Data Rescue.
Status: Currently this stage is documented in the wiki but does not include references to data rescue.
(20 | 1900) Stuck to unstuck and ToDo's
ToDo:
@ktoddbrown write up recap from 17 February 2025 #71, draft 2025 priorities in 2025 Recaps #72, and drafted today's agenda
Round: How can the group help you move forward in this project?
(5 | 1920) Follow up on
(5 | 1925) Check outs [appreciations, discoveries, and ToDo]
Beta Was this translation helpful? Give feedback.
All reactions