Extract rest of PowerPoints and add to database #298

@leandrumartin

Description

🎯 Goal / Objective

A clear and concise description of what this task aims to achieve. Why is this task important?

We need to extract data from the rest of the PowerPoint files available to us and add the data to the database.

See Docs/README_data_extraction_process.md for the data extraction steps, including which extraction scripts to run. That file will be updated shortly with more complete steps on how to correct and tweak the extracted data, and you will be notified when it changes.

The extracted files ultimately belong in the appropriate subfolder of the DataPelvis/ folder on the data branch. To get them there, make sure your local data branch is up to date, create a new branch from it, add the extracted files to the appropriate subfolders, and open a pull request against the data branch.
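The branch workflow above can be sketched as a small Python helper that builds the git command sequence. This is only an illustration, not a project script: the remote name `origin`, the branch name, the commit message, and the example path are all assumptions, so substitute the real subfolder and file names.

```python
import subprocess
from typing import List, Sequence


def data_branch_commands(
    new_branch: str,
    paths: Sequence[str],
    remote: str = "origin",  # assumption: the remote is named "origin"
    base: str = "data",      # the data branch named in this issue
) -> List[List[str]]:
    """Build the git commands for adding extracted files via a new branch."""
    if not paths:
        raise ValueError("at least one extracted file path is required")
    return [
        ["git", "switch", base],                     # go to the data branch
        ["git", "pull", "--ff-only", remote, base],  # bring it up to date
        ["git", "switch", "-c", new_branch],         # split off a new branch
        ["git", "add", "--", *paths],                # stage the extracted files
        ["git", "commit", "-m", f"Add extracted PowerPoint data ({new_branch})"],
        ["git", "push", "-u", remote, new_branch],   # publish for the pull request
    ]


def run_all(commands: List[List[str]], dry_run: bool = True) -> None:
    """Print each command; execute it only when dry_run is False."""
    for cmd in commands:
        print(" ".join(cmd))
        if not dry_run:
            subprocess.run(cmd, check=True)


if __name__ == "__main__":
    # Hypothetical branch name and file path, for illustration only.
    cmds = data_branch_commands(
        "extract-remaining-powerpoints",
        ["DataPelvis/example_subfolder/example_output.csv"],
    )
    run_all(cmds, dry_run=True)
```

The default dry run only prints the commands. When running them for real, the extracted files must already be in place under DataPelvis/ before the `git add` step, and the pull request itself is still opened against the data branch on GitHub.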


✅ Tasks to be Completed

A checklist of the specific, actionable steps required to complete this issue. This helps track progress.

  • Run extraction scripts on the remaining PowerPoints
  • Correct data
  • Add to appropriate subfolder

Acceptance Criteria

A checklist of conditions that must be met for this task to be considered complete. How will we verify that it's done correctly?

  • All data from the PowerPoints have been extracted
  • The data is clean and correct

Additional Context

Add any other context, notes, screenshots, or links that might be helpful for completing this task.

See sub-issues.

Sub-issues

Metadata

Assignees

No one assigned

    Labels

    enhancement: New feature or request

    Type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests
