Skip to content

Add Lower Limb data extracted from PowerPoint#312

Open
Brehana-Naidu wants to merge 4 commits intodatafrom
lower-limb-data
Open

Add Lower Limb data extracted from PowerPoint#312
Brehana-Naidu wants to merge 4 commits intodatafrom
lower-limb-data

Conversation

@Brehana-Naidu
Copy link
Collaborator

Lower Limb Data Addition

Overview

This pull request adds the complete extracted and organized data from the Bone Box (Lower Limb) PowerPoint presentation to the data branch.

Data Included

Boneset Structure

  • lower_limb.json: Main boneset definition containing 5 bones

Bones (5 total)

  • Femur (with 13 subbones)
  • Patella (with 2 subbones)
  • Tibia (with 8 subbones)
  • Fibula (with 8 subbones)
  • Foot (with 10 subbones)

Subbones (40+ files)

Individual JSON files for each anatomical subpart with standardized naming (snake_case IDs).

Descriptions (45+ files)

  • Individual description files for each bone and subbone
  • Includes detailed anatomical descriptions
  • Contains muscle attachment information
  • Properly formatted and cleaned

Images (120+ files)

  • Extracted from 57 slides of the PowerPoint
  • Organized by bone with sequential numbering
  • Files named: {bone_id}_image{number}.{png|jpeg}

Annotations

  • Colored Regions (4 files): Anatomical overlay data for femur, tibia, fibula, and foot
  • Text Labels (4 files): Text annotation data for femur, patella, tibia, and fibula

Quality Standards

All data follows DataPelvis folder structure
File naming conventions standardized
JSON structure validated
226+ files organized into appropriate directories
Complete hierarchy: boneset bones subbones descriptions + images

Related Issue

Resolves extraction of Lower Limb PowerPoint data for digital anatomy reference tool

BreNaidu added 3 commits March 2, 2026 11:15
- Extracted 57 slides from Lower Limb PowerPoint
- Extracted bone images (150+ images)
- Extracted bone descriptions (57 bones with descriptions)
- Extracted bone hierarchy (boneset structure)
- Extracted colored regions annotations (46 slides)
- Extracted text label annotations (4 labeled slides)

Files prepared in: data_extraction/Bone_Box_Lower_Limb_extracted/
Pending: Final cleanup and organization per DataPelvis structure
Organized and added complete Lower Limb extraction data:

Boneset Structure:
- lower_limb.json (main boneset)

Bones (5 total):
- femur.json, patella.json, tibia.json, fibula.json, foot.json

Subbones (40+ total):
- Individual JSON files for each anatom anatomical subpart

Descriptions (45+ files):
- Individual description files for each bone and subbone
- Includes anatomical details and muscle attachment information

Images (120+ files):
- Organized and renamed images for each bone
- Files: femur_image1-120.{png,jpeg}, patella_image.png, tibia_image.png, etc.

Annotations:
- Colored Regions (4 files): femur, tibia, fibula, foot
- Text Labels (4 files): femur, patella, tibia, fibula

All data follows DataPelvis folder structure standards
@leandrumartin leandrumartin linked an issue Mar 3, 2026 that may be closed by this pull request
5 tasks
Copy link
Collaborator

@leandrumartin leandrumartin left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

  • The data_extraction folder should not be committed
  • The images should not all be named after just the femur. You should look through the PowerPoint and verify which bone the image is a picture of based on the slides. You can also go ahead and get rid of duplicate images.
  • The femur and fibula both have "head" and "shaft" subbones. In order for these to have unique IDs, I think they should be renamed to "femur_head", "fibula_head", "femur_shaft", and "fibula_shaft". These changes would have to be made in bones/femur.json, bones/fibula.json, and in the subbones/ directory. It also looks like there are only descriptions (in descriptions/ for the femur head and femur shaft, not for the fibula head and fibula shaft.
  • The descriptions generally need to be cleaned up by looking through the PowerPoints and making sure the description arrays only contain lines of text in the actual descriptions on each slide.
  • The text labels weren't extracted, which is fine, since the script needed to be improved. I've recently made improvements to the text extraction script; you can update your main branch to see the changes to the new extract_text_labels.py which should better extract the text labels.

Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Since they have their own dedicated slides in the PowerPoint, the following should also be their own dedicated subbones:

[
  "1st_metatarsal",
  "proximal_phalanx",
  "distal_phalanx",
  "2nd-5th_metatarsals"
  "phalanges"
]

Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This is the description for the "1st metatarsal." The filename, name, and ID should be adjusted to match.

Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This is the description for the "2nd-5th metatarsals." The filename, name, and ID should be adjusted to match.

Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This file should not be committed.

@leandrumartin
Copy link
Collaborator

Slight addendum to my previous comment about removing duplicate images: To link a boneset, bone, or subbone to a specific image, you can (as described in the documentation for cleaning the output of Extract_Bone_Descriptions.py) include that image filename in the "images" field. So duplicate images are not needed at all.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

Extract Lower Limb PowerPoint data and add to database

3 participants