Fix corrupt image handling in custom dataset by sushantkhemalapure · Pull Request #126 · humanai-foundation/RenAIssance

sushantkhemalapure · 2026-03-27T16:49:59Z

Issue

In custom_dataset.py, when an image cannot be read (e.g., corrupt or missing), the dataset returns an unexpected type (such as a string or tuple) instead of the expected sample dictionary. This can break the DataLoader during training or evaluation.

Fix

Updated the dataset logic to handle unreadable images safely by ensuring only valid sample formats are returned. Invalid samples are either skipped or handled with a clear error.

Impact

Prevents DataLoader crashes caused by inconsistent return types
Improves robustness when encountering corrupt or missing images
Makes training and evaluation more stable

Testing

Tested with valid images → dataset works as expected
Simulated corrupt/missing images → handled without crashing

…n clarity

- Updated TrOCR model path from ../../weights to ../../models - Ensures correct model loading instead of CRAFT weights

- Corrected unpacking of image.shape[:2] to (height, width) - Prevents distortion in rotated images

- Added sys.path handling to locate CRAFT module - Ensures qapp.py runs consistently outside Docker

- Prevent returning invalid sample types - Raise error or safely skip unreadable images

sushantkhemalapure · 2026-03-27T16:50:26Z

Handled invalid image cases to avoid breaking the DataLoader. Let me know if you'd prefer a different strategy (skip vs error).

sushantkhemalapure added 7 commits March 23, 2026 19:42

Improved README with pipeline explanation and beginner-friendly setup

e61addf

Fix: improve README formatting, clarity, and consistency

a514e65

Add benchmark table for OCR models in README and improve documentatio…

efd03a3

…n clarity

Fix incorrect OCR model path in Tk apps

bbe9ba8

- Updated TrOCR model path from ../../weights to ../../models - Ensures correct model loading instead of CRAFT weights

Fix incorrect width/height assignment in image rotation

749e41c

- Corrected unpacking of image.shape[:2] to (height, width) - Prevents distortion in rotated images

Fix import path in qapp.py for non-Docker environments

17336ad

- Added sys.path handling to locate CRAFT module - Ensures qapp.py runs consistently outside Docker

Fix corrupt image handling in dataset

42cb811

- Prevent returning invalid sample types - Raise error or safely skip unreadable images

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix corrupt image handling in custom dataset#126

Fix corrupt image handling in custom dataset#126
sushantkhemalapure wants to merge 7 commits intohumanai-foundation:mainfrom
sushantkhemalapure:fix/dataset-corrupt-image-handling

sushantkhemalapure commented Mar 27, 2026

Uh oh!

sushantkhemalapure commented Mar 27, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

sushantkhemalapure commented Mar 27, 2026

Issue

Fix

Impact

Testing

Uh oh!

sushantkhemalapure commented Mar 27, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant