Refactor: Centralize image preprocessing pipeline to resolve redundant OpenCV reads #129

Open
abhiram123467 wants to merge 6 commits into humanai-foundation:main from abhiram123467:fix/127-refactor-preprocessing
@abhiram123467

This PR addresses Issue #127 by cleaning up scattered and duplicated image preprocessing code across the repository. Previously, various scripts and Jupyter notebooks each implemented their own cv2.imread and grayscale conversion logic, which risked shape-unpacking bugs when images varied in channel depth.
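
To illustrate the shape-unpacking problem mentioned above: an array returned by cv2.imread can be 2-D (grayscale) or 3-D (color, possibly with an alpha channel), so `h, w, c = img.shape` fails on grayscale input. The sketch below is a minimal guard, not the code from this PR; the name `image_dims` is hypothetical and it works on a plain shape tuple so it runs without OpenCV installed.

```python
from typing import Tuple


def image_dims(shape: Tuple[int, ...]) -> Tuple[int, int, int]:
    """Return (height, width, channels) for a 2-D or 3-D image shape.

    Hypothetical helper for illustration; not taken from the PR.
    """
    if len(shape) == 2:
        # Grayscale arrays have no channel axis, so report one channel.
        height, width = shape
        return height, width, 1
    if len(shape) == 3:
        # Color arrays carry the channel count as the last axis.
        height, width, channels = shape
        return height, width, channels
    raise ValueError(f"unsupported image shape: {shape!r}")


if __name__ == "__main__":
    print(image_dims((480, 640)))     # grayscale shape
    print(image_dims((480, 640, 3)))  # 3-channel shape
```

In a shared preprocessing module, a call such as `image_dims(img.shape)` could replace each ad hoc unpacking, so every script handles channel depth the same way.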

Adds Setup and How to Run sections to README.md with clear instructions for new contributors.
Updated ResNet18 and ResNet34 to classify 3 classes instead of 1000. Added fully connected layers and flattening before classification.

This module provides LLM post-processing for OCR output, improving accuracy by correcting errors in historical Spanish documents. It includes functions for computing character error rates, cleaning raw OCR text, and processing batches of OCR strings.
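
As a rough illustration of the character error rate mentioned above, the sketch below uses the common definition: Levenshtein edit distance between the reference and the OCR output, divided by the reference length. The function names are hypothetical, and the module's actual implementation may differ (for example in normalization or whitespace handling).

```python
def edit_distance(a: str, b: str) -> int:
    """Levenshtein distance between two strings (insert, delete, substitute)."""
    previous = list(range(len(b) + 1))
    for i, ch_a in enumerate(a, start=1):
        current = [i]
        for j, ch_b in enumerate(b, start=1):
            cost = 0 if ch_a == ch_b else 1
            current.append(
                min(
                    previous[j] + 1,         # deletion from a
                    current[j - 1] + 1,      # insertion into a
                    previous[j - 1] + cost,  # substitution or match
                )
            )
        previous = current
    return previous[-1]


def character_error_rate(reference: str, hypothesis: str) -> float:
    """Edit distance divided by reference length (assumed CER definition)."""
    if not reference:
        raise ValueError("reference text must be non-empty")
    return edit_distance(reference, hypothesis) / len(reference)


if __name__ == "__main__":
    print(character_error_rate("señor", "senor"))
```

A lower value means the OCR text is closer to the reference; the value can exceed 1.0 when the hypothesis is much longer than the reference.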