GitHub - bakartikey/OCR-ICR-on-documents: Extracting data from documents

OCR/ICR on documents

Objective:

The goal is to find an algorithm that can extract the maximum information from a given page

I broke the process in to the following 6 steps:

Character isolation
Noise reduction
Boundary removal
Normalising
Thinning
Feature extraction

Challenges:

There were many challenges to overcome.

Black Border Removal
ICR (Intelligent Character Recognition): recognize and convert hand-drawn characters into text
Scanned page (Detect edges and apply a perspective transform to obtain the top-down view of the document)
Remove noise
Shape detection and extraction
OCR
Handwriting recognition
Minimize errors But the main problem was to “identify which part of the form contains text”.

My Approach

Input image => Detecting orientation of Image => Detecting and fixing skew angle => Removing form/table structure => Removing noise and making text clearer => Applying OCR and handwriting recognition

Name		Name	Last commit message	Last commit date
Latest commit History 8 Commits
.ipynb_checkpoints		.ipynb_checkpoints
code		code
result		result
.DS_Store		.DS_Store
README.md		README.md
input.png		input.png

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

About

Releases

Packages

Languages

bakartikey/OCR-ICR-on-documents

Folders and files

Latest commit

History

Repository files navigation

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages