OCR Project

1. CNN Handwritten Recognition

Overview

This component recognizes handwritten Hiragana characters using a CNN model. The dataset comes from the ETL Character Database (ETL文字データベース), and the models are trained on images resized to 32x32 pixels.

Model Architecture

CNN model

Feature Extraction: Kernel size=3, Strides=1, Filters=32, Activation=ReLU
Pooling: MaxPooling2D
Optimizer: RMSprop()
Loss Function: Sparse Categorical Crossentropy
Accuracy: 98.11%
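
To make the layer settings above concrete, the following sketch works out the feature-map size and parameter count for one such convolution followed by max pooling. The single grayscale input channel, the absence of padding, and the 2x2 pooling window are assumptions for illustration; the README does not state them, and the real model may differ.

```python
def conv_output_size(size, kernel, stride, padding=0):
    """Spatial output size of a convolution or pooling window along one axis."""
    return (size + 2 * padding - kernel) // stride + 1


def conv_param_count(kernel, in_channels, filters):
    """Trainable weights of a 2D convolution: one kernel per filter plus one bias each."""
    return kernel * kernel * in_channels * filters + filters


# Values from the README: 32x32 input, kernel size 3, stride 1, 32 filters.
INPUT_SIZE, KERNEL, STRIDE, FILTERS = 32, 3, 1, 32
# Assumptions (not stated in the README): grayscale input, no padding, 2x2 pooling.
IN_CHANNELS, POOL = 1, 2

conv_size = conv_output_size(INPUT_SIZE, KERNEL, STRIDE)  # 30
pool_size = conv_output_size(conv_size, POOL, POOL)       # 15
params = conv_param_count(KERNEL, IN_CHANNELS, FILTERS)   # 320

print(f"after conv: {conv_size}x{conv_size}x{FILTERS}")
print(f"after pool: {pool_size}x{pool_size}x{FILTERS}")
print(f"conv parameters: {params}")
```

Under these assumptions a single convolution shrinks the 32x32 input to 30x30, and pooling halves it to 15x15.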

AlexNet model

5 convolutional layers
3 fully connected layers
Optimizer: RMSprop()
Loss Function: Sparse Categorical Crossentropy
Accuracy: 99.38%
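
Both models use sparse categorical cross-entropy, which takes the true class as an integer index rather than a one-hot vector. The sketch below computes it for a single sample in plain Python; the three-class probability vector is a made-up example, not output from either model.

```python
import math


def sparse_categorical_crossentropy(probs, label):
    """Loss for one sample: negative log of the probability assigned to the true class."""
    return -math.log(probs[label])


# Hypothetical softmax output over three classes; the true class index is 1.
probs = [0.1, 0.7, 0.2]
loss = sparse_categorical_crossentropy(probs, 1)
print(round(loss, 4))  # 0.3567
```

The loss approaches 0 as the probability of the true class approaches 1, and grows without bound as that probability approaches 0.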

Notebook for the recognition model: recognition_modeling.ipynb

2. OCR Model

Overview

The OCR model detects contours and crops each one into a character image for the CNN model to classify. The first version, built without pyocr, had trouble with connected lines in characters. Applying OpenCV's morphological dilation solved this problem.
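
As an illustration of why dilation helps contour detection, the sketch below implements binary dilation with a 3x3 square structuring element in plain Python and counts connected regions before and after. It assumes the problem was that strokes of one character were picked up as separate regions, which is only one reading of the description above. In the project itself the operation is done with OpenCV (`cv2.dilate`); this pure-Python version is a stand-in that shows the effect on a toy image.

```python
def dilate(grid, iterations=1):
    """Binary dilation with a 3x3 square structuring element."""
    h, w = len(grid), len(grid[0])
    for _ in range(iterations):
        out = [[0] * w for _ in range(h)]
        for y in range(h):
            for x in range(w):
                # A pixel turns on if any pixel in its 3x3 neighbourhood is on.
                out[y][x] = int(any(
                    grid[ny][nx]
                    for ny in range(max(0, y - 1), min(h, y + 2))
                    for nx in range(max(0, x - 1), min(w, x + 2))
                ))
        grid = out
    return grid


def count_components(grid):
    """Count 8-connected groups of 'on' pixels with an iterative flood fill."""
    h, w = len(grid), len(grid[0])
    seen = set()
    count = 0
    for y in range(h):
        for x in range(w):
            if grid[y][x] and (y, x) not in seen:
                count += 1
                seen.add((y, x))
                stack = [(y, x)]
                while stack:
                    cy, cx = stack.pop()
                    for ny in range(max(0, cy - 1), min(h, cy + 2)):
                        for nx in range(max(0, cx - 1), min(w, cx + 2)):
                            if grid[ny][nx] and (ny, nx) not in seen:
                                seen.add((ny, nx))
                                stack.append((ny, nx))
    return count


# Toy image: two strokes of one character separated by a two-pixel gap.
image = [
    [0, 0, 0, 0, 0, 0, 0, 0, 0, 0],
    [0, 0, 0, 0, 0, 0, 0, 0, 0, 0],
    [0, 0, 1, 1, 0, 0, 1, 1, 0, 0],
    [0, 0, 0, 0, 0, 0, 0, 0, 0, 0],
    [0, 0, 0, 0, 0, 0, 0, 0, 0, 0],
]

print(count_components(image))          # 2: the strokes are separate regions
print(count_components(dilate(image)))  # 1: dilation closes the gap
```

Each dilation pass grows every stroke by one pixel in all directions, so nearby strokes fuse into one region that a contour detector can treat as a single character.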

Model Details

Contour Detection
Morphological Dilation
Adjusting Rectangle Area for Image Generation
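
The README does not say how the rectangle area is adjusted. One common approach, sketched here as an assumption, is to pad each bounding box by a fixed margin and clamp it to the image bounds so the cropped character is not cut off at the edges. The function name `expand_rect` and the margin value are hypothetical.

```python
def expand_rect(x, y, w, h, margin, image_width, image_height):
    """Grow a bounding box by `margin` pixels per side, clamped to the image."""
    left = max(0, x - margin)
    top = max(0, y - margin)
    right = min(image_width, x + w + margin)
    bottom = min(image_height, y + h + margin)
    return left, top, right - left, bottom - top


# Hypothetical boxes in the (x, y, width, height) form that cv2.boundingRect returns.
print(expand_rect(10, 5, 20, 20, margin=4, image_width=100, image_height=100))
# (6, 1, 28, 28)
print(expand_rect(2, 2, 20, 20, margin=4, image_width=100, image_height=100))
# (0, 0, 26, 26): the box is clamped at the top-left corner
```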

OCR_1.ipynb (basic OCR model without adjustments)

OCR_2.ipynb (OCR model with adjustments such as morphological dilation and rectangle-area adjustment)

3. Model Deployment

Transporting Models to Local Environment

Models trained on an Ubuntu VM and on Colab are made available for local use. Issues on M1 chip MacBooks are addressed by using Miniforge and virtual environments.

4. Automated Flashcard Generator

Overview

This component is an automated flashcard generator that uses the Jisho API to retrieve word meanings. The flashcards are then exported to an Anki deck (output.apkg) for easy import.

Usage

Input words or use a picture for word retrieval.
Jisho API fetches word meanings.
Anki flashcards are generated and exported.
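
The lookup step above can be sketched with the standard library alone. The sketch assumes the public Jisho search endpoint and the `data`, `japanese`, and `senses` fields of its JSON response. The helper names are hypothetical, and the sample response is hand-written so the example runs offline. For simplicity it formats each card as a tab-separated line instead of packaging an .apkg deck, so it is not the export path used in this project.

```python
import json
from urllib.parse import quote
from urllib.request import urlopen

JISHO_SEARCH_URL = "https://jisho.org/api/v1/search/words?keyword="


def build_search_url(word):
    """URL-encode the word and append it to the Jisho search endpoint."""
    return JISHO_SEARCH_URL + quote(word)


def first_meaning(response):
    """Return (reading, definitions) from the first result, or None if there is none."""
    if not response.get("data"):
        return None
    entry = response["data"][0]
    reading = entry["japanese"][0].get("reading", "")
    definitions = entry["senses"][0]["english_definitions"]
    return reading, "; ".join(definitions)


def lookup(word):
    """Query Jisho over the network (defined for completeness, not called here)."""
    with urlopen(build_search_url(word)) as resp:
        return first_meaning(json.load(resp))


def flashcard_row(word, reading, meaning):
    """One tab-separated line: front field, then back field."""
    return f"{word}\t{reading} - {meaning}"


# Hand-written sample shaped like a Jisho response, so the example runs offline.
sample = {"data": [{"japanese": [{"word": "猫", "reading": "ねこ"}],
                    "senses": [{"english_definitions": ["cat"]}]}]}
reading, meaning = first_meaning(sample)
print(flashcard_row("猫", reading, meaning))
```

Keeping the parsing separate from the network call makes it possible to check the card format without contacting the API.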

Links: Jisho API, Anki API

yusuke-satani/OCR_project
