This component focuses on detecting Hiragana characters using a CNN model. The dataset is sourced from ETL文字データベース, and the model is trained with a modified size of 32x32 images.
1.CNN model
- Feature Extraction: Kernel size=3, Strides=1, Filters=32, Activation=ReLU
- Pooling: MaxPooling2D
- Optimizer: RMSprop()
- Loss Function: Sparse Categorical Crossentropy
- Accuracy: 98.11%
- AlexNet model
- 5 convolutional layers.
- 3 fully connected layers
- Optimizer: RMSprop()
- Loss Function: Sparse Categorical Crossentropy
- Accuracy: 99.38%
Notebook for recognition model
The OCR model detects contours and generates character images for the CNN model to process. Initially attempted without pyocr, it faced issues with connected lines in characters. The morphological dilation transformation in OpenCV solved this problem.
- Contour Detection
- Morphological Dilation
- Adjusting Rectangle Area for Image Generation
OCR_1.ipynb (basic OCR model without adjustment),
OCR_2.ipynb (OCR model with adjustment such as morphological Dilation, adjusting rectangle area)
Models built on Ubuntu VM and Colab are made available for local use, addressing issues with M1 chip MacBooks using Miniforge and virtual environments.
An automated flashcard generator that uses the Jisho API to retrieve word meanings. Flashcards are then exported into an Anki deck (output.apkg) for easy import.
- Input words or use a picture for word retrieval.
- Jisho API fetches word meanings.
- Anki flashcards are generated and exported.