Extracting the text from speech bubbles from manga
A page can be translated by running the below commamd
python easyocr_translate.py --img_path IMG_PATH --out_path OUT_PATH
Its a work in progress, the quality of cleaning and translation is very poor
After training the efficientdet for 3 epochs on the Manga109 dataset
Google translation with easyocr
I am using the scripts in src folder from signatrix implementation of efficientdet which is available here
I also used the manga109api to save time on parsing the manga109 dataset
- Enhancing the text containing parts of the image using morphological operations for improving the OCR's performance
- Improve Cleaning
- Improve text placement