Extending OCR Model Support for Additional Languages: Fine-tuning or Alternative Solutions?" #499

parth1313 · 2025-01-31T06:33:46Z

Hi Everyone,

The model is providing excellent OCR results for the languages it has been trained on. However, I want to extend its capabilities to perform OCR for other languages as well.

What is the best approach to achieve this?
Should I fine-tune the vision encoder, the LLM, or both? Or is there an alternative approach I should consider?

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Extending OCR Model Support for Additional Languages: Fine-tuning or Alternative Solutions?" #499

Extending OCR Model Support for Additional Languages: Fine-tuning or Alternative Solutions?" #499

parth1313 commented Jan 31, 2025

Extending OCR Model Support for Additional Languages: Fine-tuning or Alternative Solutions?" #499

Extending OCR Model Support for Additional Languages: Fine-tuning or Alternative Solutions?" #499

Comments

parth1313 commented Jan 31, 2025