Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Extending OCR Model Support for Additional Languages: Fine-tuning or Alternative Solutions?" #499

Open
parth1313 opened this issue Jan 31, 2025 · 0 comments

Comments

@parth1313
Copy link

Hi Everyone,

The model is providing excellent OCR results for the languages it has been trained on. However, I want to extend its capabilities to perform OCR for other languages as well.

What is the best approach to achieve this?
Should I fine-tune the vision encoder, the LLM, or both? Or is there an alternative approach I should consider?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant