-
Notifications
You must be signed in to change notification settings - Fork 190
feat: Update Tesseract installation to include all language packs #183
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
base: main
Are you sure you want to change the base?
Conversation
Description: Replaced specific installation of tesseract-langpack-eng with tesseract-langpack-\* to ensure support for all available OCR languages. This improves flexibility for multilingual OCR processing without requiring manual additions for each language. Changes: Updated os-packages.txt: replaced tesseract-langpack-eng with tesseract-langpack-\* Ensures all Tesseract language packs are installed via wildcard in dnf Note: Wildcard is escaped (\*) to prevent shell expansion and allow dnf to interpret it correctly. Signed-off-by: Athanasios Oikonomou <athoik@gmail.com>
Merge ProtectionsYour pull request matches the following merge protections and will not be merged until they are valid. 🟢 Enforce conventional commitWonderful, this rule succeeded.Make sure that we follow https://www.conventionalcommits.org/en/v1.0.0/
|
@athoik do you know what is the added size for the container image? |
All the OCR languages will add an extra 685MB
If that's a problem, I can add only Greek language I am interesting to perform OCR. |
@dolfim-ibm could be specific to 2 cpu images? |
Merge ProtectionsYour pull request matches the following merge protections and will not be merged until they are valid. 🟢 Enforce conventional commitWonderful, this rule succeeded.Make sure that we follow https://www.conventionalcommits.org/en/v1.0.0/
|
Description:
Replaced specific installation of tesseract-langpack-eng with tesseract-langpack-* to ensure support for all available OCR languages. This improves flexibility for multilingual OCR processing without requiring manual additions for each language.
Changes:
Updated os-packages.txt: replaced tesseract-langpack-eng with tesseract-langpack-*
Ensures all Tesseract language packs are installed via wildcard in dnf
Note:
Wildcard is escaped (*) to prevent shell expansion and allow dnf to interpret it correctly.