[Feature]: Feature Request - Use Google Document AI or Vision AI instead of Tesseract #1434

epatels · 2024-11-18T12:21:41Z

Describe the proposed feature

Hi,

I know the Tesseract OCR engine is free. Unfortunately, it is not very good, especially when performing OCR on Indian languages.

This is where Google Document AI and Google Vision AI excel. I understand there is a cost involved in using these services.

I am looking for a solution that performs the underlying OCR step using the Google Document AI or Google Vision AI OCR engine. The rest of the pipeline can remain unmodified, with the output being a searchable PDF (in Indian languages).

grantbarrett · 2025-04-24T04:04:48Z

I have forked and updated kkrell2016’s pre-existing Google Vision OCRmyPDF plugin and am happy to report that it works well. See here: https://github.com/grantbarrett/son-of-ocrmypdf_plugin_GoogleVision. I recognize that using a paid service like Google goes against the spirit of open source, but Google Vision OCR is very, very good, so the compromise is worth it to me.
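For anyone who wants to try a plugin like this, here is a minimal sketch of how an external plugin is usually passed to OCRmyPDF through its `--plugin` command-line option. The file names below (`scan.pdf`, `scan_searchable.pdf`, `google_vision_plugin.py`) are hypothetical placeholders, not the actual names used in the linked repository, and the plugin presumably also needs Google Cloud credentials configured; check its README for the real file name and setup steps.

```python
import shlex


def build_ocrmypdf_command(input_pdf, output_pdf, plugin_path):
    """Return the argv list for running OCRmyPDF with an external plugin.

    All three arguments are caller-supplied paths; nothing here is specific
    to any particular plugin.
    """
    return [
        "ocrmypdf",
        "--plugin", str(plugin_path),  # load the OCR engine plugin from this file
        str(input_pdf),
        str(output_pdf),
    ]


if __name__ == "__main__":
    # Hypothetical file names, for illustration only.
    cmd = build_ocrmypdf_command(
        "scan.pdf", "scan_searchable.pdf", "google_vision_plugin.py"
    )
    # Print the command rather than running it, so it can be reviewed first.
    print(shlex.join(cmd))
```

The sketch only builds and prints the command; once it looks right, it could be run with `subprocess.run(cmd, check=True)`.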

epatels · 2025-04-24T05:34:39Z

Oh, wow. That's really great. Can't wait to test it ASAP.

Thanks and regards,
Chandrakant Patel
Mumbai


epatels added the enhancement and triage (Issue needs triage) labels Nov 18, 2024

epatels assigned jbarlow83 Nov 18, 2024
