OCR - Expose document writer with OCR capabilities #128

### Rationale
In [one of my apps](https://github.com/rimerosolutions/entrusted), I started replacing a few libraries with MuPDF and noticed that MuPDF has [Tesseract](https://tesseract-ocr.github.io/tessdoc/Home.html) support. I found this out through _**duplicate symbol errors**_ with `Leptonica` in my app, only on Windows.
- My app uses [leptonica-sys](https://crates.io/crates/leptonica-sys) with [tesseract-sys](https://crates.io/crates/tesseract-sys). I was able to "resolve" the linkage errors with `RUSTFLAGS` (there is no issue on Linux and Mac OS).
- This led me to check whether I could just use Tesseract through MuPDF and compile with `crt-static` flags ([without additional flags such as -C link-arg=/FORCE:MULTIPLE](https://github.com/rimerosolutions/entrusted/issues/99#issuecomment-2800062605)).

### Goal
I believe that most people don't need `tesseract-rs` or its related `sys` crates, which expose more advanced features. With `mupdf-rs` I can drop 5 dependencies in one of my apps (`poppler-rs`, `cairo-rs`, `lopdf`, `tesseract-sys`, `leptonica-sys`).

### Proposal
I can send a pull request for the following:

- Expose a new method on [DocumentWriter](https://docs.rs/mupdf/0.4.4/mupdf/document_writer/struct.DocumentWriter.html) in `mupdf-rs` to enable OCR via [fz_new_pdfocr_writer](https://docs.rs/mupdf-sys/0.4.4/mupdf_sys/fn.fz_new_pdfocr_writer.html).
- Applications that optionally perform OCR would then need only a trivial branch, because the OCR writer does not require a different data structure.
- I suppose this would need to be guarded by the `tesseract` feature in the code.
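
To illustrate the "trivial branch" point, here is a rough, self-contained sketch. It does not use `mupdf-rs` itself: `DocumentWriter` below is a stand-in struct, `with_ocr` is a hypothetical name for the proposed method, and `open_writer` is a hypothetical application helper. The argument lists are assumptions that only loosely follow the existing constructor and `fz_new_pdfocr_writer`.

```rust
/// Stand-in for `mupdf::DocumentWriter` (NOT the real type; the real one
/// wraps a native MuPDF writer handle).
#[derive(Debug)]
pub struct DocumentWriter {
    path: String,
    options: String,
    ocr: bool,
}

impl DocumentWriter {
    /// Loosely follows the shape of the existing constructor
    /// (path, format, options). The format is ignored by this stand-in.
    pub fn new(path: &str, _format: &str, options: &str) -> Result<Self, String> {
        Ok(Self { path: path.to_owned(), options: options.to_owned(), ocr: false })
    }

    /// HYPOTHETICAL method name for this proposal. In `mupdf-rs` it would
    /// call `fz_new_pdfocr_writer` and presumably sit behind
    /// `#[cfg(feature = "tesseract")]`.
    pub fn with_ocr(path: &str, options: &str) -> Result<Self, String> {
        Ok(Self { path: path.to_owned(), options: options.to_owned(), ocr: true })
    }

    /// Helper for this sketch only, to observe which constructor ran.
    pub fn is_ocr(&self) -> bool {
        self.ocr
    }

    /// Helper for this sketch only.
    pub fn describe(&self) -> String {
        format!("{} [{}] ocr={}", self.path, self.options, self.ocr)
    }
}

/// Hypothetical application helper: both arms return the same type, so the
/// rest of the pipeline does not care whether OCR is enabled.
pub fn open_writer(path: &str, options: &str, want_ocr: bool) -> Result<DocumentWriter, String> {
    if want_ocr {
        DocumentWriter::with_ocr(path, options)
    } else {
        DocumentWriter::new(path, "pdf", options)
    }
}
```

Because both branches produce a `DocumentWriter`, the code that begins pages, runs them through a device, and closes the writer would stay identical for OCR and non-OCR output.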

### Other notes
I did some quick tests under Windows 11 and it works fine. I'll test under Linux and Mac OS soon, before submitting a pull request.

