Although GroupDocs.Redaction does not ship with an OCR engine of its own, it lets you integrate any paid or free OCR solution.
To do so, implement the IOcrConnector interface and its recognize() method, which takes a stream containing an image and returns a structured representation of the recognized text, including bounding rectangles.
Java
public class MyOwnOcrConnector implements IOcrConnector {
    public MyOwnOcrConnector() {
    }

    public RecognizedImage recognize(InputStream imageStream) {
        // TODO Create an instance of RecognizedImage class using OCR result returned by your OCR toolkit
    }
}
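Most OCR toolkits report individual words with their positions, so a connector usually has to assemble them into lines before building the structured result. The following is a minimal sketch of that step only. The Box, OcrWord, OcrLine and OcrLayout types are hypothetical names invented for this illustration; they are not GroupDocs.Redaction classes, and mapping the result onto RecognizedImage is left to your connector. The grouping rule (a word joins the current line when its vertical centre falls inside that line's box) is an assumption that suits simple, unrotated layouts.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical helper types for illustration; NOT part of GroupDocs.Redaction.
final class Box {
    final int x, y, width, height;

    Box(int x, int y, int width, int height) {
        this.x = x;
        this.y = y;
        this.width = width;
        this.height = height;
    }

    // Smallest box that contains both this box and the other one.
    Box union(Box other) {
        int left = Math.min(x, other.x);
        int top = Math.min(y, other.y);
        int right = Math.max(x + width, other.x + other.width);
        int bottom = Math.max(y + height, other.y + other.height);
        return new Box(left, top, right - left, bottom - top);
    }
}

final class OcrWord {
    final String text;
    final Box box;

    OcrWord(String text, Box box) {
        this.text = text;
        this.box = box;
    }
}

final class OcrLine {
    final List<OcrWord> words = new ArrayList<>();
    Box box; // bounding rectangle of the whole line, grows as words are added

    void add(OcrWord word) {
        words.add(word);
        box = (box == null) ? word.box : box.union(word.box);
    }

    String text() {
        StringBuilder sb = new StringBuilder();
        for (OcrWord w : words) {
            if (sb.length() > 0) {
                sb.append(' ');
            }
            sb.append(w.text);
        }
        return sb.toString();
    }
}

final class OcrLayout {
    // Groups words, assumed to arrive in reading order, into lines.
    static List<OcrLine> groupIntoLines(List<OcrWord> words) {
        List<OcrLine> lines = new ArrayList<>();
        OcrLine current = null;
        for (OcrWord w : words) {
            int centerY = w.box.y + w.box.height / 2;
            boolean sameLine = current != null
                    && centerY >= current.box.y
                    && centerY <= current.box.y + current.box.height;
            if (!sameLine) {
                current = new OcrLine();
                lines.add(current);
            }
            current.add(w);
        }
        return lines;
    }
}
```

Each resulting line carries both its text and a rectangle covering all of its words, which is the kind of information recognize() is expected to return.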
Once an instance of the connector is passed to the RedactorSettings constructor, GroupDocs.Redaction uses it for image files and embedded images during ordinary text redaction.
Java
try (Redactor redactor = new Redactor("\\Sample.docx", new LoadOptions(), new RedactorSettings(new MyOwnOcrConnector()))) { // Assign an instance before using Redactor
    redactor.apply(new ExactPhraseRedaction("John Doe", new ReplacementOptions(java.awt.Color.BLACK)));
    redactor.save();
}
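To see why the connector must return bounding rectangles, it may help to look at how an exact phrase could be located in recognized text. The sketch below is purely conceptual and does not describe GroupDocs.Redaction's internal implementation. WordBox and PhraseLocator are hypothetical names, and the matching rule (case-sensitive comparison of consecutive whitespace-separated words) is an assumption made for brevity.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical type for illustration; NOT part of GroupDocs.Redaction.
final class WordBox {
    final String text;
    final int x, y, width, height;

    WordBox(String text, int x, int y, int width, int height) {
        this.text = text;
        this.x = x;
        this.y = y;
        this.width = width;
        this.height = height;
    }
}

final class PhraseLocator {
    // Returns the boxes of every run of consecutive words that spells out the phrase.
    static List<WordBox> locate(List<WordBox> words, String phrase) {
        String[] parts = phrase.trim().split("\\s+");
        List<WordBox> hits = new ArrayList<>();
        for (int i = 0; i + parts.length <= words.size(); i++) {
            boolean match = true;
            for (int j = 0; j < parts.length; j++) {
                if (!words.get(i + j).text.equals(parts[j])) {
                    match = false;
                    break;
                }
            }
            if (match) {
                hits.addAll(words.subList(i, i + parts.length));
                i += parts.length - 1; // continue after the matched run
            }
        }
        return hits;
    }
}
```

The rectangles of the returned words are the areas of the image that would need to be covered, for example with a black box, to hide the phrase.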
Along with the full-featured Java library, we provide simple but powerful free apps.
You are welcome to redact documents in various formats, such as PDF, DOC, DOCX, PPT, PPTX, XLS, XLSX, emails and more, with our Free Online Document Redaction App.