Check if reader isn’t null (formatted text extraction is supported for the document);
Read a text from reader.
The following example shows how to extract a document text as HTML text:
// Create an instance of Parser class
try(Parserparser=newParser(Constants.SampleDocx)){// Extract a formatted text into the reader
try(TextReaderreader=parser.getFormattedText(newFormattedTextOptions(FormattedTextMode.Html))){// Print a formatted text from the document
// If formatted text extraction isn't supported, a reader is null
System.out.println(reader==null?"Formatted text extraction isn't suppported":reader.readToEnd());}}
More resources
GitHub examples
You may easily run the code above and see the feature in action in our GitHub examples: