The following example shows how to extract plain formatted text:
// Create an instance of Parser class
try(Parserparser=newParser(Constants.SampleDocx)){// Extract a formatted text into the reader
try(TextReaderreader=parser.getFormattedText(newFormattedTextOptions(FormattedTextMode.PlainText))){// Print a formatted text from the document
// If formatted text extraction isn't supported, a reader is null
System.out.println(reader==null?"Formatted text extraction isn't suppported":reader.readToEnd());}}
The following Microsoft Word document is used as input document:
The following plain text document is extracted using the example above:
More resources
GitHub examples
You may easily run the code above and see the feature in action in our GitHub examples: