The following example shows how to extract Markdown formatted text:
// Create an instance of Parser class
try(Parserparser=newParser(Constants.SampleDocx)){// Extract a formatted text into the reader
try(TextReaderreader=parser.getFormattedText(newFormattedTextOptions(FormattedTextMode.Markdown))){// Print a formatted text from the document
// If formatted text extraction isn't supported, a reader is null
System.out.println(reader==null?"Formatted text extraction isn't suppported":reader.readToEnd());}}
The API supports the following formatting:
Bold text
Italic text
Hyperlinks
Headings
Numbering and bullets lists
Tables
The following Microsoft Word document is used as input document:
The following Markdown document is extracted using the example above:
More resources
GitHub examples
You may easily run the code above and see the feature in action in our GitHub examples: