GroupDocs.Parser for Java 21.2 Release Notes

Full List of Issues Covering all Changes in this Release

KeySummaryCategory
PARSERNET-1734Improve text page extraction from WordProcessing documentsImprovement
PARSERJAVA-152Document parsing performance issueBug

Public API and Backward Incompatible Changes

Improve text page extraction from WordProcessing documents

Description

This improvement enhanced the work with documents that contain sections, footers, headers and footnotes.

Public API changes

No API changes.

Document parsing performance issue

Description

This fix improves the performance of raw text extraction from PDF documents.

Public API changes

No API changes.

Usage

The following code shows how to extract a raw text from PDF:

// Create an instance of Parser class
try (Parser parser = new Parser(filePath)) {
    // Create a text reader object to read a text with TextOptions for raw text extraction
    try (TextReader reader = parser.getText(new TextOptions(true))) {
        // Extract the whole text from the file
        String text = reader.readToEnd();
        // Print the text to the console
        System.out.println(text);
    }
}