Get supported features Leave feedback

The set of the supported features depends on the document format. GroupDocs.Parser provides the functionality to check if feature supported for the document. getFeatures property is used for this purposes.

Features class has the following members:

Member	Description
isFeatureSupported(String)	Returns the value that indicates whether the feature is supported.
isText	The value that indicates whether text extraction is supported.
isTextPage	The value that indicates whether text page extraction is supported.
isFormattedText	The value that indicates whether formatted text extraction is supported.
isFormattedTextPage	The value that indicates whether formatted text page extraction is supported.
isSearch	The value that indicates whether text search is supported.
isHighlight	The value that indicates whether highlight extraction is supported.
isStructure	The value that indicates whether text structure extraction is supported.
isToc	The value that indicates whether table of contents extraction is supported.
isContainer	The value that indicates whether container extraction is supported.
isMetadata	The value that indicates whether metadata extraction is supported.
isTextAreas	The value that indicates whether text areas extraction is supported.
isImages	The value that indicates whether images extraction is supported.
isParseByTemplate	The value that indicates whether parsing by template is supported.
isParseForm	The value that indicates whether form parsing is supported.

Here are the steps for check if feature is supported:

Instantiate Parser object for the initial document;
Call corresponding property of getFeatures property to check if the feature is supported.

The following example shows how to check if text extraction feature is supported:

// Create an instance of Parser class
try (Parser parser = new Parser(Constants.SampleZip)) {
    // Check if text extraction is supported for the document
    if (!parser.getFeatures().isText()) {
        System.out.println("Text extraction isn't supported");
        return;
    }
    // Extract a text from the document
    try (TextReader reader = parser.getText()) {
        System.out.println(reader.readToEnd());
    }
}

If the feature isn’t supported, the method returns null instead of the value. So if checking of Features properties is omitted, result is checked for null:

// Create an instance of Parser class
try (Parser parser = new Parser(Constants.SampleZip)) {
    // Extract a text into the reader
    try (TextReader reader = parser.getText()) {
        // Print a text from the document
        // If text extraction isn't supported, a reader is null
        System.out.println(reader == null ? "Text extraction isn't supported" : reader.readToEnd());
    }
}

This example prints “Text extraction isn’t supported” because there is no text in zip-archive.

Some operations may consume significant time. So it’s not optimal to call the method to just check the support for the feature. For this purpose getFeatures property is used.

More resources

Advanced usage topics

To learn more about document data extraction features and get familiar how to extract text, images, forms and more, please refer to the advanced usage section.

GitHub examples

You may easily run the code above and see the feature in action in our GitHub examples:

Free online document parser App

Along with full featured Java library we provide simple, but powerful free Apps.

You are welcome to extract data from PDF, DOC, DOCX, PPT, PPTX, XLS, XLSX, Emails and more with our free online Free Online Document Parser App.

We value your opinion. Your feedback will help us improve our documentation.

Get supported features Leave feedback

More resources

Advanced usage topics

GitHub examples

Free online document parser App

Was this page helpful?

Any additional feedback you'd like to share with us?

Please tell us how we can improve this page.

Thank you for your feedback!