Extract metadata from Microsoft Office PowerPoint presentations Leave feedback

To extract metadata from Microsoft Office PowerPoint presentations getMetadata method is used. This method allows to extract the following metadata:

Name	Description
title	The title of the presentation.
subject	The subject of the presentation.
keywords	The keyword of the presentation.
comments	The comments of the presentation.
content-status	The content status of the presentation.
category	The category of the presentation.
company	The company of the presentation.
manager	The manager of the presentation.
author	The name of the presentation’s author.
last-author	The name of the last presentation’s author.
hyperlink-base	The base string used for evaluating relative hyperlinks in this presentation.
application	The name of the application.
application-version	The version number of the application that created the presentation.
created-time	The time of the presentation creation.
last-saved-time	The time of the the presentation when it was last saved.
last-printed-time	The time of the presentation when it was last printed.
revision-number	The presentation revision number.
total-editing-time	The total editing time in minutes.

Here are the steps to extract metadata from Microsoft Office PowerPoint presentation:

Instantiate Parser object for the initial presentation;
Call getMetadata method and obtain collection of document metadata objects;
Iterate through the collection and get metadata names and values.

Warning
getMetadata method returns null value if metadata extraction isn’t supported for the document. For example, metadata extraction isn’t supported for TXT files. Therefore, for TXT file getMetadata method returns null. If Microsoft Office PowerPoint presentation has no metadata, getMetadata method returns an empty collection.

The following example demonstrates how to extract metadata from PowerPoint presentation:

// Create an instance of Parser class
try (Parser parser = new Parser(Constants.SamplePptx)) {
    // Extract metadata from the presentation
    Iterable<MetadataItem> metadata = parser.getMetadata();
    // Iterate over metadata items
    for (MetadataItem item : metadata) {
        // Print an item name and value
        System.out.println(String.format("%s: %s", item.getName(), item.getValue()));
    }
}

More resources

GitHub examples

You may easily run the code above and see the feature in action in our GitHub examples:

Free online document parser App

Along with full featured .NET library we provide simple, but powerful free Apps.

You are welcome to parse documents and extract data from PDF, DOC, DOCX, PPT, PPTX, XLS, XLSX, Emails and more with our free online Free Online Document Parser App.

We value your opinion. Your feedback will help us improve our documentation.

Extract metadata from Microsoft Office PowerPoint presentations Leave feedback

More resources

GitHub examples

Free online document parser App

Was this page helpful?

Any additional feedback you'd like to share with us?

Please tell us how we can improve this page.

Thank you for your feedback!