Extract metadata from documents

GroupDocs.Parser allows to extract basic metadata from documents of various formats: PDF, Emails, Ebooks, Microsoft Office: Word (DOC, DOCX), PowerPoint (PPT, PPTX), Excel (XLS, XLSX), LibreOffice formats and many others (see full list at supported document formats article).

Extract metadata from documents

To extract metadata from documents simply call the GetMetadata method:

IEnumerable<MetadataItem> GetMetadata();

This method returns a collection of MetadataItem objects with following members:

MemberDescription
NameThe name of the metadata item
ValueThe value of the metadata item

Here are the steps to extract metadata from the document:

  • Instantiate Parser object for the initial document;
  • Call GetMetadata method and obtain collection of document metadata objects;
  • Check if collection isn’t null (metadata extraction is supported for the document);
  • Iterate through the collection and get metadata names and values.

The following example shows how to extract metadata from a document:

// Create an instance of Parser class
using(Parser parser = new Parser(filePath))
{
    // Extract metadata from the document
    IEnumerable<MetadataItem> metadata = parser.GetMetadata();
    // Check if metadata extraction is supported
    if(metadata == null)
    {
        Console.WriteLine("Metatada extraction isn't supported");
    }

    // Iterate over metadata items
    foreach(MetadataItem item in metadata)
    {
        // Print an item name and value
        Console.WriteLine(string.Format("{0}: {1}", item.Name, item.Value));
    }
}

More resources

Advanced usage topics

To learn more about document data extraction features and get familiar how to extract text, images, forms and more, please refer to the advanced usage section.

GitHub examples

You may easily run the code above and see the feature in action in our GitHub examples:

Free online metadata extractor App

Along with full featured .NET library we provide simple, but powerfull free APPs.

You are welcome to extract metadata from your documents with our free online GroupDocs Parser App.