Extract metadata from PDF documents

To extract metadata from a PDF document, use the GetMetadata method. This method can extract the following metadata:

Name                 Description
title                The title of the document.
subject              The subject of the document.
keywords             The keywords of the document.
author               The name of the document's author.
application          The name of the application that created the document.
application-version  The version number of the application that created the document.
created-time         The time when the document was created.
last-saved-time      The time when the document was last saved.

Here are the steps to extract metadata from a PDF document:

  • Instantiate a Parser object for the initial document;
  • Call the GetMetadata method to obtain a collection of document metadata objects;
  • Iterate through the collection and read each item's name and value.

The following example demonstrates how to extract metadata from a PDF document:

// Create an instance of Parser class
using (Parser parser = new Parser(filePath))
{
    // Extract metadata from the document
    IEnumerable<MetadataItem> metadata = parser.GetMetadata();
    // Iterate over metadata items
    foreach (MetadataItem item in metadata)
    {
        // Print the item name and value
        Console.WriteLine(string.Format("{0}: {1}", item.Name, item.Value));
    }
}
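When only one or two fields are needed, it can be convenient to collect the items into a dictionary and look them up by name. The following is a minimal sketch, not part of the library: the PdfMetadataLookup class and its ReadMetadata helper are hypothetical names, the GroupDocs.Parser.Data namespace for MetadataItem is an assumption, and so is the null check, which treats a null result as "metadata extraction is not supported for this file".

```csharp
using System;
using System.Collections.Generic;
using GroupDocs.Parser;
using GroupDocs.Parser.Data; // assumed namespace of MetadataItem

static class PdfMetadataLookup
{
    // Hypothetical helper: returns the metadata as a name-to-value map.
    // The map is empty when no metadata is available.
    public static Dictionary<string, string> ReadMetadata(string filePath)
    {
        var result = new Dictionary<string, string>(StringComparer.OrdinalIgnoreCase);

        using (Parser parser = new Parser(filePath))
        {
            IEnumerable<MetadataItem> metadata = parser.GetMetadata();

            // Assumption: null means metadata extraction is not supported
            if (metadata == null)
            {
                return result;
            }

            foreach (MetadataItem item in metadata)
            {
                // Convert.ToString keeps this working whether Value is a string or an object
                result[item.Name] = Convert.ToString(item.Value);
            }
        }

        return result;
    }

    static void Main(string[] args)
    {
        if (args.Length == 0)
        {
            Console.WriteLine("Usage: PdfMetadataLookup <path-to-pdf>");
            return;
        }

        Dictionary<string, string> metadata = ReadMetadata(args[0]);

        // "author" is one of the metadata names listed above
        if (metadata.TryGetValue("author", out string author))
        {
            Console.WriteLine("Author: " + author);
        }
        else
        {
            Console.WriteLine("No author recorded.");
        }
    }
}
```

The case-insensitive comparer is a design choice for the sketch, so that a lookup for "Author" and "author" behaves the same way. If a document reports the same name twice, the last value wins.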

