Extract Metadata from PDF Documents in C# .NET
Leave feedback
On this page
Extract Metadata from PDF Documents in C# .NET
PDF files often contain metadata such as the title, subject, author, creation date, and application used to generate the document. With GroupDocs.Parser for .NET, you can easily read these metadata properties programmatically using the GetMetadata method.
This guide shows how to extract metadata from PDF documents in C# step by step.
What Metadata Can Be Extracted?
The GetMetadata method can return the following properties from a PDF document:
Metadata Field
Description
title
Title of the PDF document
subject
Subject of the PDF document
keywords
Keywords associated with the document
author
Author of the document
application
Application that created the PDF
application-version
Version of the application
created-time
Date and time the PDF was created
last-saved-time
Date and time the PDF was last modified
⚡ Note: The available metadata depends on the PDF file itself. Not all fields may be present in every document.
How to Extract PDF Metadata in C#
Follow these steps to get metadata from a PDF document:
Create a Parser object and load the PDF file.
Call the GetMetadata method to retrieve metadata items.
Iterate through the collection and read metadata names and values.
Example: Extract PDF Metadata in C#
// Load the PDF fileusing(Parserparser=newParser(filePath)){// Extract metadataIEnumerable<MetadataItem>metadata=parser.GetMetadata();// Display metadata fieldsforeach(MetadataItemiteminmetadata){Console.WriteLine($"{item.Name}: {item.Value}");}}
Warning
GetMetadata method returns null value if metadata extraction isn’t supported for the document. For example, metadata extraction isn’t supported for TXT files. Therefore, for TXT file GetMetadata method returns null. If PDF document has no metadata, GetMetadata method returns an empty collection.
Why Extract PDF Metadata?
Extracting PDF metadata is useful for:
Document management – quickly find and categorize files by author, subject, or keywords.
Compliance checks – verify creation and modification dates.
Search indexing – improve search accuracy by including document properties.
More resources
GitHub examples
You may easily run the code above and see the feature in action in our GitHub examples:
Along with full featured .NET library we provide simple, but powerful free Apps.
You are welcome to parse documents and extract data from PDF, DOC, DOCX, PPT, PPTX, XLS, XLSX, Emails and more with our free online Free Online Document Parser App.
Was this page helpful?
Any additional feedback you'd like to share with us?
Please tell us how we can improve this page.
Thank you for your feedback!
We value your opinion. Your feedback will help us improve our documentation.
On this page
Analyzing your prompt, please hold on...
An error occurred while retrieving the results. Please refresh the page and try again.