GroupDocs.Metadata for Python via .NET reads, edits, removes, and exports metadata across documents, images, audio, video, and many other formats — Microsoft Office, PDF, OpenDocument, images (JPEG, PNG, PSD, TIFF), audio (MP3, WAV), video, fonts, e-books, archives, and more. It exposes the major metadata standards (XMP, EXIF, IPTC, Image Resource Blocks, ID3) and format-specific properties through one unified, predicate-driven API. It runs entirely on-premise, requires no Microsoft Office installation, and ships as a pre-built wheel on Windows, Linux, and macOS.
Read every property in a file, or search for specific properties using a plain Python predicate (lambda p: ...) — no specification objects required. You can match on a property’s name, value, type, or tags.
Add new properties or update existing ones in a unified way. Predefined tags let you set common properties (author, creation date, title) consistently regardless of the underlying format.
Read and write the metadata standards used by images and many other formats:
EXIF — read, update, and remove EXIF tags in JPEG, TIFF, PNG, WEBP, PSD, and more.
IPTC IIM — read and edit IPTC datasets in JPEG, TIFF, and PSD.
XMP — read, update, and add custom XMP packages across formats.
Image Resource Blocks (IRB) and format-specific native packages are exposed through the same root metadata tree.
Audio Metadata
Manage audio tags through the root package: ID3v1, ID3v2, Lyrics3, and APE, plus MPEG audio technical properties. Read or edit titles, artists, albums, and other tag fields, then export or strip them as needed.
Document Inspection and File Type Detection
Read basic facts about a file without walking its full metadata tree — format, MIME type, page count, size, and encryption state — and detect a file’s format by its internal structure rather than its extension.
Get document info — format, MIME type, page count, size, and encryption flag.
Office documents also expose inspection data such as user comments, form fields, hidden pages, revisions, digital signatures, and common statistics (word count, character count).
Loading Files from Different Sources
The Metadata constructor accepts a file path, a binary file-like object, or a URI, so you can load files from local disk, in-memory buffers, or cloud storage.
Saving a modified file requires a license. Without a license, the library runs in evaluation mode — it reads only the first few properties of each package and save() raises an “Evaluation only” exception. See Evaluation Limitations and Licensing.
Exporting Metadata
Export the metadata tree — or a filtered subset of properties — to a spreadsheet or a structured data file for indexing, reporting, or downstream processing.
GroupDocs.Metadata is a useful building block for AI document pipelines: extract metadata as structured data to enrich search indexes and LLM context. The groupdocs-metadata-net pip package ships an AGENTS.md file inside the wheel so AI coding assistants can discover the API surface automatically, and GroupDocs runs a public MCP server for on-demand documentation lookups. See Agents and LLM Integration for the full story.
On-Premise Deployment
No cloud calls, no outbound network traffic, no third-party software dependencies beyond what the OS already provides. The wheel is self-contained on Windows and ships its own native runtime libraries on Linux and macOS. See System Requirements for the short list of optional native packages (libgdiplus, fontconfig).
Was this page helpful?
Any additional feedback you'd like to share with us?
Please tell us how we can improve this page.
Thank you for your feedback!
We value your opinion. Your feedback will help us improve our documentation.