Features Overview

Overview

GroupDocs.Metadata for Python via .NET reads, edits, removes, and exports metadata across many file formats: Microsoft Office, PDF, OpenDocument, images (JPEG, PNG, PSD, TIFF), audio (MP3, WAV), video, fonts, e-books, archives, and more. It exposes the major metadata standards (XMP, EXIF, IPTC, Image Resource Blocks, ID3) and format-specific properties through one unified, predicate-driven API. It runs entirely on-premise, requires no Microsoft Office installation, and ships as a pre-built wheel for Windows, Linux, and macOS.

See the full list of supported formats or browse the Developer Guide for runnable examples of every API surface.

Reading and Searching Metadata

Read every property in a file, or search for specific properties with a plain Python predicate (lambda p: ...); no specification objects are required. A predicate can match on a property’s name, value, type, or tags.
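The predicate idea can be sketched without the library. The example below is a minimal illustration, assuming a simplified tree model: Prop, Package, and search are hypothetical stand-ins invented for this sketch, not GroupDocs.Metadata types or calls.

```python
from dataclasses import dataclass, field
from typing import Callable, Iterator


@dataclass
class Prop:
    """Hypothetical stand-in for a metadata property."""
    name: str
    value: object
    tags: frozenset = frozenset()


@dataclass
class Package:
    """Hypothetical stand-in for a node in the metadata tree."""
    name: str
    props: list = field(default_factory=list)
    children: list = field(default_factory=list)


def search(root: Package, predicate: Callable[[Prop], bool]) -> Iterator[Prop]:
    """Yield every property under root for which predicate(p) is true."""
    for p in root.props:
        if predicate(p):
            yield p
    for child in root.children:
        yield from search(child, predicate)


tree = Package(
    "root",
    props=[Prop("Author", "Ada", frozenset({"creator"}))],
    children=[
        Package("xmp", props=[
            Prop("dc:title", "Report", frozenset({"title"})),
            Prop("xmp:Rating", 5),
        ]),
    ],
)

# The same search function serves every kind of match.
by_name = [p.name for p in search(tree, lambda p: p.name == "Author")]
by_type = [p.name for p in search(tree, lambda p: isinstance(p.value, int))]
by_tag = [p.name for p in search(tree, lambda p: "title" in p.tags)]
everything = [p.name for p in search(tree, lambda p: True)]
```

Because the filter is an ordinary callable, name, value, type, and tag conditions can be combined with plain and/or expressions in a single lambda.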

Editing Metadata

Add new properties or update existing ones in a unified way. Predefined tags let you set common properties (author, creation date, title) consistently regardless of the underlying format.

  • Set metadata properties — add or update every property matching a predicate.
  • Add metadata — add missing properties (for example, a last-printed date) when the format supports them.
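The difference between updating matched properties and adding a missing one can be sketched as follows. This is an illustration only: Prop, set_where, and add_if_missing are hypothetical helpers written for this example, and the library's own behavior may differ in detail.

```python
from dataclasses import dataclass
from datetime import datetime


@dataclass
class Prop:
    """Hypothetical stand-in for a metadata property."""
    name: str
    value: object


def set_where(props, predicate, value):
    """Assign value to every property matching predicate; return the count."""
    changed = 0
    for p in props:
        if predicate(p):
            p.value = value
            changed += 1
    return changed


def add_if_missing(props, name, value):
    """Append a property only when no property with that name exists."""
    if any(p.name == name for p in props):
        return False
    props.append(Prop(name, value))
    return True


doc = [Prop("Author", "unknown"), Prop("LastSavedBy", "unknown")]

updated = set_where(doc, lambda p: p.value == "unknown", "Ada")
added = add_if_missing(doc, "LastPrinted", datetime(2024, 1, 15))
added_again = add_if_missing(doc, "LastPrinted", datetime(2025, 1, 1))
```

Returning a count or a boolean lets the caller tell whether an edit actually touched the file before deciding to save it.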

Removing and Cleaning Metadata

Remove only the properties that match a predicate, or strip every detected property in a single call, which suits privacy and compliance workflows.
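As a rough sketch of the two removal modes, the example below models a file's metadata as nested dictionaries. The functions remove_where and strip_all are hypothetical names for this illustration, not library calls.

```python
def remove_where(packages, predicate):
    """Delete matching properties in place; return how many were removed."""
    removed = 0
    for props in packages.values():
        # Collect names first so the dict is not mutated while iterating.
        for name in [n for n, v in props.items() if predicate(n, v)]:
            del props[name]
            removed += 1
    return removed


def strip_all(packages):
    """Remove every property by using a predicate that always matches."""
    return remove_where(packages, lambda name, value: True)


photo = {
    "exif": {"Artist": "Ada", "GPSLatitude": 51.5, "GPSLongitude": -0.12},
    "xmp": {"dc:creator": "Ada", "xmp:Rating": 5},
}

# Targeted removal: drop only location data.
gps_removed = remove_where(photo, lambda name, value: name.startswith("GPS"))

# Full clean: drop whatever is left.
rest_removed = strip_all(photo)
```

Treating a full clean as removal with an always-true predicate keeps a single code path for both cases.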

Working with Metadata Standards

Read and write the metadata standards used by images and many other formats:

  • EXIF — read, update, and remove EXIF tags in JPEG, TIFF, PNG, WEBP, PSD, and more.
  • IPTC IIM — read and edit IPTC datasets in JPEG, TIFF, and PSD.
  • XMP — read, update, and add custom XMP packages across formats.

Image Resource Blocks (IRB) and format-specific native packages are exposed through the same root metadata tree.

Audio Metadata

Manage audio tags through the root package: ID3v1, ID3v2, Lyrics3, and APE, plus MPEG audio technical properties. Read or edit titles, artists, albums, and other tag fields, then export or strip them as needed.
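To make the tag formats concrete, here is a minimal standard-library sketch of reading and stripping an ID3v1 tag, the simplest of the formats listed. ID3v1 is a fixed 128-byte block at the end of the file that starts with the bytes "TAG". The function names are chosen for this example and are not part of GroupDocs.Metadata.

```python
def parse_id3v1(data: bytes):
    """Return the ID3v1 fields from the trailing 128 bytes, or None if absent."""
    if len(data) < 128:
        return None
    tag = data[-128:]
    if tag[:3] != b"TAG":
        return None

    def text(raw: bytes) -> str:
        # Fields are fixed-width, padded with NUL bytes or spaces.
        return raw.split(b"\x00", 1)[0].decode("latin-1").rstrip()

    return {
        "title": text(tag[3:33]),
        "artist": text(tag[33:63]),
        "album": text(tag[63:93]),
        "year": text(tag[93:97]),
        "comment": text(tag[97:127]),
        "genre": tag[127],  # numeric genre index
    }


def strip_id3v1(data: bytes) -> bytes:
    """Return the audio bytes without a trailing ID3v1 tag, if one exists."""
    return data[:-128] if parse_id3v1(data) is not None else data


# Build a synthetic 128-byte tag for demonstration.
sample_tag = (
    b"TAG"
    + b"Song".ljust(30, b"\x00")
    + b"Artist".ljust(30, b"\x00")
    + b"Album".ljust(30, b"\x00")
    + b"1999"
    + b"".ljust(30, b"\x00")
    + bytes([17])
)
sample_file = b"\xff\xfb" + b"\x00" * 10 + sample_tag

fields = parse_id3v1(sample_file)
cleaned = strip_id3v1(sample_file)
```

ID3v2, Lyrics3, and APE use different layouts and positions in the file, which is the kind of detail a unified tag API hides.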

Document Inspection and File Type Detection

Read basic facts about a file without walking its full metadata tree — format, MIME type, page count, size, and encryption state — and detect a file’s format by its internal structure rather than its extension.

Office documents also expose inspection data such as user comments, form fields, hidden pages, revisions, digital signatures, and common statistics (word count, character count).
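Detecting a format by internal structure usually starts with the file's leading bytes. The sketch below shows that general technique with a few well-known signatures. It is a simplified illustration, not the library's detection logic, and detect_format is a hypothetical name.

```python
# (leading bytes, format name, MIME type)
SIGNATURES = [
    (b"%PDF-", "pdf", "application/pdf"),
    (b"\x89PNG\r\n\x1a\n", "png", "image/png"),
    (b"\xff\xd8\xff", "jpeg", "image/jpeg"),
    (b"PK\x03\x04", "zip", "application/zip"),
]


def detect_format(header: bytes):
    """Return (name, mime) for the first matching signature, else None."""
    for magic, name, mime in SIGNATURES:
        if header.startswith(magic):
            return name, mime
    return None


# A PNG saved with a misleading ".txt" extension is still detected as PNG,
# because only the content is inspected.
png_bytes = b"\x89PNG\r\n\x1a\n" + b"\x00" * 16
detected_png = detect_format(png_bytes)
detected_pdf = detect_format(b"%PDF-1.7\n")
detected_unknown = detect_format(b"plain text")
```

Container formats need a second step: DOCX, XLSX, and EPUB files all begin with the ZIP signature, so a fuller detector has to look inside the archive to tell them apart.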

Loading Files from Different Sources

The Metadata constructor accepts a file path, a binary file-like object, or a URI, so you can load files from local disk, in-memory buffers, or cloud storage.
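For callers that handle all three source kinds in their own code, the dispatch might look like the sketch below. It uses only the standard library. open_source is a hypothetical helper for illustration, and treating only http and https as URI schemes is an assumption of this example.

```python
import io
import os
from urllib.parse import urlparse
from urllib.request import urlopen


def open_source(source):
    """Return a readable binary stream for a path, file-like object, or URI."""
    if hasattr(source, "read"):
        # Already a file-like object: pass it through unchanged.
        return source
    text = os.fspath(source)
    if urlparse(text).scheme in ("http", "https"):
        # Download into memory so the result is seekable.
        with urlopen(text) as response:
            return io.BytesIO(response.read())
    return open(text, "rb")


# An in-memory buffer is used as-is.
buffer = io.BytesIO(b"%PDF-1.7\n")
same_buffer = open_source(buffer)

# A local path is opened in binary mode.
with open_source(os.devnull) as stream:
    devnull_content = stream.read()
```

Checking for a read attribute first means any binary file-like object works, including buffers returned by cloud storage clients.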

Saving Files

After editing, save back to the original source or to a new destination — a path or a stream.

Note
Saving a modified file requires a license. Without a license, the library runs in evaluation mode — it reads only the first few properties of each package and save() raises an “Evaluation only” exception. See Evaluation Limitations and Licensing.

Exporting Metadata

Export the metadata tree — or a filtered subset of properties — to a spreadsheet or a structured data file for indexing, reporting, or downstream processing.
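The shape of such an export can be sketched with the standard library alone. In this illustration each property is a plain dictionary, and export_csv and export_json are hypothetical helpers rather than library functions. The optional predicate selects a subset.

```python
import csv
import io
import json


def export_csv(props, predicate=lambda p: True):
    """Return the selected properties as CSV text with a header row."""
    out = io.StringIO()
    writer = csv.writer(out, lineterminator="\n")
    writer.writerow(["package", "name", "value"])
    for p in props:
        if predicate(p):
            writer.writerow([p["package"], p["name"], p["value"]])
    return out.getvalue()


def export_json(props, predicate=lambda p: True):
    """Return the selected properties as a JSON array."""
    return json.dumps([p for p in props if predicate(p)], indent=2)


props = [
    {"package": "xmp", "name": "dc:title", "value": "Report"},
    {"package": "exif", "name": "Artist", "value": "Ada"},
]

xmp_csv = export_csv(props, lambda p: p["package"] == "xmp")
all_json = export_json(props)
```

CSV opens directly in a spreadsheet, while JSON preserves structure and suits indexing or downstream processing.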

AI and LLM Integration

GroupDocs.Metadata fits into AI document pipelines as an extraction step: it returns metadata as structured data that can enrich search indexes and LLM context. The groupdocs-metadata-net pip package ships an AGENTS.md file inside the wheel so AI coding assistants can discover the API surface automatically, and GroupDocs runs a public MCP server for on-demand documentation lookups. See Agents and LLM Integration for details.

On-Premise Deployment

The library makes no cloud calls, generates no outbound network traffic, and has no third-party software dependencies beyond what the OS already provides. The wheel is self-contained on Windows and ships its own native runtime libraries on Linux and macOS. See System Requirements for the short list of optional native packages (libgdiplus, fontconfig).