Features Overview Leave feedback

Overview

GroupDocs.Merger for Python via .NET provides a comprehensive set of document manipulation features across 70+ supported formats — Microsoft Office, PDF, OpenDocument, images, Visio diagrams, eBooks, archives, and more. It runs entirely on-premise, requires no third-party office applications, and ships as a pre-built wheel on Windows, Linux, and macOS.

See the full list of supported formats or browse the Developer Guide for runnable examples of every API surface.

Merge / Join

The core capability is combining multiple documents into one. You can merge entire documents or select specific pages or page ranges from each source document. Format-specific join options let you control how the documents are stitched together — for example, continuous section breaks for Word documents or bookmark preservation for PDFs.

Merge whole documents — see Merge Files.
Merge specific pages from each source — see Merge PDF or any per-format merge page.
Word-specific options (WordJoinOptions) and PDF-specific options (PdfJoinOptions) are documented per format in Merge Files.

Split

Split a document into a collection of smaller documents. You can emit one file per listed page, split by interval (every N pages), or split a plain-text file by line numbers.

Split by page list or interval — see Split Document.
Split a text file at specific line numbers — see Split Text File.

Page Operations

GroupDocs.Merger exposes granular, page-level control over any document without requiring a round-trip through an office application.

Extract Pages

Produce a new document that contains only the specified pages or page range from a source document. See Extract Pages.

Remove Pages

Remove one or more pages from a document by page number. See Remove Pages.

Swap Pages

Exchange the positions of two pages within a document. See Swap Pages.

Move Page

Move a single page to a different position within the page order. See Move Page.

Rotate Pages

Rotate selected pages by 90°, 180°, or 270° using RotateMode enum values. See Rotate Pages.

Change Page Orientation

Switch selected pages between portrait and landscape orientation using OrientationMode. See Change Page Orientation.

Security

GroupDocs.Merger manages document password protection without requiring the document to be opened in an editor:

Add a password — protect a document with a new password.
Update a password — change an existing password after loading with the current one.
Remove a password — strip protection after loading with the correct password.
Check protection — call merger.is_password_set() to test whether a document is protected.

See Security Operations for runnable examples of each operation.

Document Inspection

Read metadata from a document without modifying it — file type, page count, page dimensions, visibility flags, and file size. Enumerate all formats supported at runtime via FileType.get_supported_file_types().

Read document metadata — see Get Document Information.
List all supported formats at runtime — see Get Supported File Types.

Page Preview

Generate raster image previews (PNG, JPEG, or BMP) of individual document pages. Previews are useful for displaying document thumbnails in a UI, validating page content before merging, or feeding page images into a vision model.

See Page Preview for a runnable example using the file-stream callback pattern.

Loading Documents

GroupDocs.Merger accepts documents from local disk, binary streams (file handles, in-memory buffers), and password-protected files via LoadOptions.

Load from local disk — see Load from Local Disk.
Load from a stream — see Load from Stream.
Load a password-protected file — see Load Password Protected Document.

AI / LLM Integration

GroupDocs.Merger is designed to be a first-class building block for AI document pipelines. The groupdocs-merger-net pip package ships an AGENTS.md file inside the wheel so AI coding assistants can discover the API surface automatically, and GroupDocs runs a public MCP server for on-demand documentation lookups.

See Agents and LLM Integration for the full story — including a runnable pipeline that merges PDFs and extracts a page subset for downstream AI processing.

On-Premise Deployment

No cloud calls, no outbound network traffic, no Microsoft Office or Adobe Acrobat installation required. The wheel is self-contained on Windows and ships its own native runtime libraries on Linux and macOS. See System Requirements for the short list of optional native packages (libgdiplus, libfontconfig1).

We value your opinion. Your feedback will help us improve our documentation.