GroupDocs.Conversion for Python via .NET Overview

What is GroupDocs.Conversion?

GroupDocs.Conversion for Python via .NET is a native Python library that converts documents across 10,000+ format pairs, covering DOCX, PDF, XLSX, PPTX, images, CAD, email, compressed archives, eBooks, and page-description languages. It runs entirely on-premise, requires no Microsoft Office installation, and ships as a pre-built wheel for Windows, Linux, and macOS.

Typical uses include:

  • Document ingest pipelines — normalise whatever users upload to a single canonical format (usually PDF or HTML) before storing or indexing.
  • AI / RAG preprocessing — convert DOCX, XLSX, EML, and scanned PDFs into page-level PNGs or plain text for LLM consumption. See Agents and LLM Integration for the full pipeline.
  • Content migration — move between legacy and modern formats (DOC → DOCX, XLS → XLSX, PPT → PPTX) without losing formatting.
  • Print and archive — render arbitrary office formats to PDF/A for long-term storage and printing.
  • Page-by-page processing — extract a specific slide, page, or sheet as a standalone file for downstream workflows.
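The ingest-pipeline use case can be sketched as a small batch job. This is a minimal illustration, not the library's own batch API: `plan_conversions`, `normalise_to_pdf`, and the `SOURCE_EXTENSIONS` set are hypothetical names, and only the `Converter` and `PdfConvertOptions` calls come from this page's Quick Example.

```python
from pathlib import Path

# Hypothetical set of accepted upload types; adjust to your own pipeline.
SOURCE_EXTENSIONS = {".docx", ".xlsx", ".pptx"}


def plan_conversions(source_dir, output_dir):
    """Pair each accepted source file with the PDF path it should produce."""
    source_dir, output_dir = Path(source_dir), Path(output_dir)
    return [
        (src, output_dir / (src.stem + ".pdf"))
        for src in sorted(source_dir.iterdir())
        if src.is_file() and src.suffix.lower() in SOURCE_EXTENSIONS
    ]


def normalise_to_pdf(source_dir, output_dir):
    """Convert every planned file to PDF and return the paths written."""
    # Imported lazily so the planning step works without the library installed.
    from groupdocs.conversion import Converter
    from groupdocs.conversion.options.convert import PdfConvertOptions

    Path(output_dir).mkdir(parents=True, exist_ok=True)
    written = []
    for src, dst in plan_conversions(source_dir, output_dir):
        with Converter(str(src)) as converter:
            converter.convert(str(dst), PdfConvertOptions())
        written.append(dst)
    return written
```

Separating the planning step from the conversion step makes it possible to review or log the source-to-target mapping before any file is touched. Note that two sources with the same stem (for example `report.docx` and `report.xlsx`) would map to the same PDF name in this sketch.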

Key Capabilities

Capability | Description
10,000+ conversion pairs | DOCX, PDF, XLSX, PPTX, images, CAD, email, archives, eBooks, HTML, TeX, and more. See supported formats.
Load options per format | Passwords, fonts, encodings, sheet indexes, PDF annotations, email fields.
Convert options per target | DPI, page size, watermarks, rotation, font embedding, flavor.
Page selection | Convert specific pages or page ranges with page_number + pages_count or an explicit pages list.
Per-page file output | Emit one file per page for presentations, PDFs, and multi-page images.
Archives and containers | Convert every file inside a ZIP, RAR, 7Z, or OST / PST in a single call.
Watermarks | Add text watermarks during conversion: color, size, opacity, foreground or background.
Document inspection | Read format, page count, author, dimensions, and TOC without converting.
Possible-conversion discovery | Ask the engine what targets a given source supports before running a pipeline.
Streams | Load input from file-like objects, which is handy for cloud blobs and HTTP bodies.
Logging and diagnostics | Wire ConsoleLogger through ConverterSettings for production traces.
On-premise | No cloud calls, no Microsoft Office install, no network traffic.
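
The two page-selection styles in the table (a start page plus a count, or an explicit list) describe the same set of pages. The sketch below shows how one maps to the other in plain Python. It assumes 1-based page numbers, which matches the example on this page where page_number = 1 and pages_count = 2 select pages 1-2; the helper names are hypothetical and are not part of the library.

```python
def pages_from_window(page_number, pages_count):
    """Expand a 1-based start page and a count into an explicit page list."""
    if page_number < 1 or pages_count < 1:
        raise ValueError("page_number and pages_count must be >= 1")
    return list(range(page_number, page_number + pages_count))


def windows_from_pages(pages):
    """Group pages into (page_number, pages_count) runs of consecutive pages."""
    runs = []
    for page in sorted(set(pages)):
        if runs and page == runs[-1][0] + runs[-1][1]:
            # The page directly follows the current run, so extend it by one.
            start, count = runs[-1]
            runs[-1] = (start, count + 1)
        else:
            # A gap (or the first page) starts a new run.
            runs.append((page, 1))
    return runs
```

A scattered selection such as pages 1, 2, 5, 7, 8, 9 needs three start-plus-count windows, which is the case where an explicit pages list is the more convenient form.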

Quick Example

from groupdocs.conversion import Converter
from groupdocs.conversion.options.convert import PdfConvertOptions

def quick_example():
    """Convert a DOCX document to PDF — the five-line hello-world."""
    with Converter("./business-plan.docx") as converter:
        options = PdfConvertOptions()
        converter.convert("./business-plan.pdf", options)

if __name__ == "__main__":
    quick_example()

The next example adds load and convert options to the same flow:

from groupdocs.conversion import Converter
from groupdocs.conversion.options.convert import PdfConvertOptions
from groupdocs.conversion.options.load import WordProcessingLoadOptions

def convert_with_options():
    """Open a password-protected DOCX, render pages 1-2 at 300 DPI."""
    load_options = WordProcessingLoadOptions()
    load_options.password = "secret"

    with Converter("./protected.docx", load_options) as converter:
        options = PdfConvertOptions()
        options.dpi = 300
        options.page_number = 1
        options.pages_count = 2
        converter.convert("./protected.pdf", options)

if __name__ == "__main__":
    convert_with_options()
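
After either example runs, a pipeline may want a cheap check that the target file was really written as a PDF. The following is a minimal sketch using only the standard library; `looks_like_pdf` is a hypothetical helper, and it inspects only the file header rather than validating the whole document.

```python
from pathlib import Path

PDF_MAGIC = b"%PDF-"  # a well-formed PDF file begins with this header


def looks_like_pdf(path):
    """Return True if the file exists and begins with the PDF header bytes."""
    path = Path(path)
    if not path.is_file():
        return False
    with path.open("rb") as handle:
        return handle.read(len(PDF_MAGIC)) == PDF_MAGIC
```

For instance, `looks_like_pdf("./business-plan.pdf")` could be called right after `quick_example()` to fail fast before the file is stored or indexed.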

Where to next

  1. Install the package: Installation walks through PyPI and offline wheel installation for Windows, Linux, and macOS.
  2. Run your first conversion: Quick Start Guide converts a DOCX to PDF in five minutes.
  3. Explore the examples: Running Examples clones the runnable repository and runs every documented scenario locally or in Docker.
  4. Use it in depth: the Developer Guide covers loading, converting, inspecting, page selection, watermarks, archives, and logging.
  5. Plug it into AI pipelines: Agents and LLM Integration explains the MCP server, AGENTS.md, and how to chain GroupDocs.Conversion with GroupDocs.Markdown for RAG.