GroupDocs.Conversion for Python via .NET Overview
Leave feedback
On this page
What is GroupDocs.Conversion?
GroupDocs.Conversion for Python via .NET is a native Python library that converts documents between 10,000+ format pairs — DOCX, PDF, XLSX, PPTX, images, CAD, email, compressed archives, eBooks, and page-description languages. It runs entirely on-premise, requires no Microsoft Office installation, and ships as a pre-built wheel on Windows, Linux, and macOS.
Typical uses include:
Document ingest pipelines — normalise whatever users upload to a single canonical format (usually PDF or HTML) before storing or indexing.
AI / RAG preprocessing — convert DOCX, XLSX, EML, and scanned PDFs into page-level PNGs or plain text for LLM consumption. See Agents and LLM Integration for the full pipeline.
Content migration — move between legacy and modern formats (DOC → DOCX, XLS → XLSX, PPT → PPTX) without losing formatting.
Print and archive — render arbitrary office formats to PDF/A for long-term storage and printing.
Page-by-page processing — extract a specific slide, page, or sheet as a standalone file for downstream workflows.
Key Capabilities
Capability
Description
10,000+ conversion pairs
DOCX, PDF, XLSX, PPTX, images, CAD, email, archives, eBooks, HTML, TeX, and more. See supported formats.
No cloud calls, no Microsoft Office install, no network traffic.
Quick Example
fromgroupdocs.conversionimportConverterfromgroupdocs.conversion.options.convertimportPdfConvertOptionsdefquick_example():"""Convert a DOCX document to PDF — the five-line hello-world."""withConverter("./business-plan.docx")asconverter:options=PdfConvertOptions()converter.convert("./business-plan.pdf",options)if__name__=="__main__":quick_example()
fromgroupdocs.conversionimportConverterfromgroupdocs.conversion.options.convertimportPdfConvertOptionsfromgroupdocs.conversion.options.loadimportWordProcessingLoadOptionsdefconvert_with_options():"""Open a password-protected DOCX, render pages 1-2 at 300 DPI."""load_options=WordProcessingLoadOptions()load_options.password="secret"withConverter("./protected.docx",load_options)asconverter:options=PdfConvertOptions()options.dpi=300options.page_number=1options.pages_count=2converter.convert("./protected.pdf",options)if__name__=="__main__":convert_with_options()
Install the package — Installation walks through PyPI and offline wheel installation for Windows, Linux, and macOS.
Run your first conversion — Quick Start Guide converts a DOCX to PDF in five minutes.
Explore the examples — Running Examples clones the runnable repository and runs every documented scenario locally or in Docker.
Use it in depth — the Developer Guide covers loading, converting, inspecting, page selection, watermarks, archives, and logging.
Plug it into AI pipelines — Agents and LLM Integration explains the MCP server, AGENTS.md, and how to chain GroupDocs.Conversion with GroupDocs.Markdown for RAG.
Was this page helpful?
Any additional feedback you'd like to share with us?
Please tell us how we can improve this page.
Thank you for your feedback!
We value your opinion. Your feedback will help us improve our documentation.
On this page
Analyzing your prompt, please hold on...
An error occurred while retrieving the results. Please refresh the page and try again.