Extract Images from Documents Leave feedback

Extract images

Python

from groupdocs.parser import Parser

with Parser("./slides.pptx") as parser:
    images = parser.get_images()
    if images is None:
        print("Image extraction isn't supported for this format.")
    else:
        for index, image_area in enumerate(images, start=1):
            print(f"Page: {image_area.page.index + 1}, Type: {image_area.file_type}")
            # Save each image
            output_path = f"image-{index}.{image_area.file_type.extension}"
            image_area.save(output_path)

slides.pptx

The following sample file is used in this example: slides.pptx

Steps

Create Parser for the source file.
Call get_images() to obtain PageImageArea items.
Inspect page, rectangle, file_type, and save streams with save() or get_image_stream().

For coordinates-based processing and format conversion, see the advanced images guide; the same workflow applies in Python.

id: extract-images-from-documents url: parser/python-net/extract-images-from-documents title: Extract Images from Documents weight: 8 version: 25.12 description: “Extract embedded images from PDF, Word, Excel, presentations, emails, and archives using GroupDocs.Parser for Python via .NET.” productName: GroupDocs.Parser for Python via .NET hideChildren: false toc: true tags: python, parser, image-extraction, v25.12

GroupDocs.Parser can list and export images from supported documents (PDF, Office, emails, eBooks, and more).

Extract images

Python

from groupdocs.parser import Parser

with Parser("./slides.pptx") as parser:
    images = parser.get_images()
    if images is None:
        print("Image extraction isn't supported for this format.")
    else:
        for index, image_area in enumerate(images, start=1):
            print(f"Page: {image_area.page.index + 1}, Type: {image_area.file_type}")
            # Save each image
            output_path = f"image-{index}.{image_area.file_type.extension}"
            image_area.save(output_path)

slides.pptx

The following sample file is used in this example: slides.pptx

Steps

Create Parser for the source file.
Call get_images() to obtain PageImageArea items.
Inspect page, rectangle, file_type, and save streams with save() or get_image_stream().

For coordinates-based processing and format conversion, consult the .NET advanced images guide; the same workflow applies in Python.

We value your opinion. Your feedback will help us improve our documentation.

Extract Images from Documents Leave feedback

On this page

Extract images

Steps

For coordinates-based processing and format conversion, see the advanced images guide; the same workflow applies in Python.

Extract images

Steps

Was this page helpful?

Any additional feedback you'd like to share with us?

Please tell us how we can improve this page.

Thank you for your feedback!

On this page