Extract images from document Leave feedback

Prerequisites

GroupDocs.Parser for Python via .NET installed
Sample documents containing images
Write access to save extracted images (optional)

Extract images from document

To extract all images from a document:

Python

from groupdocs.parser import Parser

# Create an instance of Parser class
with Parser("./sample.pdf") as parser:
    # Extract images
    images = parser.get_images()
    
    # Check if image extraction is supported
    if images is None:
        print("Image extraction isn't supported")
    else:
        # Iterate over images
        for idx, image in enumerate(images):
            # Print image information
            print(f"Image {idx + 1}:")
            print(f"  Page: {image.page.index + 1}")
            print(f"  Type: {image.file_type}")
            print(f"  Size: {image.rectangle.width}x{image.rectangle.height}")
            print(f"  Position: ({image.rectangle.left}, {image.rectangle.top})")

sample.pdf

The following sample file is used in this example: sample.pdf

Expected behavior: Returns a collection of PageImageArea objects representing all images found in the document, or None if image extraction is not supported.

Save extracted images to files

To save extracted images to disk:

Python

from groupdocs.parser import Parser
import os

# Create output directory
output_dir = "extracted_images"
os.makedirs(output_dir, exist_ok=True)

# Create an instance of Parser class
with Parser("./sample.docx") as parser:
    # Extract images
    images = parser.get_images()
    
    if images is None:
        print("Image extraction isn't supported")
    else:
        # Iterate over images and save them
        for idx, image in enumerate(images):
            # Get file extension based on image type
            extension = image.file_type.extension
            
            # Generate filename
            filename = f"image_{idx + 1}{extension}"
            filepath = os.path.join(output_dir, filename)
            
            # Save image to file
            image.save(filepath)
            print(f"Saved: {filepath}")

sample.docx

The following sample file is used in this example: sample.docx

Expected behavior: Saves each extracted image to a separate file with the appropriate file extension (.png, .jpg, .gif, etc.).

Extract images with metadata

To extract images along with detailed

Extract images from document Leave feedback

On this page

Prerequisites

Extract images from document

Save extracted images to files

Extract images with metadata

`Get image stream`

`Convert images during extraction`

`Batch image extraction`

`Notes`

Was this page helpful?

Any additional feedback you'd like to share with us?

Please tell us how we can improve this page.

Thank you for your feedback!

On this page

Extract images from document Leave feedback

On this page

Prerequisites

Extract images from document

Save extracted images to files

Extract images with metadata

Get image stream

Convert images during extraction

Batch image extraction

Notes

Related pages

Was this page helpful?

Any additional feedback you'd like to share with us?

Please tell us how we can improve this page.

Thank you for your feedback!

On this page

`Get image stream`

`Convert images during extraction`

`Batch image extraction`

`Notes`

`Related pages`