Extract images from PDF documents

To extract images from PDF documents GetImages methods are used. By default images are extracted with its original format. With using ImageOptions class it is possible to extract images from PDF documents as bmp, gif, jpeg, png and webp formats.

Here are the steps to extract images from PDF document to PNG-files:

  • Instantiate Parser object for the initial document;
  • Call GetImages method and obtain the collection of image objects;
  • Iterate through the collection and save image contents to the file.

The following example demonstrates how to extract images from PDF document:

// Create an instance of Parser class
using (Parser parser = new Parser(filePath))
    // Extract images from document
    IEnumerable<PageImageArea> images = parser.GetImages();
    // Create the options to save images in PNG format
    ImageOptions options = new ImageOptions(ImageFormat.Png);
    int imageNumber = 0;
    // Iterate over images
    foreach (PageImageArea image in images)
        // Save the image to the png file
        image.Save(imageNumber.ToString() + ".png", options);

More resources

GitHub examples

You may easily run the code above and see the feature in action in our GitHub examples:

Free online document parser App

Along with full featured .NET library we provide simple, but powerful free Apps.

You are welcome to parse documents and extract data from PDF, DOC, DOCX, PPT, PPTX, XLS, XLSX, Emails and more with our free online Free Online Document Parser App.