How to Extract Images from documents via .NET?

Images can be used to deliver information in such a way that may not be expressible by words. Images help us in grabbing user’s attention and explain tough concepts with ease. Sometimes while reading documents, journals or benefiting from presentations we often found some fascinating images and wanted to download it. GroupDocs.Parser for .NET is a powerful API that help users to develop useful applications for extracting images from different types of documents and save them in PNG, JPEG, WebP, GIF, BMP and other formats. The API has included supports for text as well images extraction from some of the most commonly used file formats, such as PDF, Emails, Ebooks, Microsoft Office formats: Word (DOC, DOCX), PowerPoint (PPT, PPTX), Excel (XLS, XLSX), LibreOffice formats and many more. The API also fully supports documents parsing, extracting plain and structured text, text searching by keywords, extract metadata or images, containers as well as attachments and many more.

Extract images from documents in .NET

GroupDocs.Parser for .NET makes it easy for C# developers to extract images from a documents by implementing a few easy steps.

Instantiate Parser object for the initial document;
Call GetImages method and obtain collection of image objects;
Check if reader isn’t null (images extraction is supported for the document);
Iterate through the collection and get sizes, image types and image contents.

Learn more about the images extraction

How to extract images from documents using C# example code

// Extract images from documents using GroupDocs.Parser API
// Create an instance of Parser class
using (Parser parser = new Parser(filePath)) {
    // Extract images
    IEnumerable<PageImageArea> images = parser.GetImages();
    // Check if images extraction is supported
    if (images == null) {
        Console.WriteLine("Images extraction isn't supported");
        return;
    }
    // Iterate over images
    foreach (PageImageArea image in images) {
        // Print a page index, rectangle and image type:
        Console.WriteLine(string.Format("Page: {0}, R: {1}, Type: {2}", image.Page.Index, image.Rectangle, image.FileType));
    }
}

System Requirements

GroupDocs.Parser for .NET APIs are supported on all major platforms and operating systems. Before executing the code below, please make sure that you have the following prerequisites installed on your system.

Operating Systems: Microsoft Windows, Linux, MacOS
Development Environments: Microsoft Visual Studio, Xamarin, MonoDevelop
Frameworks
Download the latest version of GroupDocs.Parser for .NET from Nuget