GroupDocs.Metadata at a glance

Python library to control metadata

Main features

GroupDocs.Metadata for Python via .NET is a library for managing metadata in a wide range of file formats. Add metadata viewing, editing, deletion, search, comparison, replacement, and export to your Python apps. It supports popular formats such as PDF, Microsoft Office (Word, Excel, PowerPoint), Outlook emails, Project, Visio diagrams, OneNote, images (including PSD), CAD drawings, audio, video, OpenType fonts, and metafiles.

Easy Metadata Manipulation

The library offers metadata search, replacement, property comparison, and extraction. Export metadata to Excel, CSV, or DataSet formats. It works with built-in and custom properties as well as metadata standards such as XMP and EXIF.

Popular Platform Support

GroupDocs.Metadata for Python via .NET is compatible with popular Python versions and runs on Windows, Linux, and macOS.

Platform Independence

GroupDocs.Metadata for Python via .NET is ready for integration with various operating systems and package managers.

Amazon
Docker
Azure
Eclipse
IntelliJ
Windows
Linux
PyPI

Supported file formats

GroupDocs.Metadata for Python via .NET processes a wide range of file formats. Explore the full list.

Office formats

  • Portable: PDF
  • Word: DOC, DOCX, DOCM, DOT, DOTX, DOTM, RTF, TXT
  • Excel: XLS, XLSX, XLSM, XLSB, XLT, XLTM, XLTX, XLAM, SXC, SpreadsheetML
  • PowerPoint: PPT, PPTX, PPS, PPSX, PPSM, POT, POTM, POTX, PPTM
  • OpenDocument: ODT, ODS
  • Visio: VSD, VDX, VSS, VSSX, VSX, VST, VSTX, VTX, VSDX, VDW, VSTM, VSSM, VSDM

Media & Graphics

  • Video: AVI, MOV, QT, FLV
  • Popular image formats: JPG, JPEG, JPE, JP2, PNG, BMP
  • Multi-page images: GIF, WEBP, TIFF, DJVU, DJV, DICOM
  • Audio: MP3, WAV
  • Matroska Media Container: MKV, MKA, MK3D, WEBM
  • AutoCAD: DWG, DXF
  • Photoshop: PSD

Other

  • Outlook: MSG, EML, EMLX, PST, OST
  • Fonts: OTF, OTC, TTF, TTC
  • Project: MPP
  • Metafiles: EMF, WMF
  • vCard: VCF, VCR
  • OneNote: ONE
  • Others: EPUB, ZIP, TORRENT, ASF

GroupDocs.Metadata for Python via .NET Features

Enhance document security with metadata processing.

Protect Business Files

Remove metadata and comments from business reports and documents

Photo Location

Manage image metadata, including photo location information

Control Office Documents

Manipulate metadata in PDF, Word, Excel, PowerPoint, and other formats

Analyze Metadata Changes

Compare and analyze metadata changes in supported file formats

Built-In Metadata Support

Work with built-in and custom metadata properties

Control Image Metadata

Collect image file metadata such as EXIF or XMP

Email Metadata Support

Manage metadata and attachments in email messages

EXIF Image Metadata

Edit EXIF metadata in WEBP, PNG, or PSD files

Optimize File Content

Reduce memory usage in PDF, Excel, and image files

Matroska Multimedia Support

Get Matroska subtitles and metadata in audio and video files

Generate Image Previews

Create image previews for MSG, CAD, EML, or EPUB files

Multimedia Support

Retrieve XMP metadata from MOV, MP3, and WEBP files

Control Digital Signatures

Identify and remove digital signatures in PDFs and Office documents

Metadata Export

Export metadata to Excel, CSV, or DataSet formats

Built-In Metadata Control

Analyze specific metadata properties using defined keys

Password Protection

Detect password protection in PDFs and MS Word, Excel, and PowerPoint documents

Replace Metadata Content

Replace metadata properties in Word, Excel, PowerPoint, and PDF files

PNG Metadata Export

Extract text metadata from PNG images using Python

Update Image Metadata

Add or update XMP and EXIF metadata using the Search API

Control Office Files

Access and delete hidden data in PDF, MS Word, Excel, and PowerPoint files

File Type Detection

Detect file type at runtime using Python

Matroska Multimedia Preview

Retrieve thumbnails and image previews for files with Matroska support

TIFF Support

Manipulate IPTC metadata in TIFF images

HEIC Media Support

Manage EXIF tags and XMP metadata in HEIC/HEIF images

Font Metadata Support

Control metadata in OpenType font files

Microsoft Project Support

Retrieve metadata from encrypted Microsoft Project files

JPEG Metadata Support

Add, update, or remove EXIF data for JPEG2000 images

Code samples

Explore code examples demonstrating common GroupDocs.Metadata for Python via .NET functionalities

Get Insights into Document Metadata

Use the GroupDocs.Metadata for Python via .NET API to retrieve a document's metadata:

How to get specific document metadata


import groupdocs.metadata as gm

# Load the source document with the Metadata constructor
with gm.Metadata("input.pptx") as metadata:

    # Get all properties that contain the name of the last document editor
    # or the date/time the document was last modified.
    # The chained call is wrapped in parentheses so it is valid Python.
    specification = gm.search.ContainsTagSpecification(
        gm.tagging.Tags.person.editor
    ).either(gm.search.ContainsTagSpecification(gm.tagging.Tags.time.modified))

    properties = metadata.find_properties(specification)

    # Process the retrieved metadata entries
    # ("prop" avoids shadowing the built-in name "property")
    for prop in properties:
        print(f"Property name: {prop.name}, Property value: {prop.value}")
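The properties found above can also be written out as name/value pairs. The following is a minimal sketch, not the library's own export feature: it uses Python's standard csv module, and the helper names (properties_to_rows, rows_to_csv, editor_and_modified_csv) as well as the sample property name are hypothetical. The only GroupDocs.Metadata calls it makes are the ones shown in the sample above, imported lazily so the helpers can be tried without the library installed.

```python
import csv
import io
from types import SimpleNamespace


def properties_to_rows(properties):
    """Turn objects exposing .name and .value into (name, value) string pairs."""
    return [(str(p.name), str(p.value)) for p in properties]


def rows_to_csv(rows):
    """Render (name, value) pairs as CSV text with a header row."""
    buffer = io.StringIO()
    writer = csv.writer(buffer, lineterminator="\n")
    writer.writerow(["name", "value"])
    writer.writerows(rows)
    return buffer.getvalue()


def editor_and_modified_csv(path):
    """Find the editor and last-modified properties of a file and return them as CSV.

    Assumption: relies only on the calls used in the sample above.
    """
    import groupdocs.metadata as gm  # lazy import; requires the library

    with gm.Metadata(path) as metadata:
        specification = gm.search.ContainsTagSpecification(
            gm.tagging.Tags.person.editor
        ).either(gm.search.ContainsTagSpecification(gm.tagging.Tags.time.modified))
        return rows_to_csv(properties_to_rows(metadata.find_properties(specification)))


# Stand-in object (not a real library type, hypothetical property name)
# to show the helpers without the library.
sample = [SimpleNamespace(name="LastSavedBy", value="alice")]
print(rows_to_csv(properties_to_rows(sample)), end="")
```

With the library installed, editor_and_modified_csv("input.pptx") would return the same kind of text for a real file.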

Add Missing Metadata to Documents

Use our solution to add metadata to your documents:

How to add some missing metadata properties to a file regardless of its format.


from datetime import datetime

import groupdocs.metadata as gm

# Load source document
with gm.Metadata("input.pdf") as metadata:
    # Skip files of unknown format and encrypted documents
    if metadata.file_format != gm.common.FileFormat.UNKNOWN and not metadata.get_document_info().is_encrypted:

        # Add a property containing the file's last printing date if it's missing
        # The property is added only if the document supports this type of metadata
        specification = gm.search.ContainsTagSpecification(gm.tagging.Tags.time.printed)
        now = datetime.now()
        property_value = gm.common.PropertyValue(now)
        affected = metadata.add_properties(specification, property_value)
        print(f"Affected properties: {affected}")

        # Save the modified document to the specified path
        metadata.save("output.pdf")
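Because the sample above works regardless of file format, it can be wrapped in a reusable function. The sketch below is one possible arrangement under stated assumptions: the function names (output_path_for, stamp_printed_date) and the ".updated" naming scheme are hypothetical, and the GroupDocs.Metadata calls are limited to those in the sample above. The output is written next to the input under a different name so the original file is not overwritten.

```python
from datetime import datetime
from pathlib import Path


def output_path_for(source, suffix=".updated"):
    """Return a sibling path such as input.updated.pdf so the input is never overwritten."""
    source = Path(source)
    return source.with_name(f"{source.stem}{suffix}{source.suffix}")


def stamp_printed_date(source, when=None):
    """Add a last-printed date to a file if it is missing.

    Returns (target_path, affected_count), or None when the file is skipped.
    Assumption: relies only on the calls used in the sample above.
    """
    import groupdocs.metadata as gm  # lazy import; requires the library

    when = when or datetime.now()
    target = output_path_for(source)
    with gm.Metadata(str(source)) as metadata:
        # Skip files of unknown format and encrypted documents
        if metadata.file_format == gm.common.FileFormat.UNKNOWN:
            return None
        if metadata.get_document_info().is_encrypted:
            return None
        specification = gm.search.ContainsTagSpecification(gm.tagging.Tags.time.printed)
        affected = metadata.add_properties(specification, gm.common.PropertyValue(when))
        metadata.save(str(target))
    return target, affected


# The path helper is plain Python and can be checked without the library.
print(output_path_for("docs/input.pdf"))
```

A caller could then loop over a folder, for example calling stamp_printed_date(path) for each path in Path("docs").iterdir() and ignoring None results.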

Temporary license tips

1. Sign up with your work email. Free mail services are not allowed.
2. Use the Get a temporary license button on the second step.