GroupDocs.Metadata at a glance

Library designed for metadata manipulation via Java

Control files & documents metadata

GroupDocs.Metadata for Java is an advanced metadata management API for working with the metadata of documents, images, archives, torrents, and various other file formats. Developers can add metadata viewing, modifying, deleting, extracting, searching, comparing, replacing, and exporting features to their Java applications for all popular business document formats, such as PDF, Microsoft Word documents, Excel spreadsheets, PowerPoint presentations, Outlook emails, Project, Visio diagrams, OneNote, images, AutoCAD, Photoshop, audio, video, OpenType fonts, and metafiles.

Manipulate built-in metadata

The Java metadata library lets you search metadata, replace metadata properties, and compare the metadata of supported file formats to identify similarities and differences. You can also edit metadata for better information management and export retrieved metadata to an Excel file, a CSV file, or a DataSet. The API works with all commonly used metadata standards, including built-in, XMP, EXIF, and custom metadata properties, within supported document formats.
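
To make the compare and export ideas above concrete, here is a minimal, library-independent sketch in plain Java. It assumes the metadata of two files has already been read into simple name/value maps. The class `MetadataDiffSketch` and its methods are hypothetical names chosen for illustration and are not part of the GroupDocs.Metadata API.

```java
import java.util.LinkedHashMap;
import java.util.Map;
import java.util.Objects;
import java.util.TreeMap;

// Hypothetical helper for illustration only; not a GroupDocs.Metadata class.
class MetadataDiffSketch {

    // Classifies every property name found in either map as
    // SAME, DIFFERENT, ONLY_IN_LEFT, or ONLY_IN_RIGHT.
    static Map<String, String> compare(Map<String, String> left, Map<String, String> right) {
        Map<String, String> result = new TreeMap<>();
        for (Map.Entry<String, String> entry : left.entrySet()) {
            String name = entry.getKey();
            if (!right.containsKey(name)) {
                result.put(name, "ONLY_IN_LEFT");
            } else if (Objects.equals(entry.getValue(), right.get(name))) {
                result.put(name, "SAME");
            } else {
                result.put(name, "DIFFERENT");
            }
        }
        for (String name : right.keySet()) {
            // Names already classified above are left untouched.
            result.putIfAbsent(name, "ONLY_IN_RIGHT");
        }
        return result;
    }

    // Wraps a field in double quotes and doubles any embedded quotes.
    static String quote(String field) {
        String text = field == null ? "" : field;
        return '"' + text.replace("\"", "\"\"") + '"';
    }

    // Renders name/value pairs as CSV text with a header row.
    static String toCsv(Map<String, String> properties) {
        StringBuilder csv = new StringBuilder("Name,Value\n");
        for (Map.Entry<String, String> entry : properties.entrySet()) {
            csv.append(quote(entry.getKey())).append(',')
               .append(quote(entry.getValue())).append('\n');
        }
        return csv.toString();
    }

    public static void main(String[] args) {
        // Sample values standing in for metadata read from two files.
        Map<String, String> first = new LinkedHashMap<>();
        first.put("Author", "Alice");
        first.put("Title", "Quarterly report");

        Map<String, String> second = new LinkedHashMap<>();
        second.put("Author", "Bob");
        second.put("Title", "Quarterly report");
        second.put("Company", "Acme");

        System.out.println(compare(first, second));
        System.out.print(toCsv(second));
    }
}
```

In a real application the maps would be filled from the properties the library returns, and the comparison result could feed a report or a cleanup step.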

Broad Platform Support

GroupDocs.Metadata for Java is compatible with all Java versions and supports popular operating systems (Windows, Linux, macOS) that are capable of running a Java runtime.

Platform Independence

GroupDocs.Metadata for Java works across various operating systems, cloud platforms, IDEs, and build tools.

Amazon
Docker
Azure
Eclipse
IntelliJ
Windows
Linux
Maven

Supported file formats

GroupDocs.Metadata for Java enables processing of a wide range of file formats. See the complete list.

Office formats

  • Portable: PDF
  • Word: DOC, DOCX, DOCM, DOT, DOTX, DOTM, RTF, TXT
  • Excel: XLS, XLSX, XLSM, XLSB, XLTM, XLT, XLTX, XLAM, SXC, SpreadsheetML
  • PowerPoint: PPT, PPTX, PPS, PPSX, PPSM, POT, POTM, POTX, PPTM
  • OpenDocument: ODT, ODS
  • Visio: VSD, VDX, VSS, VSSX, VSX, VST, VSTX, VTX, VSDX, VDW, VSTM, VSSM, VSDM

Media & Graphics

  • Video: AVI, MOV, QT, FLV
  • Popular image formats: JPG, JPEG, JPE, JP2, PNG, BMP
  • Multi-page images: GIF, WEBP, TIFF, DJVU, DJV, DICOM
  • Audio: MP3, WAV
  • Matroska Media Container: MKV, MKA, MK3D, WEBM
  • AutoCAD: DWG, DXF
  • Photoshop: PSD

Other

  • Outlook: MSG, EML, EMLX, PST, OST
  • Fonts: OTF, OTC, TTF, TTC
  • Project: MPP
  • Metafiles: EMF, WMF
  • vCard: VCF, VCR
  • OneNote: ONE
  • Others: EPUB, ZIP, TORRENT, ASF

GroupDocs.Metadata for Java Features

Protect the metadata of your PDF files, Office documents, and images

EXIF Images Metadata

Update EXIF metadata properties in WEBP, PNG, and PSD files

Get File Metadata

Search properties of document, EXIF, and XMP metadata

Clean Office Formats

Access and delete hidden data in Microsoft Word, Excel, PowerPoint, and PDF files

Metadata Export

Export metadata of supported file formats to Excel, CSV, or DataSet

PNG Image Support

Extract text metadata from PNG image files

Remove Digital Signatures

Identify and delete digital signatures in Word, Excel, and PDF files

Built-In Metadata Support

Read metadata property using a defined key for any supported format

Image Previews

Generate image previews for EPUB, CAD, EML, and MSG files

File Content Optimization

Reduce memory consumption of PDF, Excel, and image formats

Matroska Multimedia Support

Read Matroska subtitles and retrieve metadata of audio and video files

Replace Metadata Content

Replace metadata properties of Word, Excel, PowerPoint, and PDF files

Clean Business Data

Delete metadata and comments from reports and documents

Photo Location

Manipulate image metadata properties and delete photo location information

Analyze Metadata Distinctions

Identify differences or similarities in metadata of supported formats by comparison

Password Protection

Detect document password protection in Word, Excel, PowerPoint, and PDF files

Archive & Torrents Support

Manipulate built-in and custom metadata and fetch metadata of torrents and archive formats

EXIF Image Metadata

Add or update XMP and EXIF metadata properties of arbitrary types using the Search API

Detect Document File Type at Run-time

Detect the type of a file or stream before processing its metadata

Font Metadata Support

Supports enumeration of any metadata type and reads metadata of OpenType font files

Email Metadata Support

Get and delete metadata of email messages and remove attachments

Microsoft Excel Support

Metadata extraction from Microsoft Excel files starting from Excel 95

Matroska Multimedia Previews

Fetch thumbnails and image previews of supported formats with Matroska multimedia container support

Microsoft Project Support

Read metadata from encrypted Microsoft Project files

TIFF Support

Add, update, and delete IPTC metadata packages in TIFF images

JPEG Support

Add, update, and remove EXIF metadata packages in JPEG2000 images

Multimedia Files Support

Extract XMP metadata properties in MOV, MP3, and WEBP files

HEIC Media Support

Read EXIF tags and XMP metadata properties from HEIC/HEIF image formats
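
As a rough illustration of the run-time file type detection listed among the features above, the following plain-Java sketch identifies a few formats from their leading signature bytes. It is only an approximation of the general technique and does not show how GroupDocs.Metadata performs detection internally. The class `FileTypeSniffer` and its methods are hypothetical names.

```java
import java.io.ByteArrayInputStream;
import java.io.IOException;
import java.io.InputStream;
import java.nio.charset.StandardCharsets;

// Hypothetical helper for illustration only; not a GroupDocs.Metadata class.
class FileTypeSniffer {

    // Returns true when the first bytes of data match the given signature.
    private static boolean startsWith(byte[] data, int length, int... signature) {
        if (length < signature.length) {
            return false;
        }
        for (int i = 0; i < signature.length; i++) {
            if ((data[i] & 0xFF) != signature[i]) {
                return false;
            }
        }
        return true;
    }

    // Detects a format from the leading bytes of a file.
    static String detect(byte[] header) {
        int n = header.length;
        if (startsWith(header, n, 0x25, 0x50, 0x44, 0x46)) return "PDF";             // "%PDF"
        if (startsWith(header, n, 0x89, 0x50, 0x4E, 0x47, 0x0D, 0x0A, 0x1A, 0x0A)) return "PNG";
        if (startsWith(header, n, 0xFF, 0xD8, 0xFF)) return "JPEG";
        if (startsWith(header, n, 0x47, 0x49, 0x46, 0x38)) return "GIF";             // "GIF8"
        if (startsWith(header, n, 0x50, 0x4B, 0x03, 0x04)) return "ZIP_CONTAINER";   // ZIP, DOCX, XLSX, EPUB
        if (startsWith(header, n, 0xD0, 0xCF, 0x11, 0xE0, 0xA1, 0xB1, 0x1A, 0xE1)) return "OLE_COMPOUND"; // DOC, XLS, MSG
        return "UNKNOWN";
    }

    // Reads up to eight bytes from the stream and delegates to detect(byte[]).
    static String detect(InputStream in) throws IOException {
        byte[] buffer = new byte[8];
        int length = 0;
        while (length < buffer.length) {
            int read = in.read(buffer, length, buffer.length - length);
            if (read < 0) {
                break; // end of stream reached before eight bytes were available
            }
            length += read;
        }
        byte[] header = new byte[length];
        System.arraycopy(buffer, 0, header, 0, length);
        return detect(header);
    }

    public static void main(String[] args) throws IOException {
        byte[] sample = "%PDF-1.7".getBytes(StandardCharsets.US_ASCII);
        System.out.println(detect(new ByteArrayInputStream(sample)));
    }
}
```

Signature sniffing only narrows the candidates: ZIP-based and OLE-based formats share a container signature, so telling a DOCX from an XLSX requires inspecting the container contents as well.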

Code samples

Explore code examples illustrating typical GroupDocs.Metadata for Java functionalities

Review Document Metadata

Use GroupDocs.Metadata for Java to inspect the metadata stored inside a document. Learn more: document metadata search.

How to get specific document metadata


import com.groupdocs.metadata.Metadata;
import com.groupdocs.metadata.core.IReadOnlyList;
import com.groupdocs.metadata.core.MetadataProperty;
import com.groupdocs.metadata.search.ContainsTagSpecification;
import com.groupdocs.metadata.tagging.Tags;

// Load the source document into a Metadata instance
try (Metadata metadata = new Metadata("source.pptx")) {

    // Get all properties that contain the name of the last document editor
    // or the date/time the document was last modified
    IReadOnlyList<MetadataProperty> properties = metadata.findProperties(
        new ContainsTagSpecification(Tags.getPerson().getEditor())
            .or(new ContainsTagSpecification(Tags.getTime().getModified())));

    // Process the retrieved metadata entries
    for (MetadataProperty property : properties) {
        System.out.println(String.format("Property name: %s, Property value: %s",
            property.getName(), property.getValue()));
    }
}

Add metadata to documents

GroupDocs.Metadata for Java empowers you to add hidden entries to your business data:

How to add missing metadata properties to a file regardless of its format

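import com.groupdocs.metadata.Metadata;
import com.groupdocs.metadata.core.FileFormat;
import com.groupdocs.metadata.core.PropertyValue;
import com.groupdocs.metadata.search.ContainsTagSpecification;
import com.groupdocs.metadata.tagging.Tags;
import java.util.Date;

// Load the source document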
try (Metadata metadata = new Metadata("source.pdf")) {
    if (metadata.getFileFormat() != FileFormat.Unknown && !metadata.getDocumentInfo().isEncrypted()) {

        // Add a property containing the date the file was last printed, if it is missing.
        // The property is added only if the document supports this type of metadata.
        int affected = metadata.addProperties(
            new ContainsTagSpecification(Tags.getTime().getPrinted()), 
            new PropertyValue(new Date()));

        System.out.println(String.format("Affected properties: %s", affected));

        // Save modified document to a specified path
        metadata.save("output.pdf");
    }
}

Temporary license tips

1. Sign up with your work email. Free mail services are not allowed.
2. Use the Get a temporary license button on the second step.