GroupDocs.Metadata for .NET offers an advanced set of metadata management and manipulation features, allowing developers to easily read, edit, remove, search, compare, replace and export metadata information from images and document formats without using any external software. Extract metadata details from PDF, Word, Excel, PowerPoint, Outlook, OneNote, Visio, Project, AutoCAD, Archive and Multimedia file formats, and perform supported metadata operations with true flexibility.
GroupDocs.Metadata makes it easy for .NET developers to read and extract metadata information from PDF files from within their applications by implementing a few easy steps.
GroupDocs.Metadata for .NET APIs are supported on all major platforms and operating systems. Before executing the code below, please make sure that you have the following prerequisites installed on your system.
using (var metadata = new GroupDocs.Metadata.Metadata("input.pdf"))
{
// extract all metadata properties that fall into a particular category
var properties = metadata.FindProperties(
p => p.Tags.Any(t => t.Category == GroupDocs.Metadata.Tagging.Tags.Content));
// iterate over all properties and display
foreach (var property in properties)
{
Console.WriteLine("{0} = {1}", property.Name, property.Value);
}
// extract all properties having a specific type and value
var year = DateTime.Today.Year;
properties = metadata.FindProperties(
p => p.Value.Type == GroupDocs.Metadata.Common.MetadataPropertyType.DateTime &&
p.Value.ToStruct(DateTime.MinValue).Year == year);
// display all datetime properties with the year value equal to the current year
foreach (var property in properties)
{
Console.WriteLine("{0} = {1}", property.Name, property.Value);
}
// extract all properties having names matching the specified regex
const string pattern = "^author|company|(.+date.*)$";
var regex = new System.Text.RegularExpressions.Regex(pattern,
System.Text.RegularExpressions.RegexOptions.IgnoreCase);
properties = metadata.FindProperties(p => regex.IsMatch(p.Name));
// display properties whose names match the following pattern
foreach (var property in properties)
{
Console.WriteLine("{0} = {1}", property.Name, property.Value);
}
}
Retrieve metadata information of PDF file right now by visiting GroupDocs.Metadata Live Demos website.
The live demo has the following benefits
No need to download API
No need to write any code
Just upload the source file
Get download link to save the file
Portable Document Format (PDF) is a type of document created by Adobe back in 1990s. The purpose of this file format was to introduce a standard for representation of documents and other reference material in a format that is independent of application software, hardware as well as Operating System. PDF files can be opened in Adobe Acrobat Reader/Writer as well in most modern browsers like Chrome, Safari, Firefox via extensions/plug-ins. Most of the commercially available software suites also offer conversion of their documents to PDF file format without the requirement of any additional software component. Thus, PDF file format has full capability to contain information like text, images, hyperlinks, form-fields, rich media, digital signatures, attachments, metadata, Geospatial features and 3D objects in it that can become as part of source document.
Read More About PDF File FormatMulti format documents and images metadata extraction API for .NET. Retrieve metadata of some of the popular file formats as stated below.
(Adobe Portable Document Format)
(Microsoft Word Document)
(Microsoft Word Macro-Enabled Document)
(Microsoft Word Open XML Document)
(Microsoft Word Document Template)
(Word Open XML Document Template)
(Microsoft Excel Binary File Format)
(Microsoft Excel Open XML Spreadsheet)
(Microsoft Excel Macro-Enabled Spreadsheet)
(OOXML Macro Enabled Workbook Template)
(PowerPoint Presentation)
(Microsoft PowerPoint Slide Show)
(PowerPoint Open XML Presentation)
(PowerPoint Open XML Slide Show)
(Microsoft PowerPoint Open XML Template)
(Microsoft PowerPoint Template)
(Microsoft PowerPoint Presentation)
(Microsoft PowerPoint Slide Show)
(Open Document Spreadsheet)
(Open Document Text)
(Tagged Image File Format)
(JPEG Image)
(Portable Network Graphic)
(Graphical Interchange Format File)
(Bitmap File Format)
(JPEG 2000 Core Image File)
(Raster Web Image File Format)
(Adobe Photoshop Document)
(Windows Metafile)
(Enhanced Metafile Format)
(Microsoft Project Document)
(Microsoft Outlook E-mail Message)
(E-mail Message)
(Autodesk Design Data Formats)
(AutoCAD Drawing Interchange)
(Microsoft OneNote)
(MPEG Audio Layer III)
(Waveform Audio File Format)
(Digital Imaging & Comm in Medicine)
(Audio Video Interleave File)
(Microsoft Visio 2003-2010 Drawing)
(Microsoft Visio Drawing)
(Microsoft Visio 2003-2010 Stencil)
(Microsoft Visio 2003-2010 XML Drawing)
(Microsoft Visio 2003-2010 XML Stencil)
(Archive File Format)
(Digital E-Book File Format)
(Electronic Business Card)
(vCard)