GroupDocs.Parser for .NET is a text, metadata and image extractor API for business applications developed using C#, ASP.NET, and other .NET technologies. It supports extraction of raw, formatted & structured text as well as metadata from the files of supported formats. Through GroupDocs.Parser for .NET, your applications can also perform parsing of password protected documents for popular formats, such as Word processing documents, Excel spreadsheets, PowerPoint presentations, OneNote, PDF files and ZIP archives.
GroupDocs.Parser API is a right choice for corporate solutions which needs file text extraction feature. These APIs are well supported on all major operating systems and platforms including Frameworks: .NET Framework, .NET Standard, .NET Core, Mono.
GroupDocs.Parser for .NET makes it easy for C# developers to extract a text from a PDF file by implementing a few easy steps.
// Extract text from PDF file using GroupDocs.Parser API
// Create an instance of Parser class
using (Parser parser = new Parser(filePath)) {
// Extract a text into the reader
using (TextReader reader = parser.GetText()) {
// Print a text from the document
// If text extraction isn't supported, a reader is null
Console.WriteLine(reader == null ? "Text extraction isn't supported" : reader.ReadToEnd());
}
}
GroupDocs.Parser for .NET APIs are supported on all major platforms and operating systems. Before executing the code below, please make sure that you have the following prerequisites installed on your system.
Extract text from PDF file right now by visiting GroupDocs.Parser Live Demos website. The live demo has the following benefits.
No need to download API
No need to write any code
Just upload the source file
Get download link to save the file
.NET documents parse & text extraction API for file formats and images. Extract data for some of the popular file formats as stated below.
(PowerPoint Slide Show)
(Microsoft PowerPoint 97-2003)
(Open XML presentation Format)
(Rich Text Format)
(LaTeX Source Document)
(The 7th Guest Video File)
(Visio Macro-Enabled Drawing)
(Visio Drawing)
(Visio Macro-Enabled Stencil File)
(Visio Stencil File)
(Visio Macro-Enabled Drawing Template)
(Visio Drawing Template)
(Visio Stencil XML File)
(Anim8or 3D Model)
(Excel Macro-Enabled Add-In)
(Microsoft Excel Spreadsheet (Legacy))