Extract Annotations from DOCX Files in Java



About GroupDocs.Annotation for Java API

GroupDocs.Annotation for Java is a native Java library for adding annotations to PDF, Word and other documents on Mac, Windows or Ubuntu. It provides comprehensive support for creating, adding, editing, deleting, extracting and exporting annotations in images and many other document types. You can see the full list of supported document formats on this page.

This library lets you work not only with DOCX documents but also with many other document types, such as Word, Excel, PowerPoint, Outlook emails, Visio, Adobe, OpenDocument, OpenOffice, Photoshop, AutoCAD and many others.

The GroupDocs.Annotation for Java API allows you to create and add new notes, edit annotations, extract comments and annotations, and remove them from documents. The library supports 13 different annotation types, including Text, Polyline, Area, Underline, Point, Watermark, Arrow, Ellipse, Text Replacement, Distance, Text Field and Resource Redaction, in PDF, HTML, Microsoft Word documents, spreadsheets, diagrams, presentations, drawings, images and many other file formats.

The example below works with a DOCX document and shows the main steps of using GroupDocs.Annotation: setting up a license, opening the document you want to work with, creating an annotation, adding data objects to set annotation properties according to your requirements, and saving the result to the desired location. You can also find more detail on the supported features on our GitHub page or in our product documentation.

Steps to Extract Annotations from DOCX in Java

GroupDocs.Annotation makes it easy for Java developers to annotate documents and extract annotation information from DOCX files within any Java-based application by implementing a few easy steps.

  • Create Reply objects with a comment and a date.
  • Instantiate a LoadOptions object and call setImportAnnotations with the argument true.
  • Declare a variable of type List.
  • Call the Annotator's get method and assign the result to that variable.

System Requirements

GroupDocs.Annotation for Java APIs are supported on all major platforms and operating systems. Before executing the code below, please make sure that you have the following prerequisites installed on your system.

  • Operating Systems: Microsoft Windows, Linux, macOS
  • Development Environment: NetBeans, IntelliJ IDEA, Eclipse, etc.
  • Java Runtime Environment: Java 7 (1.7) and above
  • Get the latest version of GroupDocs.Annotation for Java from GroupDocs Artifact Repository
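
As a quick way to confirm the Java prerequisite, the following minimal sketch reads the java.specification.version system property and compares it with the Java 7 minimum. The class and method names are hypothetical and are not part of GroupDocs.Annotation. The sketch relies only on the standard naming scheme, in which Java 8 and earlier report versions such as 1.7, while Java 9 and later report 9, 11, 17 and so on.

```java
// Hypothetical helper, not part of GroupDocs.Annotation.
final class JavaVersionCheck {

    /** Converts a specification version such as "1.7" or "11" to its feature number (7 or 11). */
    static int featureVersion(String specVersion) {
        // Legacy scheme: "1.7" means Java 7, so drop the leading "1."
        String v = specVersion.startsWith("1.") ? specVersion.substring(2) : specVersion;
        // Ignore anything after a further dot, if present
        int dot = v.indexOf('.');
        return Integer.parseInt(dot >= 0 ? v.substring(0, dot) : v);
    }

    /** Returns true when the given specification version is at least the required feature version. */
    static boolean meetsMinimum(String specVersion, int minimum) {
        return featureVersion(specVersion) >= minimum;
    }

    public static void main(String[] args) {
        String spec = System.getProperty("java.specification.version");
        String verdict = meetsMinimum(spec, 7) ? "meets" : "does not meet";
        System.out.println("Java " + spec + " " + verdict + " the Java 7 minimum");
    }
}
```

Running the class on the target machine prints whether the installed runtime satisfies the requirement listed above.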

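// Package paths are assumed from the library's published examples; verify them for your version
import com.groupdocs.annotation.Annotator;
import com.groupdocs.annotation.models.annotationmodels.AnnotationBase;
import com.groupdocs.annotation.options.LoadOptions;
import java.util.List;

// The input file ("annotated.docx") must already contain annotations
LoadOptions loadOptions = new LoadOptions();

// Create an Annotator instance and read the annotations stored in the document
final Annotator annotator = new Annotator("annotated.docx", loadOptions);
List<AnnotationBase> annotations = annotator.get();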

Live Demos to Extract Annotations from Documents and Images

Extract annotation information from a DOCX file right now by visiting the GroupDocs.Annotation Live Demos website. The live demo has the following benefits:

  • No need to download the API
  • No need to write any code
  • Just upload the source file
  • Get a download link to save the file

About DOCX File Format

DOCX is a well-known format for Microsoft Word documents. Introduced in 2007 with the release of Microsoft Office 2007, it changed the document structure from plain binary to a combination of XML and binary files. DOCX files can be opened with Word 2007 and later versions, but not with earlier versions of MS Word, which support the DOC file extension.
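
Because a DOCX file is a ZIP-based package of XML parts, its structure can be inspected with nothing but the Java standard library. The sketch below is illustrative only and does not use GroupDocs.Annotation. The class name DocxPackageSketch and its helper methods are hypothetical. The in-memory sample is a placeholder ZIP with typical part names (Word stores the main text in word/document.xml and comments in word/comments.xml), not a valid Word document.

```java
import java.io.ByteArrayInputStream;
import java.io.ByteArrayOutputStream;
import java.io.IOException;
import java.io.InputStream;
import java.io.UncheckedIOException;
import java.nio.charset.StandardCharsets;
import java.util.ArrayList;
import java.util.List;
import java.util.zip.ZipEntry;
import java.util.zip.ZipInputStream;
import java.util.zip.ZipOutputStream;

// Hypothetical helper class, not part of GroupDocs.Annotation.
final class DocxPackageSketch {

    /** Returns the names of all parts (ZIP entries) in a DOCX package, in stored order. */
    static List<String> listParts(InputStream docx) {
        List<String> names = new ArrayList<>();
        try (ZipInputStream zip = new ZipInputStream(docx)) {
            for (ZipEntry e = zip.getNextEntry(); e != null; e = zip.getNextEntry()) {
                names.add(e.getName());
            }
        } catch (IOException ex) {
            throw new UncheckedIOException(ex);
        }
        return names;
    }

    /** Builds a tiny stand-in package in memory; the part contents are placeholders. */
    static byte[] samplePackage() {
        String[] partNames = {"[Content_Types].xml", "word/document.xml", "word/comments.xml"};
        ByteArrayOutputStream bytes = new ByteArrayOutputStream();
        try (ZipOutputStream zip = new ZipOutputStream(bytes)) {
            for (String name : partNames) {
                zip.putNextEntry(new ZipEntry(name));
                zip.write("<placeholder/>".getBytes(StandardCharsets.UTF_8));
                zip.closeEntry();
            }
        } catch (IOException ex) {
            throw new UncheckedIOException(ex);
        }
        return bytes.toByteArray();
    }

    public static void main(String[] args) {
        // Print each part name of the sample package, one per line
        for (String part : listParts(new ByteArrayInputStream(samplePackage()))) {
            System.out.println(part);
        }
    }
}
```

To inspect a real document, pass a FileInputStream for the DOCX file to listParts instead of the in-memory sample.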


Extracting Annotations From Other Supported File Formats

A Java API for extracting annotations from documents and images. Extract annotations from some of the popular file formats listed below.

  • Extract Annotation from PDF (Adobe Portable Document Format)
  • Extract Annotation from DOC (Microsoft Word Document)
  • Extract Annotation from DOCM (Microsoft Word Macro-Enabled Document)
  • Extract Annotation from DOCX (Microsoft Word Open XML Document)
  • Extract Annotation from DOT (Microsoft Word Document Template)
  • Extract Annotation from DOTX (Word Open XML Document Template)
  • Extract Annotation from RTF (Rich Text Document)
  • Extract Annotation from ODT (Open Document Text)
  • Extract Annotation from XLS (Microsoft Excel Binary File Format)
  • Extract Annotation from XLSX (Microsoft Excel Open XML Spreadsheet)
  • Extract Annotation from XLSM (Microsoft Excel Macro-Enabled Spreadsheet)
  • Extract Annotation from XLSB (Microsoft Excel Binary Worksheet)
  • Extract Annotation from ODS (Open Document Spreadsheet)
  • Extract Annotation from PPT (PowerPoint Presentation)
  • Extract Annotation from PPTX (PowerPoint Open XML Presentation)
  • Extract Annotation from PPSX (PowerPoint Open XML Slide Show)
  • Extract Annotation from POTM (Microsoft PowerPoint Macro-Enabled Template)
  • Extract Annotation from PPTM (Microsoft PowerPoint Macro-Enabled Presentation)
  • Extract Annotation from PPS (Microsoft PowerPoint 97-2003 Slide Show)
  • Extract Annotation from ODP (OpenDocument Presentation)
  • Extract Annotation from HTML (HyperText Markup Language)
  • Extract Annotation from TIFF (Tagged Image File Format)
  • Extract Annotation from PNG (Portable Network Graphic)
  • Extract Annotation from MSG (Microsoft Outlook E-mail Message)
  • Extract Annotation from VSD (Microsoft Visio 2003-2010 Drawing)
  • Extract Annotation from VSDX (Microsoft Visio Drawing)
  • Extract Annotation from VSS (Microsoft Visio 2003-2010 Stencil)
  • Extract Annotation from VST (Microsoft Visio Drawing Template)
  • Extract Annotation from DWG (Autodesk Design Data Formats)
  • Extract Annotation from DXF (AutoCAD Drawing Interchange)
  • Extract Annotation from DCM (Digital Imaging and Communications in Medicine)
  • Extract Annotation from WMF (Windows Metafile)
  • Extract Annotation from EMF (Enhanced Metafile Format)
