Convert PDF to HTML in Java

PDF to HTML conversion with a few lines of Java code

Download Free Trial

About GroupDocs.Conversion for Java API

GroupDocs.Conversion for Java is an advanced file format conversion API for converting between popular image and document formats such as Microsoft Office, OpenDocument, PDF, HTML, email, CAD. and much more with just a few lines of code. The native API automatically detects the formats of the original documents and offers many options for customizing the converted documents. Along with the function of extracting information from a document, it also supports caching of the conversion results to the local disk by default. However, any type of cache storage can be supported by implementing the appropriate interfaces - Amazon S3, Dropbox, Google Drive, Windows Azure, Reddis, or any others.

Convert your PDF files to HTML files in Java. It only takes a couple of lines of Java code on any platform of your choice, such as Windows, Linux, macOS. You can try converting PDF to HTML for free and evaluate the quality of the conversion results. Along with simple file conversion scripts, you can try more sophisticated options for loading the PDF source file and storing the HTML output.

For example, for the source file PDF, you can use the following upload options:

  • automatic detection of the file format;
  • specify a password for protected files (if the file format supports it);
  • replace missing fonts to preserve the appearance of the document.

There are also advanced conversion options for the HTML file:

  • convert a specific page of a document or a range of pages;
  • add a watermark to the converted HTML.

Once the conversion is complete, you can save the HTML file to your local file path or to any third party storage such as FTP, Amazon S3, Google Drive, Dropbox etc. Please note - to convert PDF to HTML, you do not need to install any additional software, such as MS Office, Open Office, Adobe Acrobat Reader etc.

Steps to Convert PDF to HTML in Java

GroupDocs.Conversion allows developers to easily convert a PDF file to HTML with a few lines of code.

  • Create a new instance of the Converter class and upload the file PDF with the full path
  • Set ConvertOptions for document type to HTML.
  • Call the convert() method and pass the document name (full path) and format (HTML) as a parameter

System Requirements

Basic conversion using GroupDocs.Conversion for the Java API can be done with just a few lines of code. Our APIs are supported on all major platforms and operating systems. Before executing the code below, make sure you have the following prerequisites installed on your system.

  • Operating systems: Microsoft Windows, Linux, MacOS
  • Development environment: NetBeans, Intellij IDEA, Eclipse, etc.
  • Java runtime: J2SE 6.0 and above
  • Get the latest GroupDocs.Conversion for Java from Maven

// Load source file PDF for conversion
Converter converter = new Converter("input.pdf");
// Prepare conversion options for target format HTML
ConvertOptions convertOptions = new FileType().fromExtension("html").getConvertOptions();
// Convert to HTML format
converter.convert("output.html", convertOptions);

PDF to HTML Live Demo

Convert PDF to HTML now by visiting the GroupDocs.Conversion App website. The free demo has the following benefits

No need to download API

No need to write any code

Just upload the source file

Get download link to save the file

Other supported PDF conversions in Java

You can also convert PDF to many other file formats. Please see the list below.

Convert PDF to BMP

(Bitmap Image File)

Convert PDF to DCM

(DICOM Image)

Convert PDF to EMF

(Enhanced Metafile Format)

Convert PDF to EMZ

(Windows Compressed Enhanced Metafile)

Convert PDF to EPUB

(Open eBook File)

Convert PDF to GIF

(Graphical Interchange Format)

Convert PDF to JP2

(JPEG 2000 Core Image)

Convert PDF to JPEG

(Joint Photographic Expert Group Image)

Convert PDF to PDF

(Portable Document Format)

Convert PDF to PNG

(Portable Network Graphic)

Convert PDF to PSB

(Photoshop Large Document Format)

Convert PDF to PSD

(Photoshop Document)

Convert PDF to SVG

(Scalar Vector Graphics)

Convert PDF to SVGZ

(Compressed Scalable Vector Graphics)

Convert PDF to TEX

(LaTeX Source Document)

Convert PDF to TGA

(Truevision Graphics Adapter)

Convert PDF to TIFF

(Tagged Image File Format)

Convert PDF to WEBP

(Raster Web Image Format)

Convert PDF to WMF

(Windows Metafile)

Convert PDF to WMZ

(Compressed Windows Metafile)

Convert PDF to XPS

(XML Paper Specifications)

Back to top