Convert PDF to HTML

Portable Document Format (PDF) is a document format created by Adobe in the early 1990s. Its purpose was to introduce a standard for representing documents and other reference material independently of application software, hardware, and operating system. PDF files can be opened in Adobe Acrobat Reader as well as in most modern browsers such as Chrome, Safari, and Firefox, either natively or via extensions/plug-ins.

Steps to convert PDF to HTML in C#

GroupDocs.Conversion lets developers convert a PDF file to HTML format with just a few lines of code, as described below:

  • Create an instance of the Converter class and pass the source PDF file path as a constructor parameter. You may specify an absolute or a relative file path, depending on your requirements.
  • Create an instance of the MarkupConvertOptions class.
  • Call the Convert method of the Converter class, passing the file name for the converted HTML file and the MarkupConvertOptions object from the previous step as parameters.
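
// MarkupConvertOptions lives in this namespace
using GroupDocs.Conversion.Options.Convert;

// Load the source PDF file; the using block disposes the converter when done
using (var converter = new GroupDocs.Conversion.Converter("sample.pdf"))
{
    // Set the convert options for HTML format
    var options = new MarkupConvertOptions();
    // Convert to HTML format and write the result to converted.html
    converter.Convert("converted.html", options);
}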

Code Examples

More use cases and the complete C# sources of our backend and frontend examples are available to try for free.

PDF to HTML Live Demo

GroupDocs.Conversion for .NET provides an online PDF to HTML converter that you can try for free to check conversion quality and accuracy.
