Convert HTML to TXT

HTML (Hyper Text Markup Language) is the extension for web pages created for display in browsers. Known as language of the web, HTML has evolved with requirements of new information requirements to be displayed as part of web pages. The latest variant is known as HTML 5 that gives a lot of flexibility for working with the language. HTML pages are either received from server, where these are hosted, or can be loaded from local system as well.

Steps to convert HTML to TXT in C#

GroupDocs.Conversion allows developers to convert the HTML file to TXT format in an easy and intuitive way just using a few lines of code as described below:

  • Create an instance of Converter class and pass source HTML file path as a constructor parameter. You may specify absolute or relative file path as per your requirements.
  • Create an instance of WordProcessingConvertOptions class and set Format property to GroupDocs.Conversion.FileTypes.WordProcessingFileType.Txt.
  • Call Converter class Convert method and pass the filename for the converted TXT file and the WordProcessingConvertOptions object from the previous step as parameters.
// Load the source HTML file
using (var converter = new GroupDocs.Conversion.Converter("sample.html"))
{
    // Set the convert options for TXT format
   var options = new WordProcessingConvertOptions { Format = GroupDocs.Conversion.FileTypes.WordProcessingFileType.Txt };
    // Convert to TXT format
    converter.Convert("converted.txt", options);
}

Code Examples

Please find more use-cases and complete C# sources of our backend and frontend examples and try them for free!

HTML to TXT Live Demo

GroupDocs.Conversion for .NET provides an online HTML to TXT converter, which allows you to try it for free and check conversion quality and accuracy.

“Convert HTML to TXT”