Extract data from Microsoft Office Word documents

GroupDocs.Parser provides the functionality to extract data from Microsoft Office Word documents. Both classic (doc, dot) and Open XML (docx, dotx) formats are supported. Also LibreOffice Writer (OpenOffice.org Writer) formats and RTF are supported.

The following table provides the list of supported formats:

FormatDescription
DOCMicrosoft Office Word Document
DOTMicrosoft Office Word Document Template
DOCXMicrosoft Office Open XML Document
DOCMMicrosoft Office Open XML Macro-Enabled Document
DOTXMicrosoft Office Open XML Document Template
DOTMMicrosoft Office Open XML Document Macro-Enabled Template
TXTPlain text
ODTOpen Document Text
OTTOpen Document Text Template
RTFRich Text Format

More resources

GitHub examples

You may easily run the code above and see the feature in action in our GitHub examples:

Free online document parser App

Along with full featured .NET library we provide simple, but powerful free Apps.

You are welcome to parse documents and extract data from PDF, DOC, DOCX, PPT, PPTX, XLS, XLSX, Emails and more with our free online Free Online Document Parser App.

The following examples demonstrate how to extract data from Microsoft Office Word documents: