
Working with text

  • Extract text in Accurate mode
  • Extract text in Raw mode
  • Extract highlights
  • Search text
  • Working with formatted text
  • Extract text structure
  • Extract text areas
  • Detect encoding
  • Extract text by table of contents item