...

  • Spreadsheets
  • Presentations
  • Text documents
  • PDFs
  • OneNote sections
  • ZIP archives

The following text and presentation templates are also supported for text extraction:

  • dotx (Template)
  • dotm (Macro-enabled template)
  • ott (OpenDocument Text Template)
  • potx (Template)
  • potm (Macro-enabled template)
  • ppsm (Macro-enabled slideshow)
  • pptm (Macro-enabled presentation)
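
As a rough illustration, a caller might check a file's extension against the template formats listed above before attempting text extraction. The sketch below is plain Python and does not call GroupDocs.Parser; the names `TEMPLATE_EXTENSIONS` and `is_supported_template` are hypothetical helpers introduced only for this example.

```python
from pathlib import Path

# Template and macro-enabled formats from the list above.
# The constant name is hypothetical, not part of GroupDocs.Parser.
TEMPLATE_EXTENSIONS = {"dotx", "dotm", "ott", "potx", "potm", "ppsm", "pptm"}


def is_supported_template(filename: str) -> bool:
    """Return True if the file extension is one of the listed template formats."""
    # Path.suffix keeps the leading dot (e.g. ".DOTX"), so strip it and lowercase.
    extension = Path(filename).suffix.lstrip(".").lower()
    return extension in TEMPLATE_EXTENSIONS
```

The check is case-insensitive and returns False for files without an extension. It only inspects the file name, so it says nothing about whether the content is actually valid for that format.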

Metadata Extraction

The formats supported for metadata extraction, and the metadata properties that GroupDocs.Parser can extract from them, are listed below.

Formats (table columns): .docx, .doc, .dot, .odt, .xlsx, .xls, .ods, .pptx, .ppt, .odp, .pdf, .msg, .eml, .emlx, .epub, .fb2

Metadata Property Name (table rows):

  • Application
  • ApplicationVersion
  • Template
  • Title
  • Subject
  • Comments
  • Keywords
  • ContentStatus
  • Category
  • Manager
  • Author
  • LastAuthor
  • Company
  • HyperlinkBase
  • CreatedTime
  • LastSavedTime
  • LastPrintedTime
  • RevisionNumber
  • TotalEditingTime
  • EmailFrom
  • EmailTo
  • EmailCC
  • Description
  • Language
  • Copyrights
  • Publisher
  • PublishedDate
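
To give a feel for where some of these properties live in a file, the sketch below reads a few of them from the `docProps/core.xml` part of an Office Open XML package such as a .docx. It uses only the Python standard library and is not the GroupDocs.Parser API. The function name `read_core_metadata` and the mapping from the property names above to core-properties elements are assumptions made for this example.

```python
import xml.etree.ElementTree as ET
import zipfile

# XML namespaces used by docProps/core.xml in Office Open XML packages.
NAMESPACES = {
    "cp": "http://schemas.openxmlformats.org/package/2006/metadata/core-properties",
    "dc": "http://purl.org/dc/elements/1.1/",
    "dcterms": "http://purl.org/dc/terms/",
}

# Assumed correspondence between property names listed above and core.xml elements.
CORE_FIELDS = {
    "Title": "dc:title",
    "Subject": "dc:subject",
    "Author": "dc:creator",
    "Keywords": "cp:keywords",
    "LastAuthor": "cp:lastModifiedBy",
    "CreatedTime": "dcterms:created",
    "LastSavedTime": "dcterms:modified",
}


def read_core_metadata(package) -> dict:
    """Return the core properties present in an OOXML package (path or file object)."""
    with zipfile.ZipFile(package) as archive:
        root = ET.fromstring(archive.read("docProps/core.xml"))
    metadata = {}
    for name, tag in CORE_FIELDS.items():
        element = root.find(tag, NAMESPACES)
        # Skip properties that are absent or empty in this document.
        if element is not None and element.text:
            metadata[name] = element.text
    return metadata
```

Only properties actually present in the file are returned, which mirrors the idea that not every property is available for every document. Binary formats such as .doc or .xls, and email or e-book formats, store their metadata differently and are outside the scope of this sketch.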

The following text and presentation templates are also supported for metadata extraction:

  • dotx (Template)
  • dotm (Macro-enabled template)
  • ott (OpenDocument Text Template)
  • potx (Template)
  • potm (Macro-enabled template)
  • ppsm (Macro-enabled slideshow)
  • pptm (Macro-enabled presentation)

Encoding Detection

Supported

Not supported

...