GroupDocs.Parser for Java Features

Count Word Occurrences Across Single or Multiple Documents

Extract Text and Metadata from Excel Spreadsheets and PowerPoint Presentation Templates

Fetch Text from a File or Stream Without Installing a Document Reader

Pull Out Formatted Text from a Document Using Fast or Standard Text Extraction Mode

Detect the Media Type of Password-Protected XML Documents & Extract Text from Them

Fetch Formatted Text from PowerPoint Presentations, Emails & Attachments Programmatically

Extract Text from Single or Multiple Pages of a OneNote Document

Pull Out Raw Text from a Simple PDF File or a PDF Portfolio Document

Extract Data from PDF, MS Word, Excel and Presentation Documents

Extract Raw or Formatted Text from Cells, Rows and Columns of an Excel Spreadsheet

Gather Raw or HTML-Formatted Text from Word Documents & Extract Highlighted Text from Documents

Get Data from PDF Forms & Obtain Formatted Tables from a PDF or Word Document

Pull Out a Single Sentence or the Whole Text from EPUB, CHM, Markdown & FB2 Files

Extract the Table of Contents from Databases, PDF, EPUB, CHM & Word Processing Documents

Retrieve Text Areas from Documents for Analysis & Pull Out Text with Its Content Structure Intact

Obtain Metadata from Supported Document Formats

Extract All or Selected Images from Supported Formats & Rotate Extracted Image(s)

Extract Text from Files within Zip Archives & OST Containers – Detect Media Types for Zip Container Items

Fetch Data from Email Containers (Exchange Web Services, POP3, IMAP)

Extract Text from Database Containers in a Fast, Reliable and Efficient Manner

Find Simple Text, Whole Words & Regular Expression Matches within Documents

Prepare a Document Template, Extract Data from Documents and Analyze Data Fields & Tables

Search & Extract Highlighted Expressions in Documents

Pull Out Text with the Plain Text Formatter (Simple & ASCII) or Custom Formatting with Edges, Angles & Intersections

Fetch & Format Text (Font, Hyperlinks, Headings, Lists & Tables) with Markdown Formatter

Get Text with the HTML Formatter & Apply the Formatter to Paragraphs, Hyperlinks, Fonts, Headings, Lists & Tables

Move Table Layout & Detect Tables in a Rectangular Area by Column Separators

Extract Text from Shapes, WordArt Objects & Text Boxes within Microsoft Office File Formats

Extract Images to Files – Save to JPG, PNG, GIF, BMP or WEBP Formats

Extract Text from Email Servers and Databases via JDBC