Search Export
Outside In Search Export

Outside In Search Export extracts the text and metadata of over 400 supported file types and converts it into XML, HTML or text specifically designed for search and forensic applications. This SDK offers a rich feature set and the option of four output formats:

  • SearchML: Lightweight XML containing text, embeddings and metadata optimized for search and text extraction;
  • SearchHTML: HTML optimized for Web crawlers but with limited display formatting;
  • SearchText: Plain text file (UTF-8 encoded Unicode) with properties and body text from the input file;
  • PageML: XML which provides paginated text.
Its use is appropriate for search, forensics or any application that needs to extract content and convert it into a format conducive to post-processing and analysis.

  • Extracts text and metadata information from files
  • Developers can choose the output format most suitable to their application
  • Optional 'metadata only' mode extracts document properties to build metadata repositories or to quickly flag key documents for further processing
  • Optimized for performance and is designed for high-throughput server environments

Getting Started
Getting Started guide

Datasheets and Whitepapers
Outside In Technology datasheet
Outside In Technology supported formats
The Risks of Metadata and Hidden Information

 

 
Free Download

Left Curve
Outside In SDKs
Right Curve
 · Clean Content
 · Content Access
 · File ID
 ·
 ·
 ·
 ·
 ·
 ·

Left Curve
Technical Support
Right Curve
 · Outside In Technical Support

E-mail this page
Printer View Printer View
Oracle Is The Information Company About Oracle | Oracle RSS Feeds | Careers | Contact Us | Site Maps | Legal Notices | Terms of Use | Privacy
E-mail this page
Printer View Printer View
Oracle Is The Information Company About Oracle | Oracle RSS Feeds | Careers | Contact Us | Site Maps | Legal Notices | Terms of Use | Privacy