Clean Content 2013.1.0
Release Date: 11/13/2013
This SDK allows developers to extract and scrub text, metadata, and hidden information from Microsoft Office and PDF documents. Clean Content also includes a bridge to the Outside In core filters, which adds extraction for an additional 600 formats (Search Export required). The API is available in Java, C/C++, and .NET. Clean Content is a native Java technology, so the Java API can be used on any operating system with a Java Runtime Environment compatible with Java 1.5 or higher. The C/C++ API is available on Windows (x86-32 and x86-64) and Linux (x86-32 and x86-64). The .NET API is available on Windows (x86-32 and x86-64). This single download includes support for all of these platforms.
This SDK provides programmatic access to the text and metadata of the supported file types. It is optimized for performance, providing data to the host application interactively and in memory as the input file is processed. It is accessed via a C language API.
This SDK converts the supported file types into HTML, rendering embedded graphics as a choice of GIF, JPEG, or PNG. It uses a sophisticated template architecture, as well as a rich set of options, to allow a high degree of customization of the HTML output. It is accessed via a C language API.
This SDK converts the supported file types into images in a choice of TIFF, JPEG, JPEG2000, BMP, GIF, or PNG format. It offers numerous options, including the ability to size the image output from thumbnail to full size and to control image resolution. It is accessed via a C language API.
This SDK provides a framework in which to run Outside In Export Technology. It conforms to a Web services model, provides additional APIs (C, SOAP, Java), and hosts the exports in a separate pool of processes.
This SDK renders high-fidelity representations of the supported file types into a view window, without reliance on the native application. It provides file viewing, printing, copy/paste, and annotation capability. It is accessed via a C language API.
This SDK converts and normalizes the content of supported file types into XML defined by Outside In's "FlexionDoc" schema. This schema uses XML tags to describe document content, properties, structure and formatting. It is accessed via a C language API.