Informatica OCR Plugin

Posted by: Eccella Corporation

Demonstrates how image files (jpg, jpeg, pdf, etc.) can be converted into text and processed further using PowerCenter, Data Transformation Studio, and ABBYY FineReader.


The Informatica OCR plugin is a PowerCenter-based tool that leverages the image processing capabilities of ABBYY FineReader and the parsing capabilities of Informatica DT Studio to convert and process image files. The plugin consists of a simple PowerCenter workflow containing a mapping that triggers a DT service. The DT code uses Java to invoke the ABBYY engine on the server, which performs the initial conversion of the source files from image to text. The text is kept in memory, where the DT service can parse it according to business requirements and return the relevant data to PowerCenter.
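The Java step described above (invoke an OCR engine on the server and keep the resulting text in memory) might look roughly like the sketch below. This is not the plugin's actual code: the class name OcrInvoker and the method runAndCapture are hypothetical, and the command line is supplied by the caller because the real ABBYY FineReader Engine CLI invocation is not shown in this listing. It also assumes the OCR command writes the recognized text to standard output.

```java
import java.io.IOException;
import java.io.UncheckedIOException;
import java.nio.charset.StandardCharsets;
import java.util.List;

// Hypothetical helper, not part of the plugin: runs an external OCR command
// and returns whatever it prints to standard output as an in-memory string.
final class OcrInvoker {

    // Assumption: the command writes recognized text to stdout as UTF-8.
    static String runAndCapture(List<String> command) {
        try {
            Process process = new ProcessBuilder(command)
                    // Discard stderr so diagnostics do not mix with the text.
                    .redirectError(ProcessBuilder.Redirect.DISCARD)
                    .start();
            // Read all output before waiting, so a full pipe cannot block the child.
            byte[] output = process.getInputStream().readAllBytes();
            int exitCode = process.waitFor();
            if (exitCode != 0) {
                throw new IllegalStateException("OCR command exited with code " + exitCode);
            }
            return new String(output, StandardCharsets.UTF_8);
        } catch (IOException e) {
            throw new UncheckedIOException("Could not run OCR command", e);
        } catch (InterruptedException e) {
            Thread.currentThread().interrupt();
            throw new IllegalStateException("Interrupted while waiting for OCR command", e);
        }
    }

    public static void main(String[] args) {
        // Stand-in command for demonstration only; substitute the real OCR
        // engine command line from the vendor documentation.
        String text = runAndCapture(List.of("echo", "Invoice No: 12345"));
        System.out.println(text.trim());
    }
}
```

Because the text is returned as a string rather than written to disk, the caller can hand it straight to a parsing step, which mirrors the in-memory hand-off described above.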


The plugin can read and parse scanned text images. The input can be a file list of image files. Because the extracted text is held in memory, other business transformations can be applied on the fly.
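As an illustration of file-list input, the sketch below takes the lines of a file list (one path per line) and keeps only the entries that look like image files. It is a minimal sketch under stated assumptions, not the plugin's logic: the class name OcrFileList, the method selectImages, and the extension set (jpg, jpeg, pdf, taken from the formats named in this listing) are all hypothetical.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Locale;
import java.util.Set;

// Hypothetical helper, not part of the plugin: filters file-list entries
// down to paths with an image extension.
final class OcrFileList {

    // Assumed extension set; the formats actually supported may differ.
    private static final Set<String> IMAGE_EXTENSIONS = Set.of("jpg", "jpeg", "pdf");

    static List<String> selectImages(List<String> fileListLines) {
        List<String> selected = new ArrayList<>();
        for (String line : fileListLines) {
            String path = line.trim();
            int dot = path.lastIndexOf('.');
            // Skip blank lines and entries without an extension.
            if (path.isEmpty() || dot < 0) {
                continue;
            }
            // Compare extensions case-insensitively (".PDF" matches "pdf").
            String extension = path.substring(dot + 1).toLowerCase(Locale.ROOT);
            if (IMAGE_EXTENSIONS.contains(extension)) {
                selected.add(path);
            }
        }
        return selected;
    }

    public static void main(String[] args) {
        // Example paths are made up for demonstration.
        List<String> lines = List.of("/data/scans/invoice_001.jpg", "", "/data/scans/readme.txt");
        System.out.println(selectImages(lines));
    }
}
```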

Current Version: 1.1
Release Date: April 27, 2012

System Requirements:

  • Operating System: Red Hat Enterprise Linux 5.6 or later, or SUSE Linux Enterprise Server 10 or later.
  • RAM: 2 GB or more (recommended).
  • Hard disk: 10 MB of free space.
Informatica Product Requirements:
  • PowerCenter: the ETL job requires PowerCenter 8.5x or later.
  • Data Transformation Studio
Additional Software Requirements:
  • ABBYY FineReader Engine 9.0 CLI for Linux.



Eccella Corporation is a Business Intelligence and Application Development company with offices in New York, London, and Mumbai. Our consulting specialty is Data Management, with specific expertise in the Informatica platform, including PowerCenter, B2B, IDQ, ILM, MDM, and PowerExchange. Our clients include Fortune 500 companies and large organizations (government, etc.). We provide expert services in architecture, design, and development.

Our application development and software solutions are geared toward the small and medium business market, where we offer smaller businesses the tools and abilities of the larger players, allowing them to compete and manage their operations effectively.

Eccella was founded in 2010, focusing on Data Management consulting worldwide, and maintains offices in New York, London, and Mumbai.

Headquarters: Eccella Corporation, 545 8th Avenue, Suite 680, New York, NY 10028. Phone No: 1 (855)

Comments (2)

  • How can I download the DT service?

  • gud work