tFileOutputOCR – Talend Custom Components

tFileOutputOCR
Name: tFileOutputOCR
Icon: tFileOutputOCR
Author: bennatigiuliano
Resources:
Download:
Install Instructions:
Example: Coming soon…
Features:
Overview:

This component, based on Tesjeract, lets you convert any picture containing text into a .txt file.

![screenshot](https://talendforge.org/exchange/tos/upload_tos/extension-581/screenshot.jpg)

Release Notes:

Release version: 0.1 – 2012-07-16 15:22:09
This component, based on Tesjeract (http://code.google.com/p/tesjeract/), lets you convert any picture containing text into a .txt file.

Please read the Tesjeract FAQ:
tessdll.dll, tesjeract.dll and the tessdata directory need to be in C:/Windows/System32.
You also need the [Microsoft Visual C++ 2008 SP1 Redistributable Package](http://www.microsoft.com/downloads/details.aspx?familyid=a5c84275-3b97-4ab7-a40d-3802b2af5fc2).
See also the JVM stack settings: http://www.talendforge.org/forum/viewtopic.php?id=24838
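
Before running a job, it can help to confirm that the files named above are actually in place. The following is a minimal sketch, not part of the component: the class name `TesjeractPrereqCheck` and the method `missingIn` are hypothetical, and the default directory is simply the one mentioned in the FAQ note.

```java
import java.nio.file.Files;
import java.nio.file.Path;
import java.nio.file.Paths;
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

// Hypothetical helper: reports which Tesjeract prerequisites are absent from a directory.
class TesjeractPrereqCheck {
    // Names taken from the FAQ note: two DLLs and the tessdata directory.
    static final List<String> REQUIRED =
            Arrays.asList("tessdll.dll", "tesjeract.dll", "tessdata");

    // Returns the required entries that do not exist under dir, in declaration order.
    static List<String> missingIn(Path dir) {
        List<String> missing = new ArrayList<>();
        for (String name : REQUIRED) {
            if (!Files.exists(dir.resolve(name))) {
                missing.add(name);
            }
        }
        return missing;
    }

    public static void main(String[] args) {
        // Default location is the one stated in the FAQ note; pass another path to override.
        Path dir = Paths.get(args.length > 0 ? args[0] : "C:/Windows/System32");
        List<String> missing = missingIn(dir);
        if (missing.isEmpty()) {
            System.out.println("All Tesjeract files found in " + dir);
        } else {
            System.out.println("Missing in " + dir + ": " + missing);
        }
    }
}
```

The check only tests for existence; it does not verify DLL versions or that the Visual C++ redistributable is installed.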

Alternatively, simply run tesseract.exe from a tSystem component 😉
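
The tSystem route amounts to launching the Tesseract command-line program as an external process. As a rough sketch of the equivalent in plain Java, assuming `tesseract.exe` is on the PATH and using the basic `tesseract <image> <outputbase>` invocation (Tesseract appends `.txt` to the output base itself), it could look like this. The class name `TesseractCli` and its methods are hypothetical and not part of Talend or this component.

```java
import java.io.IOException;
import java.nio.file.Path;
import java.nio.file.Paths;
import java.util.Arrays;
import java.util.List;

// Hypothetical wrapper around the Tesseract command-line program.
class TesseractCli {
    // Builds the argument list: executable, input image, output base name (no extension).
    static List<String> buildCommand(String exe, Path image, Path outputBase) {
        return Arrays.asList(exe, image.toString(), outputBase.toString());
    }

    // Starts the process, forwards its console output, and returns its exit code.
    static int run(String exe, Path image, Path outputBase)
            throws IOException, InterruptedException {
        Process process = new ProcessBuilder(buildCommand(exe, image, outputBase))
                .inheritIO()
                .start();
        return process.waitFor();
    }

    public static void main(String[] args) throws Exception {
        if (args.length < 2) {
            System.err.println("usage: TesseractCli <image> <outputBase>");
            return;
        }
        // Assumption: the executable is reachable as "tesseract.exe" on the PATH.
        int exit = run("tesseract.exe", Paths.get(args[0]), Paths.get(args[1]));
        System.out.println("tesseract exited with code " + exit);
    }
}
```

In a Talend job the same command line would go into the tSystem component's command setting, with the image path and output base supplied from the flow or from context variables.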

Compatible:
  • 5.0 (obsolete)
  • 6.0 (obsolete)
  • 6.1 (obsolete)
  • 6.2 (obsolete)

Document retrieved from Talend Exchange.