tTikaExtractor | |
---|---|
Name: | tTikaExtractor |
Icon: | |
Author: | Fxp |
Resources: | |
Download: | |
Install Instructions: | |
Example: | Coming soon… |
Features: | |
Overview: | |
tTikaExtractor uses the Apache Tika parser to extract information from many different formats (HTML, PDF, DOC, ODT, image, audio, video, …). See http://tika.apache.org/1.0/formats.html for more information about the available parsers. ![screenshot](https://talendforge.org/exchange/tos/upload_tos/extension-475/screenshot.jpg) |
|
Release Notes: | |
Release version: 0.1 – 2012-01-25 17:03:59 If you have trouble parsing some formats, download the complete tika-app jar file from http://tika.apache.org/download.html and use it to replace the one included in this package. The bundled jar was trimmed so that the component could be uploaded to Exchange, which probably has a size limit of around 18 MB. |
|
Compatible: | |
|
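
The jar replacement described in the release notes can be done by hand. For anyone who prefers to script it, here is a minimal sketch using only the Java standard library. The class name `TikaJarSwap`, the `.bak` backup step, and both file paths are assumptions made for illustration; the actual name and location of the bundled jar depend on where the component is installed.

```java
import java.io.IOException;
import java.nio.file.Files;
import java.nio.file.Path;
import java.nio.file.Paths;
import java.nio.file.StandardCopyOption;

// Hypothetical helper, not part of the component.
public class TikaJarSwap {

    /**
     * Copies the downloaded full jar over the bundled (trimmed) jar.
     * If a bundled jar is present, it is first saved next to itself
     * with a ".bak" suffix. Returns the backup path, which exists only
     * when there was a bundled jar to back up.
     */
    public static Path replaceBundledJar(Path fullJar, Path bundledJar) throws IOException {
        if (!Files.isRegularFile(fullJar)) {
            throw new IOException("Downloaded jar not found: " + fullJar);
        }
        Path backup = bundledJar.resolveSibling(bundledJar.getFileName() + ".bak");
        if (Files.exists(bundledJar)) {
            // Keep the original so the change can be undone.
            Files.copy(bundledJar, backup, StandardCopyOption.REPLACE_EXISTING);
        }
        Files.copy(fullJar, bundledJar, StandardCopyOption.REPLACE_EXISTING);
        return backup;
    }

    public static void main(String[] args) throws IOException {
        if (args.length != 2) {
            System.err.println("Usage: java TikaJarSwap <downloaded-tika-app-jar> <bundled-jar>");
            return;
        }
        Path backup = replaceBundledJar(Paths.get(args[0]), Paths.get(args[1]));
        System.out.println("Replaced " + args[1] + " (backup: " + backup + ")");
    }
}
```

The copy keeps the bundled jar's file name, so nothing that refers to that name needs to change. Restart the Studio afterwards so that the new jar is picked up.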