Thursday 15 September 2011 10:45:08 am
This version of eZ Tika incorporates contributions from Felix Woldt and an updated tika.jar.
The main new feature is that the extension will work simply by activating it. No need to copy and modify files around your server if you don't need to (typically a server hosting just one installation of eZ Publish).
Besides the zero-config option, it is now possible to activate a dedicated eztika debugging setting that will log the text extraction success or failure status and also optionally keeps the temporary file containing the extracted text itself.
Downloads and more: http://projects.ez.no/eztika
eZ Tika is a binary file plugin and wrapper for the Apache Tika project which aims to extract plain text and meta-data from a large variety of files.
For more information on Apache Tika, visit http://tika.apache.org/
Happy indexing!
Paul