eZ Community » Learn » eZ Publish » eZ Find Demystified: Installing and...

eZ Find Demystified: Installing and configuring a multi-core Solr/eZ Find 2.6 instance with eZ Tika

Monday 14 May 2012 8:34:25 pm

  • Currently 5 out of 5 Stars.
  • 1
  • 2
  • 3
  • 4
  • 5

After completing this tutorial, you should be able to set up a single, multi-core eZ Find instance with eZ Tika. This will allow you to index multiple eZ Publish sites along with virtually any file type, as well as take advantage of eZ Find’s extended searching and sorting functionality.

The multi-core setup variant is useful even if you just have a single eZ Publish site – it doesn’t require any extra resources, and you have the ability to easily add more sites to the search server when the need arises.

 

Download as PDF

Introduction

eZ Find is a search extension that integrates the content management features of eZ Publish with the powerful, Lucene-based Apache Solr search server. Once installed and configured, it provides faster and more extensive search facilities as well as functionality to overcome some limitations in eZ Publish, such as selecting and sorting on an attribute common to more than one content class.

The eZ Tika extension is a binary file plugin and wrapper for the Apache Tika toolkit, which detects and extracts metadata and structured text content from various document types such as PDF, Word, Excel, ODF, and many more. Used together with eZ Find, virtually all content in an eZ Publish system can be indexed, categorised and searched.

The result is a search server that features enterprise-grade speed and features – the open source way!

It can be tricky to install and configure eZ Find properly, since several technologies are involved and there are some pitfalls to avoid. This tutorial aims to provide a step-by-step guide to the process, as well as give some glimpses into the workings of Solr and how eZ Find integrates it with eZ Publish.

 

Pre-requisites and target population

You should be familiar with the Linux command line and package administration, and have a good working knowledge of setting up and using eZ Publish version 4.x (or Community Project releases from 2011 and onward).

Technical requirements:

  • A running Linux server, preferably Debian 6, with command line and root or sudo access. The system should be configured with the prerequisites for eZ Publish (PHP, MySQL, web server etc.).
  • Two or more separate eZ Publish instances containing some content to index. (You can get by with just one eZ Publish instance if you skip the parts related to adding a second Solr core.)
 

Reference platform for this tutorial:

  • Debian 6 (“Squeeze”) Virtual Machine with 4GB RAM / 2 x 1.5 Ghz CPU / 200GB Hard Drive
  • eZ Publish Community Project 2012.3
  • MySQL 5.1.61
  • PHP 5.3.10

Although this tutorial uses Debian 6 as a reference platform, it should be applicable to other Linux distributions with only minor adaptations. All commands are assumed to be issued as root unless otherwise noted.

 
36 542 Users on board!

Tutorial menu

Printable

Printer Friendly version of the full article on one page with plain styles

Author(s)

Proudly Developed with from