
How to index asset files with Solr

If you want to index file contents (PDF files, for example), you’ll need Apache Tika, a content-extraction tool. Tika extracts the searchable text from a file so that it can be passed to Solr for indexing.

Using Tika in Drupal

To use Tika in Drupal, you need the “Apache Solr Attachments” module. Configure it as follows:

  • Extract using: “Tika (local java application)”
  • Tika directory path: “/usr/local/bin”
  • Tika jar file: “tika.jar”
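
Before relying on these settings, it can help to check that the jar at the configured path actually extracts text. The following is a minimal sketch in Python, assuming `java` is on the PATH and that the jar is the standard Tika application jar, whose `--text` option prints the plain-text content of a file. The function names and the sample file path are hypothetical, and the exact command the Drupal module runs may differ.

```python
import subprocess


def tika_command(tika_dir, tika_jar, file_path):
    """Build a command line that asks the Tika app jar for plain text.

    tika_dir and tika_jar mirror the two module settings above;
    file_path is the document to extract (hypothetical example input).
    """
    jar_path = tika_dir.rstrip("/") + "/" + tika_jar
    # --text is the Tika app option for plain-text output.
    return ["java", "-jar", jar_path, "--text", file_path]


def extract_text(tika_dir, tika_jar, file_path):
    """Run Tika and return the extracted text.

    Raises FileNotFoundError if java is not installed and
    subprocess.CalledProcessError if Tika exits with an error
    (for example, when the jar path is wrong).
    """
    result = subprocess.run(
        tika_command(tika_dir, tika_jar, file_path),
        capture_output=True,
        text=True,
        check=True,
    )
    return result.stdout


# Example usage (assumes /tmp/sample.pdf exists):
# print(extract_text("/usr/local/bin", "tika.jar", "/tmp/sample.pdf"))
```

If the call returns readable text, the directory path and jar file name entered in the module settings point at a working Tika installation.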