How to index asset files with Solr
If you want to index file contents (PDF files for example), you’ll need a parser software named Tika. This software will extract searchable content from a file and pass it to Solr for indexing.
Using Tika in Drupal
To use Tika in Drupal, you need the “Apache Solr Attachments” module. Configure it as follows:
- Extract using: “Tika (local java application)”
- Tika directory path: “/usr/local/bin”
- Tika jar file: “tika.jar”