Semaphore for Apache Solr Technical Specifications

Product Sheet

White Paper

Watch a video for Video

Video

Integrate Semaphore with Apache Solr for Improved Content Classification

The diagram shows the Semaphore core modules and the integration points with Solr.

Indexing Pipeline

There are two connection points from Semaphore to the Solr index:

  • A custom collector for a structured data feed.
  • A filter for the Nutch web crawler for a URL feed.

The components intercept the appropriate indexing pipeline task and send the content to the Semaphore Classification Server.  The resulting metadata is included in the Solr Index alongside the free-text index record of the content.

Query Pipeline

The Search Application Framework manages the query experience and exposes advanced taxonomy and ontology-driven capability like topic maps and facet navigation in addition to standard free text search and retrieval capabilities.