Content Classification

Content classification is the process of analyzing a document and adding metadata 'tags' that describe that document which are sourced from a taxonomy or other form of controlled vocabulary.

This can be to:.

  • Add subject metadata to third party content/records management systems to help organization of the content against file plans.
  • Perform entity extraction (companies, people, etc.) to drive a faceted search application.
  • Allow business process and workflow systems to route a document based on its content. For example, news can be routed to particular individual based on the subject matter in the article.

The Semaphore 'classification server' delivers innovative and powerful mechanisms to analyze and classify text by adding 'tags' to text documents, web pages or reports that indicate the subject, key dates, people and companies mentioned in the content turns masses of unstructured data into usable information..

The quality of the tags and the speed of processing are essential. Semaphore provides this automatic enterprise-quality content classification solution that underpins a semantically enhanced system.