Content classification is the process of analyzing a document and adding metadata 'tags' that describe that document which are sourced from a taxonomy or other form of controlled vocabulary.
This can be to:.
Add subject metadata to third party content/records management systems to help organization of the content against file plans.
Perform entity extraction (companies, people, etc.) to drive a faceted search application.
Allow business process and workflow systems to route a document based on its content. For example, news can be routed to particular individual based on the subject matter in the article.
The Semaphore 'classification server' delivers innovative and powerful mechanisms to analyze and classify text by adding 'tags' to text documents, web pages or reports that indicate the subject, key dates, people and companies mentioned in the content turns masses of unstructured data into usable information..
The quality of the tags and the speed of processing are essential. Semaphore provides this automatic enterprise-quality content classification solution that underpins a semantically enhanced system.
API stands for 'application programming interface', an interface implemented by a software program which enables it to interact with other software. It facilitates interaction between different software programs similar to the way the user interface facilitates interaction between humans and computers.
Semaphore "reads" a page of content and automatically suggests appropriate terms from the client’s taxonomy that can be applied to the meta data. This process uses natural language processing of the text and complex classification routines to ensure that the suggestions are consistent and appropriate (and therefore valuable).
The process of applying Terms or “tags” from a controlled vocabulary (often in the form of a taxonomy) to an item of content is known as classification or tagging. These tags are stored alongside the item’s content as metadata (data about data).
Classification, in relation to the Business Classification Scheme (BCS) is the systematic identification and arrangement of business activities and/or records into categories according to a logical structure or model.
A Business Classification Scheme is a conceptual model of an organization.
A web portal presents information from diverse sources in a unified way. A Business Information Portal is a web portal application that consolidates business information into a single application for a specific purpose.
Information is data that has been processed for a purpose (for example, classified, aggregated, sorted). Business Information Systems encompass the business process and technologies that manage this information so that it can ultimately be presented to inform the decision making process.
Categorization is similar to classification in that it is a process that seeks to describe an item by reference to a formal controlled vocabulary such as a taxonomy, business classification scheme or file plan. In document management, this is normally related to the actual storage of the item (a file can only live in one folder) as opposed to classification (where a file can have many metadata values).
Categorization management involves assigning agreed category label (typically sourced from a Business Classification Scheme or a File Plan) to an item for filing. The item can only be assigned to one category. The category often has an associated records management process such as a retention or disposal schedule.
Classification is the process of adding metadata ("tags") to an item (a document, article, picture, etc.) to provide a description of that item abstracted from the item's content. Classification can involve adding one or more metadata values to the item, such as "author", "subject" and "target audience".