Technology
Information
Extraction
Trifeed's
technology transforms text written by human, such as news articles, scientific
documents, e-mails, forum messages and more, into structured data. Utilizing
high level of linguistic analysis, pattern recognition and a large knowledge
base, it extracts, automatically and accurately, information of many types
out of the organization content.
Named Entity Extractor - Identifies names of
people, companies, organizations, places, events, books, movies, dates,
sums of money and many more types of entities. It also identifies the
entity attributes, such as position for person, and normalizes the entity
name.
Quotes Extractor - Identifies quotations
and their speakers (see our Who-Said-What
web site).
Relations Extractor - Extracts relation between
two entities. For example: The phrase "John Smith, vice president Business
Development of IBM" is extracted as a relation between an entity of the
type person with the name of John Smith and an entity of the type company
with the name IBM, and the relation name is "works for".
Concept extractor - Extracts main concepts
within a group of documents.
Classifier - Classifies a document to the
categories it belongs to based on a pre-trained taxonomy.
Summarizer - Creates a summary based on chosen
sentences that brings out the essence of the document.
Similar Documents Locator - Links a leading document to a group of similar documents in the system.
Clusters Generator - Creates clusters of
similar documents within a group of documents. A group of documents may
be a search result. The Clusters Generator divides the documents retrieved
into clusters of documents close to each other by content. |
|
| |
| |
|
| ©
Trifeed Ltd., 2004 |
|
|
|