What it does
Identifies and extracts useful entities from text. The supported entity types are: Keywords, Taxonomy Terms, People, Organizations, Locations, Addresses, Phone Numbers, Email Addresses, URLs, Times, Dates, Ages, Money Amounts, and Credit Cards. Address extraction is currently implemented for the following regions: New Zealand, Australia, the United Kingdom, the United States, and Germany. Phone number extraction is currently implemented for New Zealand formats only.
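To make the idea of "entities as structured records" concrete, here is a minimal sketch that pulls two of the simpler entity types (Email Addresses and URLs) out of a string. It is an illustration only: the regular expressions, the `extract_entities` function and the record fields are assumptions made for this example, not the API's actual method or response format.

```python
import re

# Deliberately simple patterns for illustration; production-grade
# extraction needs far more robust rules than these.
EMAIL_RE = re.compile(r"[A-Za-z0-9._%+-]+@[A-Za-z0-9.-]+\.[A-Za-z]{2,}")
URL_RE = re.compile(r"https?://[^\s,;]+")


def extract_entities(text):
    """Return a list of {"type", "value", "start"} records, in text order."""
    entities = []
    for entity_type, pattern in (("Email Address", EMAIL_RE), ("URL", URL_RE)):
        for match in pattern.finditer(text):
            entities.append(
                {"type": entity_type, "value": match.group(), "start": match.start()}
            )
    # Sort by position so the records follow the order of the source text.
    return sorted(entities, key=lambda e: e["start"])


sample = "Contact jane@example.com or visit https://example.com/docs for details."
for entity in extract_entities(sample):
    print(entity["type"], "->", entity["value"])
```

Entity types such as People, Organizations or Addresses cannot be captured with patterns this simple, which is where a dedicated extraction service earns its keep.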
What it is for
Entity extraction converts unstructured text into structured data. This opens up many use cases: automatically assigning metadata that describes a document's content, thereby improving search; placing a collection of texts geographically on a map; generating analytical insight by counting occurrences of entities within large documents.
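The last use case, counting entity occurrences, can be sketched in a few lines once entities are available as structured data. The `(type, value)` pair layout and the sample values below are hypothetical and chosen only for illustration; they are not taken from real API output.

```python
from collections import Counter

# Hypothetical extraction output: (entity type, entity value) pairs.
extracted = [
    ("Organization", "FAO"),
    ("Location", "Wellington"),
    ("Organization", "FAO"),
    ("Person", "Jane Smith"),
    ("Location", "Wellington"),
    ("Organization", "FAO"),
]


def count_entities(entities):
    """Count how often each (type, value) pair occurs."""
    return Counter(entities)


def top_entities(entities, entity_type, n=3):
    """Return the n most frequent values of one entity type."""
    counts = Counter(value for etype, value in entities if etype == entity_type)
    return counts.most_common(n)


print(top_entities(extracted, "Organization"))
print(top_entities(extracted, "Location"))
```

The same counts could feed a search index as metadata or, for Location entities, be plotted on a map.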
What the benefits are
Entity extraction removes the need for manual work: the key facts and players in any document are assembled instantly. An automated process ensures accuracy and consistency, making the assignment of metadata both more reliable and more efficient. Large document collections can be organized and searched faster and more productively.

We provide a number of publicly available taxonomies by default as part of our API. The table below lists the vertical, language and source of each taxonomy. Customers may also plug in one or more taxonomies of their own in either of the standard formats, OBO or SKOS.
Vertical          Language       Taxonomy Description
Agriculture       English        Agriculture Information Management Standards by the FAO
Alcohol & Drugs   English        Alcohol and Drugs Thesaurus by the NIAAA
Government        English (NZ)   New Zealand Public Sector Vocabulary
Government        English (UK)   Integrated Public Sector Vocabulary (IPSV) by esd, UK
Pharmaceuticals   English        Conditions, Symptoms & Drugs by Pingar
Public Affairs    English        Australian Public Affairs Information Service thesaurus by NLA
Government        Chinese        Chinese (traditional) Public Sector Vocabulary
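
For customers preparing their own taxonomy in SKOS, the sketch below shows roughly what a very small SKOS vocabulary looks like and how its concepts can be read. It assumes the RDF/XML serialization with top-level `skos:Concept` elements; the `example.org` URIs, the sample concepts and the `load_concepts` helper are hypothetical, and the way a taxonomy is actually submitted to the API is not shown here.

```python
import xml.etree.ElementTree as ET

RDF_NS = "http://www.w3.org/1999/02/22-rdf-syntax-ns#"
SKOS_NS = "http://www.w3.org/2004/02/skos/core#"

# Hypothetical two-concept taxonomy, serialized as RDF/XML.
SAMPLE_SKOS = """<rdf:RDF xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#"
         xmlns:skos="http://www.w3.org/2004/02/skos/core#">
  <skos:Concept rdf:about="http://example.org/taxonomy/crops">
    <skos:prefLabel xml:lang="en">Crops</skos:prefLabel>
  </skos:Concept>
  <skos:Concept rdf:about="http://example.org/taxonomy/wheat">
    <skos:prefLabel xml:lang="en">Wheat</skos:prefLabel>
    <skos:altLabel xml:lang="en">Common wheat</skos:altLabel>
    <skos:broader rdf:resource="http://example.org/taxonomy/crops"/>
  </skos:Concept>
</rdf:RDF>
"""


def load_concepts(skos_xml):
    """Map each concept URI to its preferred label, alternative labels and parent."""
    root = ET.fromstring(skos_xml)
    concepts = {}
    for concept in root.findall(f"{{{SKOS_NS}}}Concept"):
        uri = concept.get(f"{{{RDF_NS}}}about")
        pref = concept.find(f"{{{SKOS_NS}}}prefLabel")
        broader = concept.find(f"{{{SKOS_NS}}}broader")
        concepts[uri] = {
            "label": pref.text if pref is not None else None,
            "alt_labels": [a.text for a in concept.findall(f"{{{SKOS_NS}}}altLabel")],
            "broader": broader.get(f"{{{RDF_NS}}}resource") if broader is not None else None,
        }
    return concepts


for uri, info in load_concepts(SAMPLE_SKOS).items():
    print(uri, info)
```

Preferred and alternative labels are the terms an extractor would typically match in text, while `skos:broader` links carry the hierarchy of the taxonomy.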

