Information Discovery vs. Apache UIMA

Information Discovery contains a 100% UIMA compatible text mining platform. It offers numerous annotators for the semantic analysis of text. Our annotators are multilingual and allow text analysis in various languages. Depending on the task we exploit rule based or trainable (machine-learning) based approaches. All trainable annotators come with tools for the creation of new models in new languages or genres. In addition to standard models based on news paper we offer a wide variety of biomedical annotators for text analysis of research litature, patents and medical text.

Would you like to see a demo?

We would be glad to present our products to you and create a demonstration based on your selected data repositories.

Framework Information Discovery V4.X Apache UIMA V2.7
UIMA Java Framework
UIMA C++ Framework
UIMA Default Viewers & Tooling
PEAR Packaging Facilities
UIMA-AS Scaleout Framework
UIMA-AS in the Cloud
Infrastructure Averbis Extraction Platform V3.3 Apache UIMA V2.4
Simple Server (UIMA REST service) Add-On Add-On
Generic Typesystem
Web-based Annotation Client
Scripting Language for Pipeline Configuration
Core Components
Core Components Averbis Extraction Platform V3.3  Apache UIMA V2.4
Collection Readers (CR)
Simple File Reader Add-On
XMI Reader Add-On
Generic XML Reader
Generic Database Reader
Tika Annotator Add-On
Document Zoning
Language Detection
Document Classification
Sentence Splitting, Rule Based Add-On
Sentence Splitting, Trainable
Tokenization, Rule Based Add-On
Tokenization, Trainable
Part-Of-Speech Recognition
Shallow Parsing / Chunking
Stemming Add-On
Morphological Analyis
Stopword Recognition Add-On
Invariant Recognition
Acronym and Abbreviation Resolution
Regular Expression Annotator Add-On
Lemmatizer, Lexicon Based
Concept Recognition Add-On
Named Entity Recognition, Trainable
Concept Disambiguation
Keyword-Extraction, Controlled and Uncontrolled
Evaluation Modules
Table Format Recognition
UIMA Default Annotators
(HMM Tagger, BSF
Annotator, Alchemi, OpenCalais)
Add-On Add-On
Drools Annotator
Relation Extraction, Trainable
CAS Consumer (CC)
XML Writer Add-On
Lucene CAS Indexer (Lucas) Add-On
Solr CAS Consumer (Solrcas)
DB Writer
Flow Controller
Document Language Flow Controller
Document Category Flow Controller
Biomedical Components
Biomedical Components Averbis Extraction Platform V3.3 Apache UIMA V2.4
Medline Reader
Biomedical Sentence Splitter
Biomedical Tokenizer
Negation Annotator
Number Annotator
Disease Annotator
Anatomy Annotator
Drug Annotator
Gene Tagger (Uniprot, EntrezGene)
ChemSpot Annotator