Scroll Top

INFORMATION DISCOVERY VS. APACHE UIMA

Information Discovery contains a 100% UIMA compatible text mining platform. It offers numerous annotators for the semantic analysis of text. Our annotators are multilingual and allow text analysis in various languages. Depending on the task we exploit rule based or trainable (machine-learning) based approaches.

All trainable annotators come with tools for the creation of new models in new languages or genres. In addition to standard models based on news paper we offer a wide variety of biomedical annotators for text analysis of research litature, patents and medical text.

Framework Information Discovery Apache UIMA
UIMA Java Framework yes yes
UIMA C++ Framework yes yes
UIMA Default Viewers & Tooling yes yes
PEAR Packaging Facilities yes yes
UIMA-AS Scaleout Framework yes yes
UIMA-AS in the Cloud yes no
Framework Information Discovery Apache UIMA
Simple Server (UIMA REST service) Add-On Add-On
Generic Typesystem yes no
Web-based Annotation Client yes no
Scripting Language for Pipeline Configuration yes no
Framework Information Discovery Apache UIMA
Collection Readers (CR)
Simple File Reader yes Add-On
XMI Reader yes Add-On
Generic XML Reader yes no
Generic Database Reader yes no
Annotators
Tika Annotator yes Add-On
Document Zoning yes no
Language Detection yes no
Document Classification yes no
Sentence Splitting, Rule Based yes Add-On
Sentence Splitting, Trainable yes no
Tokenization, Rule Based yes Add-On
Tokenization, Trainable yes no
Part-Of-Speech Recognition yes no
Shallow Parsing / Chunking yes no
Stemming yes Add-On
Morphological Analyis yes no
Decompounding yes no
Stopword Recognition yes Add-On
Invariant Recognition yes no
Acronym and Abbreviation Resolution yes no
Regular Expression Annotator yes Add-On
Lemmatizer, Lexicon Based yes no
Concept Recognition yes Add-On
Named Entity Recognition, Trainable yes no
Concept Disambiguation yes no
Keyword-Extraction, Controlled and Uncontrolled yes no
Evaluation Modules yes no
Table Format Recognition yes no
UIMA Default Annotators
(HMM Tagger, BSF
Annotator, Alchemi, OpenCalais)
Add-On Add-On
Drools Annotator yes no
Relation Extraction, Trainable yes no
CAS Consumer (CC)
XML Writer yes Add-On
Lucene CAS Indexer (Lucas) yes Add-On
Solr CAS Consumer (Solrcas) yes no
DB Writer yes no
Flow Controller
Document Language Flow Controller yes no
Document Category Flow Controller yes no
Framework Information Discovery Apache UIMA
Medline Reader yes no
Biomedical Sentence Splitter yes no
Biomedical Tokenizer yes no
Negation Annotator yes no
Number Annotator yes no
Disease Annotator yes no
Anatomy Annotator yes no
Drug Annotator yes no
Gene Tagger (Uniprot, EntrezGene) yes no
ChemSpot Annotator yes no

Start finding Answers in your Data today

We would be glad to present our products to you and create a demonstration based on your selected data repositories.