INFORMATION DISCOVERY VS. APACHE UIMA

Information Discovery contains a 100% UIMA compatible text mining platform. It offers numerous annotators for the semantic analysis of text. Our annotators are multilingual and allow text analysis in various languages. Depending on the task we exploit rule based or trainable (machine-learning) based approaches.

All trainable annotators come with tools for the creation of new models in new languages or genres. In addition to standard models based on news paper we offer a wide variety of biomedical annotators for text analysis of research litature, patents and medical text.

FrameworkInformation DiscoveryApache UIMA
UIMA Java Frameworkyesyes
UIMA C++ Frameworkyesyes
UIMA Default Viewers & Toolingyesyes
PEAR Packaging Facilitiesyesyes
UIMA-AS Scaleout Frameworkyesyes
UIMA-AS in the Cloudyesno
FrameworkInformation DiscoveryApache UIMA
Simple Server (UIMA REST service)Add-OnAdd-On
Generic Typesystemyesno
Web-based Annotation Clientyesno
Scripting Language for Pipeline Configurationyesno
FrameworkInformation DiscoveryApache UIMA
Collection Readers (CR)
Simple File ReaderyesAdd-On
XMI ReaderyesAdd-On
Generic XML Readeryesno
Generic Database Readeryesno
Annotators
Tika AnnotatoryesAdd-On
Document Zoningyesno
Language Detectionyesno
Document Classificationyesno
Sentence Splitting, Rule BasedyesAdd-On
Sentence Splitting, Trainableyesno
Tokenization, Rule BasedyesAdd-On
Tokenization, Trainableyesno
Part-Of-Speech Recognitionyesno
Shallow Parsing / Chunkingyesno
StemmingyesAdd-On
Morphological Analyisyesno
Decompoundingyesno
Stopword RecognitionyesAdd-On
Invariant Recognitionyesno
Acronym and Abbreviation Resolutionyesno
Regular Expression AnnotatoryesAdd-On
Lemmatizer, Lexicon Basedyesno
Concept RecognitionyesAdd-On
Named Entity Recognition, Trainableyesno
Concept Disambiguationyesno
Keyword-Extraction, Controlled and Uncontrolledyesno
Evaluation Modulesyesno
Table Format Recognitionyesno
UIMA Default Annotators
(HMM Tagger, BSF
Annotator, Alchemi, OpenCalais)
Add-OnAdd-On
Drools Annotatoryesno
Relation Extraction, Trainableyesno
CAS Consumer (CC)
XML WriteryesAdd-On
Lucene CAS Indexer (Lucas)yesAdd-On
Solr CAS Consumer (Solrcas)yesno
DB Writeryesno
Flow Controller
Document Language Flow Controlleryesno
Document Category Flow Controlleryesno
FrameworkInformation DiscoveryApache UIMA
Medline Readeryesno
Biomedical Sentence Splitteryesno
Biomedical Tokenizeryesno
Negation Annotatoryesno
Number Annotatoryesno
Disease Annotatoryesno
Anatomy Annotatoryesno
Drug Annotatoryesno
Gene Tagger (Uniprot, EntrezGene)yesno
ChemSpot Annotatoryesno

Start finding Answers in your Data today

We would be glad to present our products to you and create a demonstration based on your selected data repositories.