2011-05-14

Apache UIMA

UIMA

Apache UIMA
Developer(s) IBM, Apache Software Foundation (since October 2006)
Stable release 2.3.0 / January 26, 2010
Written in Java with C++ Enablement
Operating system Cross-platform
Type Text mining, Information Extraction
License Apache License 2.0
Website http://uima.apache.org/

UIMA stands for Unstructured Information Management Architecture. An OASIS standard as of March 2009, UIMA is to date the only industry standard for .

UIMA is a component software architecture for the development, discovery, composition, and deployment of for the analysis of unstructured information and its integration with search technologies developed by IBM. The source code for a reference implementation of this framework has been made available on SourceForge, and later on the website of the Apache Software Foundation.

An example is a logistics analysis software system that could convert unstructured data such as repair logs and service notes into relational tables. These tables can then be used by automated tools to detect maintenance or manufacturing problems.

Other examples are systems that are used in medical environments to analyze clinical notes.

Structure of UIMA

The UIMA architecture can be thought of in four dimensions:

  1. It specifies component interfaces in an analytics pipeline
  2. It describes a set of Design patterns
  3. It suggests two data representations: an in-memory representation of annotations for high-performance analytics and an XML representation of annotations for integration with remote web services.
  4. It suggests development roles allowing tools to be used by users with diverse skills

IBM Watson - The Jeopardy Challenge

In February 2011 a computer from IBM Research named Watson won a competition on Jeopardy! against Ken Jennings and Brad Rutter - two formerly undefeated Jeopardy! champions. Watson is a highly advanced computer from IBM Research that uses UIMA for real-time content analytics.

See also

External links

UIMA wrappers
  • Collection of UIMA NLP Tools of the JULIE Lab
  • U-Compare - an integrated UIMA-based NLP system with wrappers for popular NaCTeM and Tsuji Lab tools
  • LanguageWare provides libraries and tooling for building UIMA analyzers. Download from Alphaworks
  • ClearTK - provides a framework for developing natural language processing (NLP) UIMA components that make use of statistical machine learning.
  • OpenNLP is a Java machine learning toolkit for natural language processing (NLP).


Powered by UIMA





Retrieved from : http://en.wikipedia.org/wiki/UIMA