Productification

Intersection between Technology, User Experience and Product Innovation

30 Nov

UIMA and GATE Applications over Hadoop


There is a new project on google code called Behemoth that utilizes Hadoop to achieve scale for GATE and UIMA applications. Here is a more detailed post from Julien:

"Behemoth allows to deploy GATE or UIMA applications over a Hadoop cluster in
order to do very large scale document analysis. It uses a very simple
representation format which can be used as a common ground between UIMA and
GATE-generated annotations, hence achieving compatibility between both
systems. Since it is Hadoop-based it benefits from all its features
(scalability, fault-tolerance, etc…) and most notably the back up of a
thriving open source community. Quite a few Apache resources already do or
will fit into it: Nutch, Tika, Mahout, Hbase etc…"

Posted via email from Sameer’s posterous


Filed under: General

Post a Comment