30 Nov
UIMA and GATE Applications over Hadoop
There is a new project on google code called Behemoth that utilizes Hadoop to achieve scale for GATE and UIMA applications. Here is a more detailed post from Julien:
"Behemoth allows to deploy GATE or UIMA applications over a Hadoop cluster inorder to do very large scale document analysis. It uses a very simple
representation format which can be used as a common ground between UIMA and
GATE-generated annotations, hence achieving compatibility between both
systems. Since it is Hadoop-based it benefits from all its features
(scalability, fault-tolerance, etc…) and most notably the back up of a
thriving open source community. Quite a few Apache resources already do or
will fit into it: Nutch, Tika, Mahout, Hbase etc…"
Posted via email from Sameer’s posterous
