ARPA - Automatic Text Annotation System

Note: see our more recent work on this topic in the Dynamic Configurable Entity Recognition from Text project!

ARPA is a web service for automatic text annotation. It extracts the main concepts or topics of a text, giving a quick overview of the text in both human-readable and machine-readable form.

To generate the annotations, ARPA can use different annotation engines. The engine used in the ARPA demo is Maui - Multi-purpose automatic topic indexing system. For an annotation task, Maui is given an ontology, hand-annotated training texts, and a word lemmatizer or stemmer. From the training texts, Maui learns to annotate new texts with the concepts in the ontology. ARPA manages the configurations of the annotation engines across different annotation projects.
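As an illustration of what a per-project engine configuration might hold, here is a minimal sketch in Python. It is not ARPA's actual data model: the class names, field names, and file paths are all hypothetical, chosen only to mirror the inputs listed above (engine, ontology, training texts, stemmer).

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class AnnotationProject:
    """Hypothetical configuration record for one annotation project."""
    name: str           # project identifier (assumed)
    engine: str         # annotation engine, e.g. "maui"
    ontology_path: str  # ontology whose concepts are used as annotations
    training_dir: str   # directory of hand-annotated training texts
    stemmer: str        # lemmatizer or stemmer to apply to words


class ProjectRegistry:
    """Hypothetical in-memory store mapping project names to configurations."""

    def __init__(self):
        self._projects = {}

    def register(self, project):
        # Refuse to silently overwrite an existing project configuration.
        if project.name in self._projects:
            raise ValueError(f"project already registered: {project.name}")
        self._projects[project.name] = project

    def get(self, name):
        return self._projects[name]


registry = ProjectRegistry()
registry.register(
    AnnotationProject(
        name="demo",
        engine="maui",
        ontology_path="ontologies/demo.rdf",  # placeholder path
        training_dir="training/demo",         # placeholder path
        stemmer="english",
    )
)
```

Keeping each project as an immutable record makes it straightforward to run several projects side by side, each with its own engine and ontology.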

ARPA is a web service written in Java that runs in a Tomcat environment. It exposes an HTTP GET interface that returns XML.
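A client for such an interface could look roughly like the sketch below. The page does not document the endpoint, so everything specific here is an assumption: the base URL, the `project` and `text` query parameters, and the XML layout of the response are invented for illustration. The sketch only builds the request URL and parses a sample response; it does not contact a server.

```python
from urllib.parse import urlencode
import xml.etree.ElementTree as ET


def build_request_url(base_url, project, text):
    """Build a GET URL; the parameter names are hypothetical."""
    query = urlencode({"project": project, "text": text})
    return f"{base_url}?{query}"


# Hypothetical response; the real ARPA XML schema is not given in this page.
SAMPLE_RESPONSE = """<annotations>
  <annotation uri="http://example.org/onto/c1" label="forestry" score="0.82"/>
  <annotation uri="http://example.org/onto/c2" label="climate" score="0.41"/>
</annotations>"""


def parse_annotations(xml_text):
    """Return (label, uri, score) tuples from the assumed XML layout."""
    root = ET.fromstring(xml_text)
    return [
        (node.get("label"), node.get("uri"), float(node.get("score")))
        for node in root.findall("annotation")
    ]


url = build_request_url("http://localhost:8080/arpa", "demo", "forest fires")
annotations = parse_annotations(SAMPLE_RESPONSE)
```

Against a real deployment, the URL would be fetched with `urllib.request.urlopen` and the response body passed to the parser, after adjusting the parameter and element names to the service's actual interface.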

Articles

2013

2011

Contact: