ARPA - Automatic Text Annotation System
Note: see our more recent work on this topic in the Dynamic Configurable Entity Recognition from Text project!
ARPA is a web service for automatic text annotation. It extracts the main concepts or topics of a text, giving a quick overview of the text in both human-readable and machine-readable form.
ARPA can use different annotation engines to generate the automatic annotations. The annotation engine used in the ARPA demo is Maui (Multi-purpose automatic topic indexing system). For the annotation task, Maui is given an ontology, hand-annotated training texts, and a word lemmatizer or stemmer. From the training texts, Maui learns to annotate new texts with the concepts in the ontology. ARPA itself manages the configurations of the annotation engines across different annotation projects.
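To make the roles of the ontology and the stemmer concrete, here is a toy sketch in Python of ontology-constrained annotation. It is not Maui's algorithm: the learning step from hand-annotated training texts is left out entirely, and concepts are simply ranked by how often their label occurs. The `stem` and `annotate` functions, the concept identifiers, and the sample ontology are all hypothetical and exist only for illustration.

```python
import re
from collections import Counter


def stem(word: str) -> str:
    """Crude stand-in for a real lemmatizer or stemmer: drop a trailing 's'."""
    word = word.lower()
    if len(word) > 3 and word.endswith("s"):
        return word[:-1]
    return word


def annotate(text: str, ontology: dict, top_n: int = 3) -> list:
    """Return up to top_n (concept_id, hits) pairs, most frequent first.

    `ontology` maps a (hypothetical) concept id to its label. A concept is
    a hit wherever its stemmed label tokens appear consecutively in the
    stemmed text.
    """
    tokens = [stem(t) for t in re.findall(r"\w+", text)]
    counts = Counter()
    for concept_id, label in ontology.items():
        label_tokens = [stem(t) for t in label.split()]
        n = len(label_tokens)
        hits = sum(
            1
            for i in range(len(tokens) - n + 1)
            if tokens[i:i + n] == label_tokens
        )
        if hits:
            counts[concept_id] = hits
    return counts.most_common(top_n)


# Hypothetical mini-ontology: concept id -> label.
ONTOLOGY = {"c1": "museum", "c2": "oil painting", "c3": "sculpture"}

SAMPLE = (
    "The museum shows oil paintings. Museums lend oil paintings "
    "and one oil painting was restored."
)

if __name__ == "__main__":
    print(annotate(SAMPLE, ONTOLOGY))
```

A trained engine would replace the raw frequency ranking with a model learned from the hand-annotated texts; the sketch only shows why both the ontology labels and the text need to pass through the same normalizer before matching.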
ARPA is a web service written in Java that runs in a Tomcat environment. It provides an HTTP GET interface that returns XML.
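A client for an HTTP GET interface that returns XML might look roughly like the Python sketch below. The page does not document the actual endpoint, parameter names, or response schema, so the base URL, the `text` query parameter, and the `<concept uri="..." label="..."/>` element are all assumptions made up for this example.

```python
import xml.etree.ElementTree as ET
from urllib.parse import urlencode

# Hypothetical endpoint; the real service URL is not given in this page.
BASE_URL = "http://example.org/arpa/annotate"


def build_request_url(text: str, base_url: str = BASE_URL) -> str:
    """Encode the text to annotate as a GET query parameter (assumed name: text)."""
    return base_url + "?" + urlencode({"text": text})


def parse_annotations(xml_payload: str) -> list:
    """Extract (uri, label) pairs from an XML response.

    Assumes each annotation is a <concept> element with 'uri' and 'label'
    attributes; the real response format may differ.
    """
    root = ET.fromstring(xml_payload)
    return [(el.get("uri"), el.get("label")) for el in root.iter("concept")]


# Made-up response used to exercise the parser without a network call.
SAMPLE_RESPONSE = """<annotations>
  <concept uri="http://example.org/onto/c2" label="oil painting"/>
  <concept uri="http://example.org/onto/c1" label="museum"/>
</annotations>"""

if __name__ == "__main__":
    print(build_request_url("oil painting"))
    print(parse_annotations(SAMPLE_RESPONSE))
```

With a real endpoint, the URL from `build_request_url` could be fetched with `urllib.request.urlopen` and the decoded response body passed to `parse_annotations`.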
Articles
2013
2011
Contact:
- Eetu Mäkelä, researcher
- Eero Hyvönen, professor


