Eero Hyvönen, Patrik Boman, Heikki Rantala, Annastiina Ahola and Petri Leskinen: ConfermentSampo - A Knowledge Graph, Data Service, and Semantic Portal for Intangible Academic Cultural Heritage 1643-2023 in Finland
Rafael Leal, Annastiina Ahola and Eero Hyvönen: Enriching Cultural Heritage Knowledge Graph Metadata from Finnish Texts with Large Language Models
Michael Lewis, Eljas Oksanen, Frida Ehrnsten, Heikki Rantala, Jouni Tuominen and Eero Hyvönen: The Impact of Human Decision-making on the Research Value of Archaeological Data
Eero Hyvönen, Petri Leskinen, Henna Poikkimäki, Heikki Rantala, Annastiina Ahola, Rafael Leal, Jouni Tuominen, Senka Drobac, Ossi Koho, Ilona Pikkanen and Hanna-Leena Paloposki: LetterSampo Finland knowledge graph, data service, and semantic portal for researching epistolary data of the Grand Duchy of Finland (1809-1917)

Dynamic Configurable Entity Recognition from Text

This project complements current entity recognition methods in situations where extraction must be dynamic, either in the vocabulary used or in other aspects of configuration.

The approach is modular. Text is first sent to a multilingual lexical analysis web service. The results of this analysis are then used to build search needles, which are finally submitted as a SPARQL query against any vocabulary stored at a Linked Data endpoint.
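The needle-building and query steps can be illustrated with a minimal Python sketch. Everything in it is an assumption made for illustration, not the project's actual implementation: the function names `build_needles` and `build_query` are hypothetical, the lemma list is a hard-coded stand-in for the output of the lexical analysis service, needles are taken to be contiguous word n-grams, and the vocabulary is assumed to expose SKOS labels.

```python
from typing import List


def build_needles(lemmas: List[str], max_len: int = 3) -> List[str]:
    """Return contiguous word n-grams up to max_len, longest first, no duplicates.

    Assumption: a "search needle" is modelled here as a plain n-gram of lemmas.
    """
    needles: List[str] = []
    for n in range(min(max_len, len(lemmas)), 0, -1):
        for i in range(len(lemmas) - n + 1):
            needle = " ".join(lemmas[i:i + n])
            if needle not in needles:
                needles.append(needle)
    return needles


def escape_literal(text: str) -> str:
    """Escape backslashes and double quotes for use inside a SPARQL string literal."""
    return text.replace("\\", "\\\\").replace('"', '\\"')


def build_query(needles: List[str], lang: str = "fi") -> str:
    """Build one SPARQL query that looks up all needles as language-tagged labels.

    Assumption: the reference vocabulary uses skos:prefLabel / skos:altLabel.
    """
    values = " ".join(f'"{escape_literal(n)}"@{lang}' for n in needles)
    return (
        "PREFIX skos: <http://www.w3.org/2004/02/skos/core#>\n"
        "SELECT ?concept ?label WHERE {\n"
        f"  VALUES ?label {{ {values} }}\n"
        "  ?concept skos:prefLabel|skos:altLabel ?label .\n"
        "}"
    )


# Hard-coded stand-in for the lemmas a lexical analysis service might return.
lemmas = ["helsinki", "yliopisto", "kirjasto"]
needles = build_needles(lemmas, max_len=2)
query = build_query(needles)
print(query)
```

Putting the longest needles first is one possible design choice: it lets a caller prefer multi-word matches over their single-word parts when both are found in the vocabulary.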

By dissociating the lexical processing from the reference vocabulary lookup, and by allowing both to be dynamically configured, it is possible to tailor entity recognition for a particular task much more quickly than traditional methods allow. In addition, because a live SPARQL endpoint is queried, any change to the reference vocabulary is immediately available for recognition, with no need to rebuild a model.
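As a rough illustration of what dynamic configuration could look like, the sketch below keeps all vocabulary-specific settings in a small configuration object, so that switching to another vocabulary means switching the object rather than changing code. The class `RecognizerConfig`, the function `lookup_query`, and the `example.org` endpoint addresses are hypothetical placeholders, not part of the project.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class RecognizerConfig:
    """Hypothetical per-task settings for the vocabulary lookup step."""
    endpoint_url: str      # address of the SPARQL endpoint (placeholder values below)
    label_property: str    # full IRI of the property that holds entity labels
    lang: str              # language tag expected on the labels
    max_ngram: int = 3     # longest needle, in words, to generate upstream


def lookup_query(needle: str, cfg: RecognizerConfig) -> str:
    """Build a SPARQL query matching one needle, driven entirely by the config."""
    literal = needle.replace("\\", "\\\\").replace('"', '\\"')
    return (
        "SELECT ?entity WHERE { "
        f'?entity <{cfg.label_property}> "{literal}"@{cfg.lang} . '
        "}"
    )


# Two assumed configurations: the same code serves both vocabularies.
places = RecognizerConfig(
    endpoint_url="http://example.org/places/sparql",
    label_property="http://www.w3.org/2004/02/skos/core#prefLabel",
    lang="fi",
)
persons = RecognizerConfig(
    endpoint_url="http://example.org/persons/sparql",
    label_property="http://www.w3.org/2000/01/rdf-schema#label",
    lang="en",
    max_ngram=2,
)

q_places = lookup_query("turku", places)
q_persons = lookup_query("turku", persons)
print(q_places)
print(q_persons)
```

Because each query is built at request time and would be sent to a live endpoint, a label added to the vocabulary would be matched by the next query without any retraining step.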

System demonstrators:

Links:

Lexical Analysis Service

Contact Person

D.Sc. Eetu Mäkelä, Aalto University

Publications

2019

2017

Eero Hyvönen, Arttu Oksanen, Jouni Tuominen, Eetu Mäkelä and Minna Tamper: Semanttinen Finlex. Laki ja oikeus avoimena linkitettynä datana. (Semantic Finlex. Law and Justice as Linked Open Data.). Oikeus-lehti, vol. 46, no. 1, March, 2017.
