- NameSampo: yet another semantic sampo portal released
On Feb 1, 2019, ‘NameSampo: A Linked Open Data Infrastructure...
13.2.2019 13:53 by eahyvone
- BiographySampo released for public use
The next major semantic “Sampo” portal developed by SeCo,...
16.11.2018 21:00 by eahyvone
- New text book about Semantic Web and Linked Data in Finnish
The first text book about Semantic Web and Linked Data in Finnish,...
11.4.2018 12:28 by eahyvone
- Mikko Koho, Lia Gasbarra, Jouni Tuominen, Heikki Rantala, Ilkka Jokipii and Eero Hyvönen: AMMO Ontology of Finnish Historical Occupations
- Agata Dominowska, Elsi Hyttinen, Peter Ivanics, Mikko Koho, Ilona Pikkanen and Risto Turunen: Hiding in Plain Sight: Literature in Newspapers and How to Approach it
- Lia Gasbarra, Mikko Koho, Ilkka Jokipii, Heikki Rantala and Eero Hyvönen: An Ontology of Finnish Historical Occupations
- Eero Hyvönen, Petri Leskinen, Minna Tamper, Heikki Rantala, Esko Ikkala, Jouni Tuominen and Kirsi Keravuori: BiographySampo - Publishing and Enriching Biographies on the Semantic Web for Digital Humanities Research
Linked Open Data Infrastructure for Digital Humanities in Finland
Digital Humanities (DH) is a major new research paradigm at the crossroads of computing, humanities, and social sciences. The main idea is to develop and use novel computational methods, such as data analysis, topic modeling, visualization, network analysis, deep learning, and artificial intelligence, to solve research problems in Social Sciences and Humanities (SSH) based on big data that is becoming available as a result of digitalization of the society.
DH matches well with the multidisciplinary strategy of Aalto, and there are indeed DH research activities in five Aalto schools (Science; Arts, Design and Architechture; Engineering; Electrical Engineering; Business). At the University of Helsinki (UH), the Helsinki Centre for Digital Humanities HELDIG was recently established as a major strategic profiling action of the university with eight new HELDIG professors nominated in six faculties, and a 10MEUR budget for 2016-2020. There are substantial DH actives also in other Finnish universities, such as the University of Turku and Univerisity of Tampere, and in major Cultural Heritage (CH) and media organizations, such as the National Library, National Archives, Finnish Heritage Agency (Museovirasto), Finnish Literature Society (SKS), National Broadcasting Company Yle, and many others. In EU, DH research infrastructure work is coordinated by the EU ERIC DARIAH -- Digital Research Infrastructure for the Arts and Humanities Aalto and UH have joined into DARIAH as co-operative partners.
LODI4DH is a joint initiative of Aalto University, Department of Computer Science, and University of Helsinki, HELDIG Centre for Digital Humanities, for creating centralized national Linked Data services for open science. The services enable publication and utilization of datasets for data-intensive DH research in structured, standardized formats via open interfaces. LODI4DH is based on the large collaboration network and software created during a long line of national projects in DH between UH and Aalto since 2002 that created several in-use infrastructure prototypes, such as the ONKI ontology service, Finto ontology service at the National Library of Finland (that deployed SKOS-based parts of ONKI as a national service, and has been developing them further), and Linked Data Finland platform LDF.fi. This line of research started with the national FinnONTO project series (2003-2012) on creating a national ontology infrastructure in Finland, and has continued with, e.g., the projects Linked Data Finland (2012-2014) and Linked Open Data Science Service by the SeCo group at Aalto University and University of Helsinki.
ONKI/Finto and LDF.fi already have had a wide user base demonstrating the need for the LODI4DH infrastructure. Applications based on them have also made their way from academic research into real use. For example, the BookSampo system, based originally on LDF.fi and maintained now by the Finnish public libraries (Kirjastot.fi), had 1.6 million visitors in 2016, the WarSampo system had 130 000 distinct users in 2017, and there are tens of thousands of users in the Finto service of the National Library of Finland, based on the ONKI prototype. Many museums in Finland, e.g., Espoo City Museum, AKSELI Consortium of 8 museums, and the new national KOOKOS cataloging system make use of the ONKI ontologies. In addition to the Finnish projects, there are several ongoing research projects with international universities, such as Oxford, Stanford, Colorado, and Pennsylvania, where the Finnish Linked Data services for DH are already in use. LODI4DH focuses on DH research infrastructures but the underlying Linked Data and Semantic Web technology can and has been utilized in other fields of research, too, extending substantially the utilization potential of the infrastructure.
LODI4DH aims at harnessessing all this work into sustainable national services, and integrating the work as a component into the EU ERIC DARIAH infrastructure. LODI4DH infrastructure is open source, publishes open data, and is free of charge for everyone to use.
Domain Ontologies for Data Linking
Data from collaborating organizations is aggregated into shared open shared domain ontologies for 1) historical places and maps, 2) historical persons, 3) events, 4) keyword concepts, and 5) times. These core ontologies, provided as web services, are used as “semantic glue” in data linking and fusion.
Historical Places and Maps
As for historical places and maps, our work aims at developing the Finnish Ontology Service of Historical Places and Maps (Hipla), cf. the demonstrator Hipla.fi.
This work started already in FinnONTO, and has been revitalized in the context of building the National Semantic Biography of Finland and related other biographical systems, see Semantic Biographies Based on Linked Data.
Historical Keyword Concepts
When developing ONKI and Finto, lots of Finnish keyword thesauri were converted and developed further into RDFS and SKOS ontologies, interlinked into a global linked data cloud called the KOKO, and published as ontology services. However, more work is needed here, for example, is areas such as archeology, built enviroments, history, and law.
LODI4DH creates a time ontology for making references to historical times and periods of time, including names of time periods. Services for calendar date conversion will be included in the system. Here results from international projects can be utilized.
Harmonizing Metadata Models
The project works on developing harmonizing metadata models for representing semantic data, such as Bio CRM for extending CIDOC CRM to representing biographical data.
We also work on publishing and sharing interlinked core datasets, that are deemed to be useful in different research projects and applications. These dataset are expected to evolve into a kind of Finnish Linked Open Data Cloud. Work has started on, e.g., the following datasets: Linked Open Name Archive, based on data about 2.7 million place names provided by the Institute for the Languages in Finland (Kotus); Semantic National Biography, based on over 13 000 biographies of prominent Finns edited by the Finnish Literature Society (SKS); WarSampo datatsets related to WW2 history, provided by the National Archives of Finland, Defence Forces, and others; University of Helsinki Person Registry (1640-2000), provided by the University of Helsinki Archives; Semantic Finlex legislation and case law data, provided by the Ministry of Justice.
Linked Data Services
As for the publishing platform, the Linked Data Finland platform is used and developed further with additional services for DH data production, publishing, data analysis, and visualization.
We also produce educational online materials, developing, e.g., the Linked Data School LinDa, for using Linked Data technology in DH research and application development.
Prof. Eero Hyvönen, Aalto University and University of Helsinki (HELDIG)
Dr. Jouni Tuominen, University of Helsinki (HELDIG) and Aalto University
An overview of the project is presented in these slides.