Jouni Tuominen
University Researcher, Staff Scientist, D.Sc. (Tech.)
University of Helsinki
phone: +358 50 556 0402
email: firstname.lastname@helsinki.fi
room: 428 @ Helsinki Institute for Social Sciences and Humanities (HSSH), Fabianinkatu 24, Helsinki
postal address: HSSH, P.O. Box 4, FI-00014 University of Helsinki, Finland
Aalto University
email: firstname.lastname@aalto.fi
room: B333 @ Department of Computer Science, Konemiehentie 2, Espoo
postal address: Dept. of Computer Science, P.O. Box 15400, FI-00076 Aalto, Finland
Academic CV, see citation indices in
Google Scholar, ORCID
0000-0003-4789-5676
Jouni Tuominen is a university researcher at the University of Helsinki, Helsinki Institute for Social Sciences and Humanities (HSSH),
and a visiting researcher at Aalto University, Department of Computer Science, and affiliated with the Helsinki Centre for Digital Humanities (HELDIG).
He received his D.Sc. (Tech.) at Aalto University in 2017, with a dissertation titled "Ontology Services for Knowledge Organization Systems".
His research interests include ontology repositories and services, linked data publishing methods,
ontology models for legacy data, and tooling for digital humanities. He has published over 130 research publications since 2007,
and has received several national and international awards. He has collaborated with museums, libraries, and archives on their collection cataloging
and cultural heritage data publishing processes since 2007.
Areas of research interest:
- ontology services for facilitating content creation workflows (e.g. ONKI service)
- linked data publishing methods and tools (e.g. LDF.fi service)
- semantic modeling and harmonization of legacy data
- methods for digital humanities
I am currently involved in the following projects:
Previous projects I have been involved in include:
Publications
(
publication list as PDF)
2025
Eero Hyvönen, Petri Leskinen, Henna Poikkimäki, Heikki Rantala, Jouni Tuominen, Senka Drobac, Ossi Koho, Ilona Pikkanen and Hanna-Leena Paloposki:
Searching, exploring, and analyzing historical letters and the underlying networks: LetterSampo Finland (1809–1917) data service and semantic portal. 2025. Abstract, submitted for peer review.
bib pdf
2024
Tomaž Erjavec, Matyáš Kopp, Nikola Ljubešić, Taja Kuzman, Paul Rayson, Petya Osenova, Maciej Ogrodniczuk, Çağrı Çöltekin, Danijel Koržinek, Katja Meden, Jure Skubic, Peter Rupnik, Tommaso Agnoloni, José Aires, Starkaður Barkarson, Roberto Bartolini, Núria Bel, María Calzada Pérez, Roberts Darģis, Sascha Diwersy, Maria Gavriilidou, Ruben van Heusden, Mikel Iruskieta, Neeme Kahusk, Anna Kryvenko, Noémi Ligeti-Nagy, Carmen Magariños, Martin Mölder, Costanza Navarretta, Kiril Simov, Lars Magne Tungland, Jouni Tuominen, John Vidler, Adina Ioana Vladu, Tanja Wissik, Väinö Yrjänäinen and and Darja Fišer:
ParlaMint II: Advancing Comparable Parliamentary Corpora Across Europe. March, 2024. Submitted.
bib link
2023
Senka Drobac, Johanna Enqvist, Petri Leskinen, Muhammad Faiz Wahjoe, Heikki Rantala, Mikko Koho, Ilona Pikkanen, Iida Jauhiainen, Jouni Tuominen, Hanna-Leena Paloposki, Matti La Mela and Eero Hyvönen:
The Laborious Cleaning: Acquiring and Transforming 19th-Century Epistolary Metadata.
Digital Humanities in the Nordic and Baltic Countries Publication, DHNB2023 Conference Proceeding, vol. 5, no. 1, pp. 248-262, University of Oslo Library, Norway, 2023.
bib pdf link
Eero Hyvönen, Laura Sinikallio, Petri Leskinen, Senka Drobac, Rafael Leal, Matti La Mela, Jouni Tuominen, Henna Poikkimäki and Heikki Rantala:
Plenary Speeches of the Parliament of Finland as Linked Open Data and Data Services.
Joint Proceedings of the Second International Workshop on Knowledge Graph Generation From Text and the First International BiKE Challenge co-located with 20th Extended Semantic Conference (ESWC 2023), pp. 1-20, CEUR Workshop Proceedings, Vol. 3447, August, 2023.
bib pdf link
Joonas Kesäniemi, Matthias Schlögl, Jouni Tuominen, Victor de Boer and Go Sugimoto:
Towards Reusable Aggregated Biographical Research Data: Provenance and Versioning in the InTaVia Knowledge Graph.
Digital Humanities in the Nordic and Baltic Countries Seventh Conference (DHNB 2023), Book of Abstracts (Sofie Gilbert and Annika Rockenberger (eds.)), pp. 117, University of Oslo Library, Oslo, Norway, March, 2023.
bib link
Matthias Schlögl, Joonas Kesäniemi, Jouni Tuominen, Victor de Boer, Go Sugimoto and Carla Ebel:
Dos and Don’ts of Building a Pan-European Biographical Knowledge Graph: Statistical Analysis of the InTaVia-Platform.
Digital Humanities in the Nordic and Baltic Countries Seventh Conference (DHNB 2023), Book of Abstracts (Sofie Gilbert and Annika Rockenberger (eds.)), pp. 106, University of Oslo Library, Oslo, Norway, March, 2023.
bib link
Eero Hyvönen, Petri Leskinen, Laura Sinikallio, Senka Drobac, Rafael Leal, Matti La Mela, Jouni Tuominen, Henna Poikkimäki and Heikki Rantala:
ParliamentSampo Infrastructure for Publishing the Plenary Speeches and Networks of Politicians of the Parliament of Finland as Open Data Services. Aalto University, Dept. of Computer Science, February, 2023. Paper published at the publication event of the ParliamentSampo data service and portal.
bib pdf
2022
Eero Hyvönen, Minna Tamper, Esko Ikkala, Mikko Koho, Rafael Leal, Joonas Kesäniemi, Arttu Oksanen, Jouni Tuominen and Aki Hietanen:
LawSampo Portal and Data Service for Publishing and Using Legislation and Case Law as Linked Open Data on the Semantic Web.
AI4LEGAL-KGSUM 2022: Artificial Intelligence Technologies for Legal Documents and Knowledge Graph Summarization 2022, vol. 3257, pp. 41-50, CEUR Workshop Proceedings, August, 2022.
bib pdf link
Arttu Oksanen, Eero Hyvönen, Minna Tamper, Jouni Tuominen, Henna Ylimaa, Katja Löytynoja, Matti Kokkonen and Aki Hietanen:
An Anonymization Tool for Open Data Publication of Legal Documents.
AI4LEGAL-KGSUM 2022: Artificial Intelligence Technologies for Legal Documents and Knowledge Graph Summarization 2022, vol. 3257, pp. 12-21, CEUR Workshop Proceedings, August, 2022.
bib pdf link
Eero Hyvönen, Laura Sinikallio, Petri Leskinen, Matti La Mela, Jouni Tuominen, Kimmo Elo, Senka Drobac, Mikko Koho, Esko Ikkala, Minna Tamper, Rafael Leal and Joonas Kesäniemi:
Linked Data Approach for Studying Parliamentary Speeches and Networks of Politicians in Finland 1907-2021 (long paper).
Digital Humanities 2022, Conference Abstracts, July 25-29, 2022 Online, Tokyo. Japan, University of Tokyo, pp. 254-257, ADHO, July, 2022.
bib link
Eero Hyvönen, Laura Sinikallio, Petri Leskinen, Matti La Mela, Jouni Tuominen, Kimmo Elo, Senka Drobac, Mikko Koho, Esko Ikkala, Minna Tamper, Rafael Leal and Joonas Kesäniemi:
Finnish Parliament on the Semantic Web: Using ParliamentSampo Data Service and Semantic Portal for Studying Political Culture and Language.
Digital Parliamentary data in Action (DiPaDA 2022), Workshop at the 6th Digital Humanities in Nordic and Baltic Countries Conference, long paper, pp. 69-85, CEUR Workshop Proceedings, Vol. 3133, May, 2022.
bib pdf link
Minna Tamper, Rafael Leal, Laura Sinikallio, Petri Leskinen, Jouni Tuominen and Eero Hyvönen:
Extracting Knowledge from Parliamentary Debates for Studying Political Culture and Language.
Proceedings of the 1st International Workshop on Knowledge Graph Generation From Text and the 1st International Workshop on Modular Knowledge co-located with 19th Extended Semantic Conference (ESWC 2022) (Sanju Tiwari, Nandana Mihindukulasooriya, Francesco Osborne, Dimitris Kontokostas, Jennifer D’Souza and Mayank Kejriwal (eds.)), vol. 3184, pp. 70-79, CEUR WS, May, 2022. International Workshop on Knowledge Graph Generation from Text (TEXT2KG 2022).
bib pdf link
Jouni Tuominen, Mikko Koho, Ilona Pikkanen, Senka Drobac, Johanna Enqvist, Eero Hyvönen, Matti La Mela, Petri Leskinen, Hanna-Leena Paloposki and Heikki Rantala:
Constellations of Correspondence: a Linked Data Service and Portal for Studying Large and Small Networks of Epistolary Exchange in the Grand Duchy of Finland.
DHNB 2022 The 6th Digital Humanities in Nordic and Baltic Countries Conference, pp. 415-423, CEUR Workshop Proceedings, Vol. 3232, March, 2022.
bib pdf link
Heikki Rantala, Esko Ikkala, Jouni Tuominen, Eero Hyvönen, Ville Rohiola, Eljas Oksanen and Mikko Koho:
FindSampo: A Linked Data Based Service for Analyzing and Disseminating Archaeological Finds.
6th Digital Humanities in Nordic and Baltic Countries Conference, poster paper, book of abstracts, pp. 118-119, March, 2022.
bib link
Heikki Rantala, Esko Ikkala, Ville Rohiola, Mikko Koho, Jouni Tuominen, Eljas Oksanen, Anna Wessman and Eero Hyvönen:
FindSampo: A Linked Data Based Portal and Data Service for Analyzing and Disseminating Archaeological Object Finds.
The Semantic Web: ESWC 2022, Lecture Notes in Computer Science, vol. 13261, pp. 478-494, Springer, 2022.
bib pdf link
2021
Eero Hyvönen, Esko Ikkala, Mikko Koho, Jouni Tuominen, Toby Burrows, Lynn Ransom and Hanno Wijsman:
Mapping Manuscript Migrations on the Semantic Web: A Semantic Portal and Linked Open Data Service for Premodern Manuscript Research.
The Semantic Web - ISWC 2021, Lecture Notes in Computer Science, vol. 12922, pp. 615-630, Springer, 2021.
bib pdf link
Eero Hyvönen, Laura Sinikallio, Petri Leskinen, Senka Drobac, Jouni Tuominen, Kimmo Elo, Matti La Mela, Mikko Koho, Esko Ikkala, Minna Tamper, Rafael Leal and Joonas Kesäniemi:
Parlamenttisampo: eduskunnan aineistojen linkitetyn avoimen datan palvelu ja sen käyttömahdollisuudet. Informaatiotutkimus, vol. 40, no. 3, pp. 216-244, November, 2021.
bib pdf link
Eero Hyvönen, Petri Leskinen, Minna Tamper, Heikki Rantala, Esko Ikkala, Jouni Tuominen and Kirsi Keravuori:
Biografiasampo yhdistää ja rikastaa suomalaiset elämäkerrat linkitettynä datana semanttisessa webissä (Biographysampo links and enriches Finnish biographies as linked data on the Semantic Web. Informaatiotutkimus, vol. 40, no. 3, pp. 346-368, November, 2021.
bib pdf link
Laura Sinikallio, Senka Drobac, Minna Tamper, Rafael Leal, Mikko Koho, Jouni Tuominen, Matti La Mela and Eero Hyvönen:
Plenary Debates of the Parliament of Finland as Linked Open Data and in Parla-CLARIN Markup.
3rd Conference on Language, Data and Knowledge, LDK 2021, Open Access Series in Informatics (OASIcs), vol. 93, pp. 8:1-8:17, Schloss Dagstuhl - Leibniz-Zentrum für Informatik GmbH, Zaragoza, Spain, August, 2021.
bib pdf link
Eero Hyvönen, Heikki Rantala, Esko Ikkala, Mikko Koho, Jouni Tuominen, Babatunde Anafi, Suzie Thomas, Anna Wessman, Eljas Oksanen, Ville Rohiola, Jutta Kuitunen and Minna Ryyppö:
Citizen Science Archaeological Finds on the Semantic Web: The FindSampo Framework. Antiquity, A Review of World Archaeology, vol. 95, no. 382, pp. e24, Cambridge University Press, August, 2021.
bib pdf link
Toby Burrows, Doug Emery, Arthur Mitchell Fraas, Eero Hyvönen, Esko Ikkala, Mikko Koho, David Lewis, Andrew Morrison, Kevin Page, Lynn Ransom, Emma Cawfield Thomson, Jouni Tuominen, Athanasios Velios and Hanno Wijsman:
A New Model for Manuscript Provenance Research: The Mapping Manuscripts Migrations Project. Manuscript Studies, vol. 6, no. 1, pp. 131-144, The University of Pennsylvania Press, Philadelphia, US, July, 2021.
bib pdf link
Toby Burrows, Mikko Koho, Jouni Tuominen, Eero Hyvönen, Kevin Page, David Lewis, Doug Emery, Hanno Wijsman, Lynn Ransom and Emma Cawlfield Thomson:
Modelling the History of Medieval and Renaissance Manuscripts for the Mapping Manuscript Migrations Portal.
Data for History 2021: Modelling Time, Places, Agents, June, 2021. Abstract.
bib link
Mikko Koho, Toby Burrows, Eero Hyvönen, Esko Ikkala, Kevin Page, Lynn Ransom, Jouni Tuominen, Doug Emery, Mitch Fraas, Benjamin Heller, David Lewis, Andrew Morrison, Guillaume Porte, Emma Thomson, Athanasios Velios and Hanno Wijsman:
Harmonizing and Publishing Heterogeneous Pre-Modern Manuscript Metadata as Linked Open Data. Journal of the Association for Information Science and Technology (JASIST), vol. 73, no. 2, pp. 240-257, May, 2021.
bib pdf link Manuscripts are a crucial form of evidence for research into all aspects of premodern European history and culture, and there are numerous databases devoted to describing them in detail. This descriptive information, however, is typically available only in separate data silos based on incompatible data models and user interfaces. As a result, it has been difficult to study manuscripts comprehensively across these various platforms. To address this challenge, a team of manuscript scholars and computer scientists worked to create “Mapping Manuscript Migrations” (MMM), a semantic portal, and a Linked Open Data service. MMM stands as a successful proof of concept for integrating distinct manuscript datasets into a shared platform for research and discovery with the potential for future expansion. This paper will discuss the major products of the MMM project: a unified data model, a repeatable data transformation pipeline, a Linked Open Data knowledge graph, and a Semantic Web portal. It will also examine the crucial importance of an iterative process of multidisciplinary collaboration embedded throughout the project, enabling humanities researchers to shape the development of a digital platform and tools, while also enabling the same researchers to ask more sophisticated and comprehensive research questions of the aggregated data.
Mikko Koho, Esko Ikkala, Petri Leskinen, Minna Tamper, Jouni Tuominen and Eero Hyvönen:
WarSampo Knowledge Graph: Finland in the Second World War as Linked Open Data. Semantic Web – Interoperability, Usability, Applicability, vol. 12, no. 2, pp. 265-278, January, 2021.
bib pdf link The Second World War (WW2) is arguably the most devastating catastrophe of human history, a topic of great interest to not only researchers but the general public. However, data about the Second World War is heterogeneous and distributed in various organizations and countries making it hard to utilize. In order to create aggregated global views of the war, a shared ontology and data infrastructure is needed to harmonize information in various data silos. This makes it possible to share data between publishers and application developers, to support data analysis in Digital Humanities research, and to develop data-driven intelligent applications. As a first step towards these goals, this article presents the WarSampo knowledge graph (KG), a shared semantic infrastructure, and a Linked Open Data (LOD) service for publishing data about WW2, with a focus on Finnish military history. The shared semantic infrastructure is based on the idea of representing war as a spatio-temporal sequence of events that soldiers, military units, and other actors participate in. The used metadata schema is an extension of CIDOC CRM, supplemented by various military historical domain ontologies. With an infrastructure containing shared ontologies, maintaining the interlinked data brings upon new challenges, as one change in an ontology can propagate across several datasets that use it. To support sustainability, a repeatable automatic data transformation and linking pipeline has been created for rebuilding the whole WarSampo KG from the individual source datasets. The WarSampo KG is hosted on a data service based on W3C Semantic Web standards and best practices, including content negotiation, SPARQL API, download, automatic documentation, and other services supporting the reuse of the data. The WarSampo KG, a part of the international LOD Cloud and totalling ca. 14 million triples, is in use in nine end-user application views of the WarSampo portal, which has had over 400 000 end users since its opening in 2015.
Toby Burrows, Mitch Fraas, Eero Hyvönen, Esko Ikkala, Mikko Koho, David Lewis, Andrew Morrison, Kevin Page, Lynn Ransom, Emma Thomson, Jouni Tuominen, Athanasios Velios and Hanno Wijsman:
Linking Data to Explore the History of Medieval and Renaissance Manuscripts: the Mapping Manuscript Migrations Project.
Book of abstracts: 2nd International Conference of the European Association for Digital Humanities (EADH), Krasnoyarsk, Russia, 21 - 25 September 2021, 2021.
bib pdf
2020
Eero Hyvönen, Minna Tamper, Esko Ikkala, Sami Sarsa, Arttu Oksanen, Jouni Tuominen and Aki Hietanen:
Publishing and Using Legislation and Case Law as Linked Open Data on the Semantic Web.
The Semantic Web: ESWC 2020 Satellite Events (Harth, Andreas, Presutti, Valentina, Troncy, Raphaël, Acosta, Maribel, Polleres, Axel, Fernández, Javier D., Xavier Parreira, Josiane, Hartig, Olaf, Hose, Katja and Cochez, Michael (eds.)), Lecture Notes in Computer Science, vol. 12124, pp. 110-114, Springer-Verlag, 2020.
bib pdf link
Heikki Rantala, Esko Ikkala, Ilkka Jokipii, Mikko Koho, Jouni Tuominen and Eero Hyvönen:
WarVictimSampo 1914–1922: A Semantic Portal and Linked Data Service for Digital Humanities Research on War History.
The Semantic Web: ESWC 2020 Satellite Events (Harth, Andreas, Presutti, Valentina, Troncy, Raphaël, Acosta, Maribel, Polleres, Axel, Fernández, Javier D., Xavier Parreira, Josiane, Hartig, Olaf, Hose, Katja and Cochez, Michael (eds.)), Lecture Notes in Computer Science, vol. 12124, pp. 191-196, Springer-Verlag, 2020.
bib pdf link
Minna Tamper, Arttu Oksanen, Jouni Tuominen, Aki Hietanen and Eero Hyvönen:
Automatic Annotation Service APPI: Named Entity Linking in Legal Domain.
The Semantic Web: ESWC 2020 Satellite Events (Harth, Andreas, Presutti, Valentina, Troncy, Raphaël, Acosta, Maribel, Polleres, Axel, Fernández, Javier D., Xavier Parreira, Josiane, Hartig, Olaf, Hose, Katja and Cochez, Michael (eds.)), Lecture Notes in Computer Science, vol. 12124, pp. 208-213, Springer-Verlag, 2020.
bib pdf link
Heikki Rantala, Ilkka Jokipii, Mikko Koho, Esko Ikkala, Jouni Tuominen and Eero Hyvönen:
Building a Linked Open Data Portal of War Victims in Finland 1914-1922.
DHN 2020 Digital Humanities in the Nordic Countries. Proceedings of the Digital Humanities in the Nordic Countries 5th Conference, pp. 310-317, CEUR Workshop Proceedings, vol. 2612, Riga, Latvia, October, 2020.
bib pdf link
Toby Burrows, Antoine Brix, Douglas Emery, Arthur Mitchell Fraas, Eero Hyvönen, Esko Ikkala, Mikko Koho, David Lewis, Synnove Myking, Kevin Page, Lynn Ransom, Emma Cawlfield Thomson, Jouni Tuominen, Hanno Wijsman and Pip Wilcox:
Linked Open Data Vocabularies and Identifiers for Medieval Studies.
DHN 2020 Digital Humanities in the Nordic Countries. Proceedings of the Digital Humanities in the Nordic Countries 5th Conference, pp. 211-218, CEUR Workshop Proceedings, vol. 2612, Riga, Latvia, October, 2020.
bib pdf link
Toby Burrows, Douglas Emery, Mitch Fraas, Eero Hyvönen, Esko Ikkala, Mikko Koho, David Lewis, Andrew Morrison, Kevin Page, Lynn Ransom, Emma Thomson, Jouni Tuominen, Athanasios Velios and Hanno Wijsman:
Mapping Manuscript Migrations: Digging into Data for Researching the History and Provenance of Medieval and Renaissance Manuscripts: White Paper. August, 2020.
bib pdf link
Pejam Hassanzadeh, Eero Hyvönen, Esko Ikkala, Jouni Tuominen, Suzie Thomas, Anna Wessman and Ville Rohiola:
FindSampo Platform for Reporting and Studying Archaeological Finds Using Citizen Science.
3rd Workshop on Humanities in the Semantic Web (WHiSe 2020), pp. 33-40, CEUR Workshop Proceedings, vol. 2695, June, 2020.
bib pdf link
Toby Burrows, Douglas Emery, Arthur Mitchell Fraas, Eero Hyvönen, Esko Ikkala, Mikko Koho, David Lewis, Andrew Morrison, Kevin Page, Lynn Ransom, Emma Cawlfield Thomson, Jouni Tuominen, Athanasios Velios, and Hanno Wijsman:
Mapping Manuscript Migrations Knowledge Graph: Data for Tracing the History and Provenance of Medieval and Renaissance Manuscripts. Journal of Open Humanities Data, vol. 6, pp. 3, June, 2020.
bib pdf link
2019
Arttu Oksanen, Jouni Tuominen, Eetu Mäkelä, Minna Tamper, Aki Hietanen and Eero Hyvönen:
Semantic Finlex: Transforming, Publishing, and Using Finnish Legislation and Case Law As Linked Open Data on the Web.
Knowledge of the Law in the Big Data Age (G. Peruginelli and S. Faro (eds.)), Frontiers in Artificial Intelligence and Applications, vol. 317, pp. 212-228, IOS Press, 2019. ISBN 978-1-61499-984-3 (print); ISBN 978-1-61499-985-0 (online).
bib pdf link Governments publish legislation and case law widely in print and on the Web. Such legal information is provided for human consumption, but the information is usually not available as data for algorithmic analysis and applications to use. However, this would be beneficial in many use cases, such as building more intelligent juridical online services and conducting research into legislation and legal practice. To address these needs, this Chapter presents Semantic Finlex, a national in-use data resource and service for publishing Finnish legislation and related case law as Linked Open Data for legal applications to use. The system transforms and interlinks on a regular basis data from the legacy legal database Finlex of the Ministry of Justice into Linked Open Data, based on the European standards ECLI and ELI. The published data is hosted on the 7-star Linked Data Finland service and SPARQL endpoint with a variety of related services available that ease data re-use. Rich Internet Applications using SPARQL for data access are presented as application demonstrators of the data service. In addition, this Chapter presents methods and tools under development to automatically annotate legal texts and to anonymize case law documents prior to their publication on the Web. Anonymization is necessary due to issues of data protection and privacy, and annotation is needed for semantic search and interlinking the documents. The automated approaches could significantly speed up the process and minimize costs of publishing legal documents as Linked Open Data.
Eero Hyvönen, Ruth Ahnert, Sebastian E. Ahnert, Jouni Tuominen, Eetu Mäkelä, Miranda Lewis and Gertjan Filarski:
Reconciling metadata.
Reassembling the Republic of Letters in the Digital Age (H. Hotson and T. Wallnig (eds.)), pp. 223-235, Göttingen University Press, 2019.
bib link
Howard Hotson, Thomas Wallnig, Jouni Tuominen, Eetu Mäkelä, and Eero Hyvönen:
People.
Reassembling the Republic of Letters in the Digital Age (H. Hotson and T. Wallnig (eds.)), pp. 119-136, Göttingen University Press, 2019.
bib link
Mikko Koho, Erkki Heino, Petri Leskinen, Esko Ikkala, Minna Tamper, Kasper Apajalahti, Jouni Tuominen, Eetu Mäkelä and Eero Hyvönen:
WarSampo Knowledge Graph. Zenodo, October, 2019. Dataset.
bib link WarSampo Knowledge Graph includes harmonized data of different kinds concerning the Second World War in Finland, separated in different subgraphs representing events, actors, places, photographs, and other aspects and documentation of the war. The data covers the Winter War 1939-1940 against the Soviet attack, the Continuation War 1941-1944 where the occupied areas of the Winter War were temporarily regained, and the Lapland War 1944-1945, where the Finns pushed the German troops away from Lapland.
Anna Wessman, Suzie Thomas, Ville Rohiola, Mikko Koho, Esko Ikkala, Jouni Tuominen, Eero Hyvönen, Jutta Kuitunen, Helinä Parviainen, and Marianna Niukkanen:
Citizen Science in Archaeology: Developing a Collaborative Web Service for Archaeological Finds in Finland.
Transforming Heritage Practice in the 21st Century: Contributions from Community Archaeology (John Jameson and Sergiu Musteață (eds.)), pp. 337-352, Springer, July, 2019.
bib pdf link
Mikko Koho, Lia Gasbarra, Jouni Tuominen, Heikki Rantala, Ilkka Jokipii and Eero Hyvönen:
AMMO Ontology of Finnish Historical Occupations.
Proceedings of the First International Workshop on Open Data and Ontologies for Cultural Heritage (ODOCH 19) (Antonella Poggi (ed.)), vol. 2375, pp. 91-96, CEUR Workshop Proceedings, Rome, Italy, June, 2019.
bib pdf link This paper introduces AMMO Ontology of Finnish Historical Occupations. AMMO is based on thousands of occupation labels extracted from three Finnish military historical datasets of the early 20th century: the first consists of the ca. 40 000 war-related death records around the time of the Finnish Civil War (1914–1922); the second consists of the ca. 95 000 death records of Finnish soldiers in the Winter War and Continuation War (1939–1944); the third contains the ca. 4500 records of Finnish prisoners of war in the Soviet Union during the WW2. Our goal from a Digital Humanities perspective is to use AMMO to study military history and these datasets based on the occupation and social status of the soldiers. AMMO will also be used as a component for faceted search and semantic recommendation in two semantic portals for Finnish military history. AMMO is aligned with the international historical occupation classification HISCO and with a modern Finnish occupational classification for international and national interoperability. The ontology is published as Linked Open Data in an ontology service.
Eero Hyvönen, Petri Leskinen, Minna Tamper, Heikki Rantala, Esko Ikkala, Jouni Tuominen and Kirsi Keravuori:
BiographySampo - Publishing and Enriching Biographies on the Semantic Web for Digital Humanities Research.
The Semantic Web. ESWC 2019 (Pascal Hitzler, Miriam Fernández, Krzysztof Janowicz, Amrapali Zaveri, Alasdair J.G. Gray, Vanessa Lopez, Armin Haller and Karl Hammar (eds.)), pp. 574-589, Springer-Verlag, June, 2019.
bib pdf link
Eero Hyvönen, Petri Leskinen, Minna Tamper, Heikki Rantala, Esko Ikkala, Jouni Tuominen and Kirsi Keravuori:
Demonstrating BiographySampo in Solving Digital Humanities Research Problems in Biography and Prosopography.
The Fourth Digital Humanities in the Nordic Countries 2019 (DHN2019), Book of Abstracts, University of Copenhagen, Copenhagen, Denmark, March, 2019.
bib pdf link
Anna Wessman, Suzie Thomas, Ville Rohiola, Jutta Kuitunen, Esko Ikkala, Jouni Tuominen, Mikko Koho and Eero Hyvönen:
A Citizen Science Approach to Archaeology: Finnish Archaeological Finds Recording Linked Open Database (SuALT).
DHN 2019 Digital Humanities in Nordic Countries. Proceedings of the Digital Humanities in the Nordic Countries 4th Conference, pp. 469-478, CEUR Workshop Proceedings, Vol-2364, Copenhagen, Denmark, March, 2019.
bib pdf link
Eero Hyvönen, Esko Ikkala, Jouni Tuominen, Mikko Koho, Toby Burrows, Lynn Ransom and Hanno Wijsman:
A Linked Open Data Service and Portal for Pre-modern Manuscript Research.
DHN 2019 Digital Humanities in Nordic Countries. Proceedings of the Digital Humanities in the Nordic Countries 4th Conference, pp. 220-229, CEUR Workshop Proceedings, Vol-2364, Copenhagen, Denmark, March, 2019.
bib pdf link
2018
Petri Leskinen, Eero Hyvönen and Jouni Tuominen:
Analyzing and Visualizing Prosopographical Linked Data Based on Biographies.
Proceedings of the Second Conference on Biographical Data in a Digital World 2017 (BD2017), vol. 2119, pp. 39-44, CEUR Workshop Proceedings, Linz, Austria, 2018.
bib pdf link This paper shows how faceted search on biographical data can be utilized as a flexible basis for filtering target groups of people and, in particular, how generic data analysis and visualization tools can then be applied for solving prosopographical research questions based on the filtered data. This idea is demonstrated and evaluated in practice by presenting two application case studies: 1) linked data extracted from a printed registry of over 10 000 alumni (1867–1992) of the prominent Finnish high school Norssi, and 2) a knowledge graph extracted from 13 000 short biographies of significant Finnish people (from 3rd century to present times) in the National Biography of Finland. In both cases, the data is enriched by linking their entities with several other external datasets.
Jouni Tuominen, Eero Hyvönen and Petri Leskinen:
Bio CRM: A Data Model for Representing Biographical Data for Prosopographical Research.
Proceedings of the Second Conference on Biographical Data in a Digital World 2017 (BD2017), vol. 2119, pp. 59-66, CEUR Workshop Proceedings, Linz, Austria, 2018.
bib pdf link Biographies make a promising application case of Linked Data: they can be used, e.g., as a basis for Digital Humanities research in prosopography and as a key data and linking resource in semantic Cultural Heritage (CH) portals. In both use cases, a semantic data model for harmonizing and interlinking heterogeneous data from different sources is needed. This paper presents such a data model, Bio CRM, with the following key ideas: 1) The model is a domain specific extension of CIDOC CRM, making it applicable to not only biographical data but to other CH data, too. 2) The model makes a distinction between enduring unary roles of actors, their enduring binary relationships, and perduing events, where the participants can take different roles modeled as a role concept hierarchy. 3) The model can be used as a basis for semantic data validation and enrichment by reasoning. 4) The enriched data conforming to Bio CRM is targeted to be used by SPARQL queries in a flexible ways using a hierarchy of roles in which participants can be involved in events.
Esko Ikkala, Jouni Tuominen, Jaakko Raunamaa, Tiina Aalto, Terhi Ainiala, Helinä Uusitalo and Eero Hyvönen:
NameSampo: A Linked Open Data Infrastructure and Workbench for Toponomastic Research.
Proceedings of the 2nd ACM SIGSPATIAL Workshop on Geospatial Humanities, GeoHumanities 18, pp. 2:1-2:9, ACM, Seattle, WA, USA, November, 2018.
bib pdf link This paper presents a series of projects where one of the main sources for toponomastic research in Finland, the corpora of place names in the Names Archive database of the Institute for the Languages of Finland, was digitized and how the resulting database was converted, enriched and published as Linked Open Data using a data processing pipeline. Utilizing the Linked Data infrastructure and various external data sources, a modern full-stack web application, NameSampo, was created in collaboration between toponomastic researchers and computer scientists for searching, analyzing, and visualizing digital toponomastic data sources.
Eero Hyvönen, Petri Leskinen, Minna Tamper, Heikki Rantala, Esko Ikkala, Jouni Tuominen and Kirsi Keravuori:
Biografiasammon tekoäly yhdistää ja rikastaa suomalaiset elämäkerrat semanttisessa webissä. Aalto-yliopisto, Semanttisen laskennan tutkimusryhmä (SeCo), Nov, 2018.
bib pdf Biografiasampo-järjestelmä käynnistää uuden aikakauden elämäkertakokoelmien julkaisemisessa ja käyttämisessä verkossa. Järjestelmän ydinaineistona on Kansallisbiografia ja muut Suomalaisen Kirjallisuuden Seuran (SKS) ja tieteellisten seurojen toimittamat pienoiselämäkerrat, yhteensä 13 100 elämäntarinaa, joita on kirjoittanut 900 suomalaista tutkijaa. Biografiasammon innovaationa on luoda kieliteknologian, tekoälyn ja semanttisen webin teknologioiden avulla elämäkertojen teksteistä ja niihin eri lähteissä liittyvistä tiedoista tietämysverkko (knowledge graph) ja kansallinen tietoinfrastruktuuri, joka koostuu miljoonista tietojen välisistä yhteyksistä. Tietämysverkko on julkaistu linkitetyn datan palvelussa, jonka varaan on toteutettu seitsemästä sovellusnäkymästä koostuva älykäs, kaikille avoin ja maksuton verkkopalvelu biografiasampo.fi kansalaisten ja digitaalisten ihmistieteiden tutkijoiden käytettäväksi.
Arttu Oksanen, Jouni Tuominen, Eetu Mäkelä, Minna Tamper, Aki Hietanen, and Eero Hyvönen:
Semantic Finlex: Finnish Legislation and Case Law as a Linked Open Data Service.
Proceedings of Law via the Internet 2018 (LVI 2018), Knowledge of the Law in the Big Data Age, abstracts, Florence, Italy, October, 2018.
bib pdf
Jouni Tuominen, Eetu Mäkelä, Eero Hyvönen, Arno Bosse, Miranda Lewis and Howard Hotson:
Reassembling the Republic of Letters - A Linked Data Approach.
Proceedings of the Digital Humanities in the Nordic Countries 3rd Conference (DHN 2018), pp. 76-88, CEUR Workshop Proceedings, vol. 2084, Helsinki, Finland, March, 2018.
bib pdf link Between 1500 and 1800, a revolution in postal communication allowed ordinary men and women to scatter letters across and beyond Europe. This exchange helped knit together what contemporaries called the respublica litteraria, Republic of Letters, a knowledge-based civil society, crucial to that era’s intellectual breakthroughs, and formative of many modern European values and institutions. To enable effective Digital Humanities research on the epistolary data distributed in different countries and collections, metadata about the letters have been aggregated, harmonised, and provided for the research community through the Early Modern Letters Online (EMLO) service. This paper discusses the idea and benefits of using Linked Data as a basis for the next digital framework of EMLO, and presents experiences of a first demonstrational implementation of such a system.
Eero Hyvönen, Petri Leskinen, Minna Tamper, Jouni Tuominen and Kirsi Keravuori:
Semantic National Biography of Finland.
Proceedings of the Digital Humanities in the Nordic Countries 3rd Conference (DHN 2018), pp. 372-385, CEUR Workshop Proceedings, Vol-2084, Helsinki, Finland, March, 2018.
bib pdf link This paper presents the vision of publishing and utilizing textual biographies as Linked (Open) Data on the Semantic Web. As a case study, we publish the live stories of the National Biography of Finland, created by the Finnish Literature Society, as semantic, i.e., machine “understandable” metadata in a SPARQL endpoint using the Linked Data Finland (LDF.fi) service. On top of the data service various Digital Humanities applications are built. The applications include searching and studying individual personal histories as well as historical research of groups of persons using methods of prosopography. The biographical data is enriched by extracting events from unstructured and semi-structured texts, and by linking entities internally and to external data sources. A faceted semantic search engine is provided for filtering groups of people from the data for prosopographical research. An extension of the event-based CIDOC CRM ontology is used as the underlying data model, where lives are seen as chains of interlinked events populated from the data of the biographies and additional data sources, such as museum collections, library databases, and archives.
Suzie Thomas, Anna Wessman, Jouni Tuominen, Mikko Koho, Esko Ikkala, Eero Hyvönen, Ville Rohiola and Ulla Salmela:
SuALT: Collaborative Research Infrastructure for Archaeological Finds and Public Engagement through Linked Open Data.
Digital Humanities in the Nordic Countries 3rd Conference (DHN 2018), Book of Abstracts, Helsinki, Finland, March, 2018.
bib pdf
Esko Ikkala, Eero Hyvönen and Jouni Tuominen:
Geocoding, Publishing, and Using Historical Places and Old Maps in Linked Data Applications.
Proceedings of the Digital Humanities in the Nordic Countries 3rd Conference, pp. 228-234, CEUR Workshop Proceedings, Vol 2084, Helsinki, Finland, March, 2018.
bib pdf link
2017
Mikko Koho, Eero Hyvönen, Erkki Heino, Jouni Tuominen, Petri Leskinen and Eetu Mäkelä:
Linked Death - Representing, Publishing, and Using Second World War Death Records as Linked Open Data.
The Semantic Web: ESWC 2017 Satellite Events (Eva Blomqvist, Katja Hose, Heiko Paulheim, Agnieszka Ławrynowicz, Fabio Ciravegna and Olaf Hartig (eds.)), pp. 369-383, Springer, Cham, 2017.
bib pdf link War history of the Second World War (WW2), humankind’s largest disaster, is of great interest to both laymen and researchers. Most of us have ancestors and relatives who participated in the war, and in the worst case got killed. Researchers are eager to find out what actually happened then, and even more importantly why, so that future wars could perhaps be prevented. The darkest data of war history are casualty records—from such data we could perhaps learn most about the war. This paper presents a model and system for representing death records as linked data, so that 1) citizens could find out more easily what happened to their relatives during WW2 and 2) digital humanities (DH) researchers could (re)use the data easily for research.
Petri Leskinen, Mikko Koho, Erkki Heino, Minna Tamper, Esko Ikkala, Jouni Tuominen, Eetu Mäkelä and Eero Hyvönen:
Modeling and Using an Actor Ontology of Second World War Military Units and Personnel.
Proceedings of the 16th International Semantic Web Conference (ISWC 2017) (Claudia d Amato, Miriam Fernandez, Valentina Tamma, Freddy Lecue, Philippe Cudré-Mauroux, Juan Sequeda, Christoph Lange and Jeff Heflin (eds.)), pp. 280-296, Springer-Verlag, Vienna, Austria, October, 2017.
bib pdf link This paper presents a model for representing historical military personnel and army units, based on large datasets about World War II in Finland. The model is in use in WarSampo data service and semantic portal, which has had tens of thousands of distinct visitors. A key challenge is how to represent ontological changes, since the ranks and units of military personnel, as well as the names and structures of army units change rapidly in wars. This leads to serious problems in both search as well as data linking due to ambiguity and homonymy of names. In our solution, actors are represented in terms of the events they participated in, which facilitates disambiguation of personnel and units in different spatio-temporal contexts. The linked data in the WarSampo Linked Open Data cloud and service has ca. 9 million triples, including actor datasets of ca. 100 000 soldiers and ca. 16 100 army units. To test the model in practice, an application for semantic search and recommending based on data linking was created, where the spatio-temporal life stories of individual soldiers can be reassembled dynamically by linking data from different datasets. An evaluation is presented showing promising results in terms of linking precision.
Petri Leskinen, Jouni Tuominen, Erkki Heino and Eero Hyvönen:
An Ontology and Data Infrastructure for Publishing and Using Biographical Linked Data.
Proceedings of the Workshop on Humanities in the Semantic Web (WHiSe II), pp. 15-26., CEUR Workshop Proceedings, Vol. 2014, Vienna, Austria, October, 2017.
bib pdf link This paper describes the ontology model and published datasets of a digitized biographical person register. The applied ontology model is designed to represent people via their enduring roles and perduring lifetime events. The model is designed to support 1) prosopographical Digital Humanities research, 2) linking to resources in semantic Cultural Heritage portals, and 3) semantic data validation and enrichment by using SPARQL queries. The linked data approach enables to enrich a person s biography by interlinking it with space and time related biographical events, persons relating by social contacts or family relations, historical events, and personal achievements.
Eero Hyvönen, Erkki Heino, Petri Leskinen, Esko Ikkala, Mikko Koho, Minna Tamper, Jouni Tuominen and Eetu Mäkelä:
WarSampo: Publishing and Using Linked Open Data about the Second World War. EuropeanaTech Insight, no. 7, Europeana, September, 2017.
bib pdf link The article overviews the system WarSampo – Finnish World War 2 on the Semantic Web, the winner of the LODLAM Challenge 2017 Open Data Prize on June 29 in Venice, Italy.
Erkki Heino, Minna Tamper, Eetu Mäkelä, Petri Leskinen, Esko Ikkala, Jouni Tuominen, Mikko Koho and Eero Hyvönen:
Named Entity Linking in a Complex Domain: Case Second World War History.
Proceedings, Language, Data and Knowledge (LDK 2017), pp. 120-133, Springer-Verlag, Galway, Ireland, June, 2017.
bib pdf link This paper discusses the challenges of applying named entity linking in a rich, complex domain – specifically, the linking of 1) military units, 2) places and 3) people in the context of rich Second World War data. Multiple sub-scenarios are discussed in detail through concrete evaluations, analyzing the problems faced, and the solutions developed. A key contribution of this work is to highlight the heterogeneity of problems and approaches needed even inside a single domain, depending on both the source data as well as the target authority.
Eero Hyvönen, Petri Leskinen, Erkki Heino, Jouni Tuominen and Laura Sirola:
Reassembling and Enriching the Life Stories in Printed Biographical Registers: Norssi High School Alumni on the Semantic Web.
Proceedings, Language, Data and Knowledge (LDK 2017), pp. 113-119, Springer-Verlag, Galway, Ireland, June, 2017.
bib pdf link This paper presents the idea to enrich printed biographical person registers with linked data related to events that took place after the register was published. By transforming printed historical documents into structured data, semantic search to written texts can be provided for the reader. Even more importantly, life stories of historical persons can be extended based on data linking by extracting semantic structures from printed texts, and by combining this data with external datasets and data services. Such linking provides an enriched context for prosopographical research on people in the register, as well as an enhanced reading experience for anyone interested in reading the biographies. As a concrete case study, a register 1867–1992 of over 10 000 alumni of the prominent Finnish high school “Norssi” was transformed into RDF, was enriched by data linking, was published as a linked data service, and is provided to end users via a faceted search engine and browser for studying lives of historical persons and for prosopographical research.
Minna Tamper, Petri Leskinen, Esko Ikkala, Arttu Oksanen, Eetu Mäkelä, Erkki Heino, Jouni Tuominen, Mikko Koho and Eero Hyvönen:
AATOS – a Configurable Tool for Automatic Annotation.
Proceedings, Language, Data and Knowledge (LDK 2017), pp. 276-289, Springer-Verlag, Galway, Ireland, June, 2017.
bib pdf link This paper presents an automatic annotation tool AATOS for providing documents with semantic annotations. The tool links entities found from the texts to ontologies defined by the user. The application is highly configurable and can be used with different natural language Finnish texts. The application was developed as a part of WarSampo and Semantic Finlex projects and tested using Kansa Taisteli magazine articles and consolidated Finnish legislation of Semantic Finlex. The quality of the automatic annotation was evaluated by measuring precision and recall against existing manual annotations. The results showed that the quality of the input text, as well as the selection and configuration of the ontologies impacted the results.
2016
Eero Hyvönen, Erkki Heino, Petri Leskinen, Esko Ikkala, Mikko Koho, Minna Tamper, Jouni Tuominen and Eetu Mäkelä:
Publishing Second World War History as Linked Data Events on the Semantic Web.
Proceedings of Digital Humanities 2016, short papers, pp. 571-573, Kraków, Poland, July, 2016.
bib pdf link Data about wars is typically heterogeneous, distributed in the data silos of the fighting parties, multilingual, and often controversial depending on the political point of view. It is therefore hard for the historians to get a global picture of what has actually happened, to whom, where, when, and how. We argue that Semantic Web and Linked Data technologies are a very promising approach for modeling, harmonizing, and aggregating data about war history. Our goal is to make it possible, for both historians and laymen, to study history in a contextualized way where linked datasets enrich each other. The paper presents the in-use WarSampo 1 system, where massive collections of heterogeneous data about the (Finnish) history of the Second World War are harmonized using an event-based approach, and provided as a Linked Open Data service for applications to use. As a use case, a semantic portal WarSampo providing six different perspectives to the war based on events is presented.
Eero Hyvönen, Esko Ikkala and Jouni Tuominen:
Linked Data Brokering Service for Historical Places and Maps.
Proceedings of the 1st Workshop on Humanities in the Semantic Web (WHiSe), pp. 39-52, CEUR Workshop Proceedings, Heraklion, Crete, Greece, May, 2016. Vol 1608.
bib pdf link This paper presents a new Linked Open Data brokering service model HIPLA for using and maintaining historical place gazetteers and maps based on distributed SPARQL endpoints. The model introduces several novelties: First, the service facilitates collaborative maintenance of geo-ontologies and maps in real time as a side effect of annotating contents in legacy cataloging systems. The idea is to support a collaborative ecosystem of curators that creates and maintains data about historical places and maps in a sustainable way. Second, in order to foster understanding of historical places, the places can be provided on both modern and historical maps, and with additional contextual Linked Data attached. Third, since data about historical places is typically maintained by different authorities and in different countries, the service can be used and extended in a federated fashion, by including new distributed SPARQL endpoints (or other web services with a suitable API) into the system. To test and demonstrate the model, we created the first prototype implementation Hipla.fi of the HIPLA model. Hipla.fi is based on four Finnish datasets in SPARQL endpoints totaling some 840,000 geocoded places on 450 historical maps from two atlas series aligned on modern maps, and on the Getty Thesaurus of Geographic Names (TGN) SPARQL endpoint in the US. As a first application, a part of the Hipla.fi data service has been applied in creating a 5 million triple semantic portal of historical Second World War data with tens of thousands of end users.
Mikko Koho, Eero Hyvönen, Erkki Heino, Jouni Tuominen, Petri Leskinen and Eetu Mäkelä:
Linked Death - Representing, Publishing, and Using Second World War Death Records as Linked Open Data.
Proceedings of the 1st Workshop on Humanities in the Semantic Web (WHiSe), CEUR Workshop Proceedings, Heraklion, Crete, Greece, May, 2016. Vol 1608.
bib pdf link War history of the Second World War (WW2), humankind s largest disaster, is of great interest to both laymen and researchers. Most of us have ancestors and relatives who participated in the war, and in the worst case got killed. Researchers are eager to find out what actually happened then, and even more importantly why, so that future wars could perhaps be prevented. The darkest data of war history are casualty records---from such data we could perhaps learn most about the war. This paper presents a model and system for representing death records as linked data, so that 1) citizens could find out more easily what happened to their relatives during WW2 and 2) digital humanities (DH) researchers could (re)use the data easily for research.
Eero Hyvönen, Erkki Heino, Petri Leskinen, Esko Ikkala, Mikko Koho, Minna Tamper, Jouni Tuominen and Eetu Mäkelä:
WarSampo Data Service and Semantic Portal for Publishing Linked Open Data about the Second World War History.
The Semantic Web – Latest Advances and New Domains (ESWC 2016) (Harald Sack, Eva Blomqvist, Mathieu d Aquin, Chiara Ghidini, Simone Paolo Ponzetto and Christoph Lange (eds.)), pp. 758-773, Springer-Verlag, May, 2016.
bib pdf link This paper presents the WarSampo system for publishing collections of heterogeneous, distributed data about the Second World War on the Semantic Web. WarSampo is based on harmonizing massive datasets using event-based modeling, which makes it possible to enrich datasets semantically with each others’ contents. WarSampo has two components: First, a Linked Open Data (LOD) service WarSampo Data for Digital Humanities (DH) research and for creating applications related to war history. Second, a semanticWarSampo Portal has been created to test and demonstrate the usability of the data service. The WarSampo Portal allows both historians and laymen to study war history and destinies of their family members in the war from different interlinked perspectives. Published in November 2015, theWarSampo Portal had some 20,000 distinct visitors during the first three days, showing that the public has a great interest in this kind of applications.
2015
Eero Hyvönen, Jouni Tuominen, Esko Ikkala and Eetu Mäkelä:
Ontology Services Based on Crowdsourcing: Case National Gazetteer of Historical Places.
Proceedings of the ISWC 2015 Posters & Demonstrations Track, CEUR-WS Proceedings, Bethlehem, PA, USA, October, 2015. Vol 1486.
bib pdf link This paper introduces the idea of applying crowdsourcing to evolving ontology services; the goal is to facilitate collaborative maintenance of ontologies in real time as a side effect of annotating contents in legacy cataloging systems. The idea is being implemented in the use case of creating and managing a national level gazetteer of historical places in Finland.
Eero Hyvönen, Jouni Tuominen, Eetu Mäkelä, Jérémie Dutruit, Kasper Apajalahti, Erkki Heino, Petri Leskinen and Esko Ikkala:
Second World War on the Semantic Web: The WarSampo Project and Semantic Portal.
Proceedings of the ISWC 2015 Posters & Demonstrations Track, CEUR-WS Proceedings, Bethlehem, PA, USA, October, 2015. Vol 1486.
bib pdf link This paper initiates and fosters work on publishing Linked Open Data about the Second World War. It is argued that the heterogeneous, distributed data about the international world war history makes a promising use case for semantic technologies. We hope that by making war data openly available we can learn from the past and promote peace.
2014
Osma Suominen, Sini Pessala, Jouni Tuominen, Mikko Lappalainen, Susanna Nykyri, Henri Ylikotila, Matias Frosterus and Eero Hyvönen:
Deploying National Ontology Services: From ONKI to Finto.
Proceedings of the Industry Track at the International Semantic Web Conference 2014, CEUR Workshop Proceedings, Riva del Garda, Italy, October, 2014. Vol 1383.
bib pdf link The Finnish Ontology Library Service ONKI was published as a living laboratory prototype for public use in 2008. Its idea is to support content indexers and ontology developers via a browser interface and machine APIs. ONKI has been well-accepted, but being a prototype maintained by the ending research project FinnONTO (2003–2012), a more sustainable service was needed, supported by permanent governmental funding. To achieve this, ONKI was deployed and is being further developed by the National Library of Finland into a new national vocabulary service Finto. We discuss challenges in the deployment of ONKI into Finto and lessons learned during the transition process.
Eero Hyvönen, Jouni Tuominen, Miika Alonen and Eetu Mäkelä:
Linked Data Finland: A 7-star Model and Platform for Publishing and Re-using Linked Datasets.
The Semantic Web: ESWC 2014 Satellite Events. ESWC 2014 (Presutti, V., Blomqvist, E., Troncy, R., Sack, H., Papadakis, I. and Tordai, A. (eds.)), pp. 226-230, Springer-Verlag, May, 2014.
bib pdf link The idea of Linked Data is to aggregate, harmonize, integrate, enrich, and publish data for re-use on the Web in a cost-efficient way using Semantic Web technologies. We concern two major hindrances for re-using Linked Data: It is often difficult for a re-user to 1) understand the characteristics of the dataset and 2) evaluate the quality the data for the intended purpose. This paper introduces the “Linked Data Finland” platform LDF.fi addressing these issues. We extend the famous 5-star model of Tim Berners-Lee, with the sixth star for providing the dataset with a schema that explains the dataset, and the seventh star for validating the data against the schema. LDF.fi also automates data publishing and provides data curation tools. The first prototype of the platform is available on the web as a service, hosting tens of datasets and supporting several applications.
2013
Eero Hyvönen, Miika Alonen, Jouni Tuominen, and Eetu Mäkelä:
Linked Data Finland: Towards a 7-star Service Platform for Linked Datasets.
The First Annual KnowEscape Conference - KnowEscape 2013, Espoo, Finland, November, 2013.
bib pdf The idea of opening data on the Web as Linked Data (LD) is widely adopted in areas such as public government, science, libraries, and cultural heritage. The key idea is to harmonize, integrate, enrich, and re-use existing data repositories in a cost-efficient way via standard APIs in novel applications. This paper concerns two major hindrances for re-using LD: It is often difficult for a re-user to understand the 1) characteristics of the dataset and 2) evaluate the quality of the data for her intended purpose. This paper introduces the “Linked Data Finland” publishing platform LDF.fi addressing these issues. In order to enhance and promote reusability, we propose extending the famous 5-star model of Tim Berners-Lee into a 7-star model: The sixth star requires that the dataset is defined and explained in terms of explicit schemas. Explicit schemas make it possible to explain the re-user the intended characteristics of the data by, e.g., documentation about the schemas, and how the schemas (vocabularies) are actually used in the given dataset. The seventh star is given, if the data has also been validated w.r.t. the schema specifications. The results of the validation may be a human readable document and/or a machine readable reprentation regarding the quality issues found in the data. This paper reports about work in progress, but the first prototype of the platform is already operational on the web as a service http://ldf.fi.
Matias Frosterus, Jouni Tuominen, Sini Pessala, Katri Seppälä and Eero Hyvönen:
Linked Open Ontology Cloud KOKO--Managing a System of Cross-domain Lightweight Ontologies.
The Semantic Web: ESWC 2013 Satellite Events, pp. 296-297, Springer-Verlag, Berlin Heidelberg, Montpellier, France, May 26-30, 2013.
bib pdf
Matias Frosterus, Jouni Tuominen, Mika Wahlroos and Eero Hyvönen:
The Finnish Law as a Linked Data Service.
The Semantic Web: ESWC 2013 Satellite Events, pp. 289-290, Springer-Verlag, Berlin Heidelberg, Montpellier, France, May 26-30, 2013.
bib pdf Juridical information is important to organizations and individuals alike and is linked to from all walks of life. The Finnish government has published the Finlex Data Bank for searching and browsing legislation documents. However, the data there is not yet open, is based on a traditional XML schema, and does not conform to new semantic metadata standards. There are many difficulties in maintaining and using the site in, e.g., data harvesting, interoperability, querying, and linking that could be mitigated by the Semantic Web technologies. This paper presents an approach and a project—including first results—for publishing and using Finnish legislation as a 5-star Linked Open Data service.
Jouni Tuominen, Nina Laurenne, Mikko Koho and Eero Hyvönen:
The Birds of the World Ontology AVIO.
The Semantic Web: ESWC 2013 Satellite Events, pp. 300-301, Springer-Verlag, Berlin Heidelberg, Montpellier, France, May 26-30, 2013.
bib pdf We present an ontology for managing the scientific and common names of birds. The ontology is based on the TaxMeOn meta-ontology model for biological names. The ontology is in use as an ontology service and it has been applied in a bird watching system.
Jouni Tuominen, Nina Laurenne and Eero Hyvönen:
Publishing and Using Plant Names as an Ontology Service.
Proceedings of the first international Workshop on Semantics for Biodiversity (S4BioDiv), ESWC 2013, CEUR Workshop Proceedings, Vol 979, Montpellier, France, May, 2013.
bib pdf link Animals and plants are referred to using scientific or common names depending on the expertise of an audience or a source of data. The names change in time and therefore their usage as identifiers as such is problematic. We present a solution for managing and using plant names as an ontology. The ontology is based on the TaxMeOn meta-ontology for biological names. In order to refer to organisms unambiguously and publish information as Linked Data on the web, the names are given URIs. The ontology is developed collaboratively and it supports the approval process and temporal tracking of the common names. We introduce an ontology service of plant names for end-users and provide user interfaces and APIs for integrating the ontology into applications.
Eero Hyvönen, Miika Alonen, Mikko Koho and Jouni Tuominen:
BirdWatch--Supporting Citizen Scientists for Better Linked Data Quality for Biodiversity Management.
Proceedings of the first international Workshop on Semantics for Biodiversity (S4BioDiv), ESWC 2013, CEUR Workshop Proceedings, Vol 979, Montpellier, France, May, 2013.
bib pdf link Observational data about species of public interest, such as birds and butterflies, is often created and collected by volunteered citizen scientists, and used by professionals for managing biodiversity. The education and skills of the citizens participating in the work varies a lot, and the process of making observations is typically not systematic but rather ad hoc. As a result, the quality of the observational data in repositories, such as the Global Biodiversity Information Facility GBIF Data Portal, is often not good, hampering its utilization severely. This paper presents an approach for enhancing data quality in a citizen science setting, and presents a mobile tool BirdWatch for citizen observers, mitigating difficulties in producing high quality Linked Data for biodiversity management.
2012
Kim Viljanen, Jouni Tuominen, Eetu Mäkelä and Eero Hyvönen:
Normalized Access to Ontology Repositories.
Proceedings of the Sixth International Conference on Semantic Computing (IEEE ICSC 2012), IEEE Press, Palermo, Italy, September, 2012.
bib pdf Ontology repositories, such as NCBO Bioportal, ONKI and Cupboard, help finding and using ontologies on the Semantic Web. However, currently each ontology repository constitutes a separate island with its own user interface, APIs, users, ontology languages and set of ontologies. Because there is not a universal way to access all ontology repositories, doing global search, browsing, and inference over all available ontology repositories turns out to be technically difficult and is generally not done. Ontologies are not reused as much as they could and hence the full potential of ontologies is not achieved. To address the problem, we propose the Normalized Ontology Repository (NOR) approach to make the ontology repositories universally accessible while maintaining their unique functionalities and strengths. The SKOS language is used as the lowest common denominator for presenting the ontologies. In addition, a simple API for searching and accessing the ontologies is defined. As a proof-of-concept evaluation, we present three case implementations to demonstrate the NOR approach: 1) the distributed architecture of the ONKI repository, 2) the metasearch for ONKI and NCBO Bioportal, and 3) publishing informal ontological concept collections as NOR end-points, demonstrated with the semantic portal CultureSampo and the metadata editor SAHA.
Jouni Tuominen, Kim Viljanen and Eero Hyvönen:
Ontologiapalvelut semanttisessa webissä (Ontology services on the Semantic Web). (in Finnish), Tietojenkäsittelytiede, no. 34, pp. 17-36, Tietojenkäsittelytieteen Seura ry, April, 2012.
bib pdf Ontologiat ovat keskeinen osa semanttista webiä: ne toimivat yhteisinä jaettuina käsitteistöinä, joiden avulla tietokoneet voivat käsitellä tietoa älykkäämmin. Jotta eri toimijat voivat hyödyntää yhteisiä käsitteistöjä sovelluksissaan, ontologiat on julkaistava heidän käyttöönsä. Yksinkertaisimmillaan ontologiat voidaan julkaista tiedostomuodossa. Tällöin jokainen toimija joutuu toteuttamaan itse toiminnallisuuksia ontologioiden hyödyntämiseen. Koska osa toiminnallisuuksista on yleisiä, useissa järjestelmissä toistuvia, niiden toteuttaminen valmiina palveluina on mielekästä. Palveluita voidaan tarjota ihmiskäyttäjille käyttöliittymäkomponentteina sekä ohjelmalliseen käyttöön rajapintoina, joita käyttämällä toiminnallisuudet voidaan integroida asiakasjärjestelmiin. Tässä artikkelissa kuvataan ontologioiden käyttäjäryhmien tarpeita sekä ontologiapalveluiden toteutuksia. Yleisten ontologioiden käyttämiseen liittyvien toiminnallisuuksien tarjoamiseksi esitetään ontologiapalvelu ONKI, joka on osa Suomalaiset semanttisen webin ontologiat -hankesarjassa (FinnONTO, 2003–2012) kehitettyä ontologiainfrastruktuuria. Artikkeli perustuu Jouni Tuomisen pro gradu -työhön, jolle Tietojenkäsittelytieteen Seura ry myönsi lukuvuoden 2009–2010 pro gradu -palkinnon. Tutkimustyöhön ovat osallistuneet myös Kim Viljanen ja Eero Hyvönen.
2011
Sini Pessala, Katri Seppälä, Osma Suominen, Matias Frosterus, Jouni Tuominen and Eero Hyvönen:
MUTU: An Analysis Tool for Maintaining a System of Hierarchically Linked Ontologies.
ISWC 2011 - Ontologies come of Age Workshop (OCAS), Bonn, Germany, October, 2011.
bib pdf We consider ontology evolution in a system of light-weight Linked Data ontologies, aligned with each other to form a larger ontology system. When one ontology changes, the human editor must keep track of the actual changes and of the modifications needed in the related ontologies in order to keep the system consistent. This paper presents an analysis tool MUTU, by which such changes and their potential effects on other ontologies can be found. Such an analysis is useful for the ontology editors for understanding the differences between ontology versions, and for updating linked ontologies when changes occurred in other components of an ontology system.
Kim Viljanen, Jouni Tuominen, Eetu Mäkelä and Eero Hyvönen:
Combining Distributed Ontology Repositories into a Global Service. June, 2011. Draft paper.
bib pdf Ontologies and vocabularies are a key resource for creating interoperable metadata on the Semantic Web. To make finding and using ontologies easier, the idea of Ontology Repositories has been introduced with current implementations including e.g. the NCBO Bioportal, ONKI and Cupboard. There is a genuine need for different kinds of Ontology Repositories, each focusing on different kinds specific user-needs, different ontologies and different organizational requirements which cannot be addressed by a single general implementation. However, at the moment each Ontology Repository is a separate island with its own user interfaces and APIs. They also use varying ontology languages such as OWL, SKOS, and RDF Schema. Due to this, global search, browsing, and inference over the repositories is difficult and generally not done which means that, for example, finding and reusing existing ontologies becomes difficult. To address the problems, we have developed a loosely coupled Network of Ontology Repositories (NOR) architecture that makes the repositories globally interoperable while maintaining their unique functionalities and strengths. To participate in the network, each ontology repository is required to implement a shared API. As a proof-of-concept evaluation, we present three case implementations demonstrating different aspects of the NOR approach: 1) internal distributed architecture of ONKI, 2) global search of ONKI and NCBO Bioportal, 3) publishing non-ontological concept collections as NOR endpoints, demonstrated with the semantic portal CultureSampo and the metadata editor SAHA.
Jouni Tuominen, Nina Laurenne and Eero Hyvönen:
Biological Names and Taxonomies on the Semantic Web - Managing the Change in Scientific Conception.
Proceedings of the 8th Extended Semantic Web Conference (ESWC 2011), Springer-Verlag, Heraklion, Greece, June, 2011.
bib pdf Biodiversity management requires the usage of heterogeneous biological information from multiple sources. Indexing, aggregating, and finding such information is based on names and taxonomic knowledge of organisms. However, taxonomies change in time due to new scientific findings, opinions of authorities, and changes in our conception about life forms. Furthermore, organism names and their meaning change in time, different authorities use different scientific names for the same taxon in different times, and various vernacular names are in use in different languages. This makes data integration and information retrieval difficult without detailed biological information. This paper introduces a meta-ontology for managing the names and taxonomies of organisms, and presents three applications for it: 1) publishing biological species lists as ontology services (ca. 20 taxonomies including more than 80,000 names), 2) collaborative management of the vernacular names of vascular plants (ca. 26,000 taxa), and 3) management of individual scientific name changes based on research results, covering a group of beetles. The applications are based on the databases of the Finnish Museum of Natural History and are used in a living lab environment on the web.
2010
Kim Viljanen, Jouni Tuominen and Eero Hyvönen:
A Network of Ontology Repositories. December, 2010. Draft paper.
bib pdf Ontologies and vocabularies are a key resource for creating interoperable metadata on the Semantic Web. To make the finding and using ontologies easier, the idea of Ontology Repositories have been introduced with current implementations including e.g. the NCBO Bioportal, ONKI and Cupboard. However, at the moment each ontology repository is a separate island with its own user interfaces and APIs. They also use varying ontology languages such as OWL, SKOS, RDF Schema and others. Due to this, global search, browsing, and inference over the repositories is difficult and generally not done. At the same time, there is a genuine need for different kinds of Ontology Repositories, each focusing on different kinds specific user-needs, different ontologies and different organizational requirements which can not be addressed by a single global implementation. Since there are benefits of having interoperability among the repositories, we have developed a loosely coupled Network of Ontology Repository (NOR) architecture that makes the repositories globally interoperable while maintaining their unique functionalities and strengths. To participate in the network, each ontology repository is required to implement a shared API. As a proof-of-concept, we present a global metasearch prototype for searching simultaneously hundreds of ontologies in the ONKI and NCBO Bioportal repositories.
Jouni Tuominen, Mikko Salonoja, Kim Viljanen and Eero Hyvönen:
A User Interface for Ontology Repositories.
Workshop on Ontology Repositories and Editors for the Semantic Web (ORES 2010), the Extended Semantic Web Conference ESWC 2010, CEUR Workshop Proceedings, Vol. 596, Heraklion, Greece, June, 2010.
bib pdf link Finding ontologies and concepts from a collection of ontologies is a recurring task in many use cases, such as content indexing, searching, and ontology developing. To facilitate this, efficient search and browsing methods are needed. This paper introduces ONKI2, an ontology browser providing a user interface for a repository of ontologies. The system provides a multi-facet search facility for finding an ontology. Finding concepts is supported by autocompletion-based text search that can be refined with additional restrictions. ONKI2 is in use in the Finnish Ontology Library Service ONKI for a collection of 79 ontologies and vocabularies.
Kim Viljanen, Jouni Tuominen, Mikko Salonoja and Eero Hyvönen:
Global Access to Distributed Ontology Repositories.
Poster Papers, the Extended Semantic Web Conference ESWC 2010, Heraklion, Greece, June, 2010.
bib pdf Ontology repository systems are used for publishing and sharing ontologies. However, currently the repositories form separate islands of ontologies, which hinders the user from finding and utilizing the most suitable ontological concepts and ontologies on a global level. In contrast, this paper presents the idea of creating a network of Linked Open Ontology Services (LOOS) based on a set of ontology services that publish their content via a shared API. This facilitates global search and browsing over all ontologies in the network. LOOS has been implemented in the National Finnish Ontology Service ONKI serving currently 79 ontologies.
Kim Viljanen, Jouni Tuominen, Mikko Salonoja and Eero Hyvönen:
Linked Open Ontology Services.
Workshop on Ontology Repositories and Editors for the Semantic Web (ORES 2010), the Extended Semantic Web Conference ESWC 2010, CEUR Workshop Proceedings, Vol. 596, Heraklion, Greece, June, 2010.
bib pdf link Ontology repository systems are used for publishing and sharing ontologies and vocabularies for content indexing, information retrieval, content integration, and other purposes. However, interlinking these distributed repositories to provide global search and browsing over the repositories has not been made. In the spirit of Linked Open Data, we propose creating a network of Linked Open Ontology Services (LOOS) consisting of ontology repositories that publish their content using a shared API. To test the approach, we have defined an HTTP API and present a proof-of-concept implementation consisting of three client applications that are used for accessing a LOOS network of over 50 ontology servers, part of the Ontology Library Service ONKI.
Jouni Tuominen:
Helppokytkentäiset ontologiapalvelut semanttisessa webissä. MSc Thesis (in Finnish), University of Helsinki, Department of Computer Science, May, 2010.
bib pdf link Ontologiat luovat semanttisen webin perustan: ne toimivat yhteisinä jaettuina käsitteistöinä, joiden avulla tietokoneet voivat käsitellä tietoa älykkäämmin. Jotta eri toimijat voivat hyödyntää yhteisiä käsitteistöjä sovelluksissaan, ontologiat on julkaistava heidän käyttöönsä. Yksinkertaisimmillaan ontologiat voidaan julkaista datana, tiedostomuodossa. Tällöin jokainen toimija joutuu toteuttamaan itse toiminnallisuuksia ontologioiden hyödyntämiseen. Osa toiminnallisuuksista on yleisiä, useissa järjestelmissä toistuvia, kuten ontologian visualisointi, selaaminen ja käsitehaku. On kuitenkin kustannustehokkaampaa toteuttaa yleisiä ontologiatoiminnallisuuksia valmiina palveluina. Palveluita voidaan tarjota ihmiskäyttäjille käyttöliittymäkomponentteina sekä ohjelmalliseen käyttöön rajapintoina, joita käyttämällä toiminnallisuudet voidaan integroida asiakasjärjestelmiin. Lisäksi käytettäessä ontologioita palveluina toimijoiden käytössä on aina ontologioiden ajantasaiset versiot. Tässä tutkielmassa kuvataan ontologioiden käyttäjäryhmien -- ontologioiden kehittäjien, tiedon annotoijien, tiedon hakijoiden ja semanttisen webin sovellusten kehittäjien -- tarpeita sekä esitellään ontologioiden hyödyntämiseen kehitettyjä sovelluksia. Yleisten ontologioiden käyttämiseen liittyvien toiminnallisuuksien tarjoamiseksi esitetään ontologiapalvelu ONKI, joka julkistettiin virallisesti käyttöön syyskuussa 2008.
2009
Jouni Tuominen, Tomi Kauppinen, Kim Viljanen and Eero Hyvönen:
Ontology-Based Query Expansion Widget for Information Retrieval.
Proceedings of the 5th Workshop on Scripting and Development for the Semantic Web (SFSW 2009), 6th European Semantic Web Conference (ESWC 2009), CEUR Workshop Proceedings, Vol. 449, Heraklion, Greece, May 31 - June 4, 2009.
bib pdf link In this paper we present an ontology-based query expansion widget which utilizes the ontologies published in the ONKI Ontology Service. The widget can be integrated into a web page, e.g. a search system of a museum catalogue, enhancing the page by providing a query expansion functionality. We have tested the system with general, domain-specific and spatio-temporal ontologies.
Jouni Tuominen, Matias Frosterus, Kim Viljanen and Eero Hyvönen:
ONKI SKOS Server for Publishing and Utilizing SKOS Vocabularies and Ontologies as Services.
Proceedings of the 6th European Semantic Web Conference (ESWC 2009), pp. 768-780, Springer-Verlag, Heraklion, Greece, May 31 - June 4, 2009.
bib pdf Vocabularies are the building blocks of the Semantic Web providing shared terminological resources for content indexing, information retrieval, data exchange, and content integration. Most semantic web applications in practical use are based on lightweight ontologies and, more recently, on the Simple Knowledge Organization System (SKOS) data model being standardized by W3C. Easy and cost-efficient publication, integration, and utilization methods of vocabulary services are therefore highly important for the proliferation of the Semantic Web. This paper presents the ONKI SKOS Server for these tasks. Using ONKI SKOS, a SKOS vocabulary or a lightweight ontology can be published on the web as ready-to-use services in a matter of minutes. The services include not only a browser for human usage, but also Web Service and AJAX interfaces for concept finding, selecting and transporting resources from the ONKI SKOS Server to connected systems. Code generation services for AJAX and Web Service APIs are provided automatically, too. ONKI SKOS services are also used for semantic query expansion in information retrieval tasks. The idea of publishing ontologies as services is analogous to Google Maps. In our case, however, vocabulary services are provided and mashed-up in applications. ONKI SKOS was published in the beginning of 2008 and is to our knowledge the first generic SKOS server of its kind. The system has been used to publish and utilize some 60 vocabularies and ontologies in the National Finnish Ontology Service ONKI www.yso.fi.
Eero Hyvönen, Eetu Mäkelä, Tomi Kauppinen, Olli Alm, Jussi Kurki, Tuukka Ruotsalo, Katri Seppälä, Joeli Takala, Kimmo Puputti, Heini Kuittinen, Kim Viljanen, Jouni Tuominen, Tuomas Palonen, Matias Frosterus, Reetta Sinkkilä, Panu Paakkarinen, Joonas Laitio, Katariina Nyberg:
CultureSampo - A National Publication System of Cultural Heritage on the Semantic Web 2.0.
Proceedings of the 6th European Semantic Web Conference (ESWC2009), Heraklion, Greece, May 31 - June 4, 2009. Springer-Verlag.
bib pdf CULTURESAMPO is an application demonstration of a national level publication system of cultural heritage contents on the Web, based on ideas and technologies of the Semantic (Web and) Web 2.0. On the semantic side, the system presents new solutions to interoperability problems of dealing with multiple ontologies of different domains, and to problems of integrating multiple metadata schemas and cross-domain content into a homogeneous semantic portal. A novelty of the system is to use semantic models based on events and narrative process descriptions for modeling and visualizing cultural phenomena, and for semantic recommendations. On the Web 2.0 side, CULTURESAMPO proposes and demonstrates a content creation process for collaborative, distributed ontology and content development including different memory organizations and citizens. The system provides the cultural heritage contents to end-users in a new way through multiple (nine) thematic perspectives, based on semantic visualizations. Furthermore, CULTURESAMPO services are available for external web-applications to use through semantic AJAX widgets.
Kim Viljanen, Jouni Tuominen and Eero Hyvönen:
Ontology Libraries for Production Use: The Finnish Ontology Library Service ONKI.
Proceedings of the 6th European Semantic Web Conference (ESWC 2009), pp. 781-795, Springer-Verlag, Heraklion, Greece, May 31 - June 4, 2009.
bib pdf This paper discusses problems of creating and using ontology library services in production use. One approach to a solution is presented with an online implementation--the Finnish Ontology Library Service ONKI--that is in pilot use on a national level in Finland. ONKI contributes to previous research on ontology libraries in many ways: First, mashup and web service support with various tools is provided for cost-efficient utilization of ontologies in indexing and search applications. Second, services covering the different phases of the ontology life cycle are provided. Third, the services are provided and used in real world applications on a national scale. Fourth, the ontology framework is being developed by a collaborative effort by organizations representing different application domains, such as health, culture, and business.
Eero Hyvönen, Eetu Mäkelä, Tomi Kauppinen, Olli Alm, Jussi Kurki, Tuukka Ruotsalo, Katri Seppälä, Joeli Takala, Kimmo Puputti, Heini Kuittinen, Kim Viljanen, Jouni Tuominen, Tuomas Palonen, Matias Frosterus, Reetta Sinkkilä, Panu Paakkarinen, Joonas Laitio, Katariina Nyberg:
CultureSampo - Finnish Culture on the Semantic Web 2.0. Thematic Perspectives for the End-user.
Proceedings, Museums and the Web 2009, Indianapolis, USA, April 15-18, 2009.
bib pdf We present an overview of CultureSampo, an ambitious system for creating a collective semantic memory of the cultural heritage of a nation on the Semantic Web 2.0, combining ideas underlying the Semantic Web and the Web 2.0. The system addresses the semantic web challenge of aggregating highly heterogeneous, cross-domain cultural heritage collections and other contents into a semantically rich intelligent system for human and machine users. At the same time, CultureSampo is an approach to solve the social and practical Web 2.0 challenge of organizing the underlying collaborative ontology development and content creation work of memory organizations and citizens. This paper focuses on CultureSampo’s search, recommendation, and visualization services for the end-users. The key idea here is to access cultural heritage on the Semantic Web through nine “thematic perspectives”, such as places on the maps, the social network of cultural persons, timelines, and narrative texts, e.g. biographies and literary works.
Eero Hyvönen, Eetu Mäkelä, Tomi Kauppinen, Olli Alm, Jussi Kurki, Tuukka Ruotsalo, Katri Seppälä, Joeli Takala, Kimmo Puputti, Heini Kuittinen, Kim Viljanen, Jouni Tuominen, Tuomas Palonen, Matias Frosterus, Reetta Sinkkilä, Panu Paakkarinen, Joonas Laitio, Katariina Nyberg:
CultureSampo - Finnish Cultural Heritage Collections on the Semantic Web 2.0.
Proceedings of the 1st International Symposium on Digital Humanities for Japanese Arts and Cultures (DH-JAC-2009), Ritsumeikan University, Kyoto, Japan, March, 2009.
bib pdf This paper presents an overview of the SemanticWeb 2.0 application CultureSampo, an ambitious system for creating a collective semantic memory of the cultural heritage of a nation on the Semantic Web 2.0, combining ideas underlying the Semantic Web and the Web 2.0. The system addresses the semantic web challenge of aggregating highly heterogeneous, cross-domain cultural heritage content into a semantically rich intelligent system for human and machine users. At the same time, CultureSampo is an approach to solve the social and practical Web 2.0 challenge of organizing the underlying collaborative ontology development and content creation work of memory organizations and citizens.
Tomi Kauppinen, Heini Kuittinen, Jouni Tuominen, Katri Seppälä and Eero Hyvönen:
Extending an Ontology by Analyzing Annotation Co-occurrences in a Semantic Cultural Heritage Portal.
Proceedings of the ASWC 2008 Workshop on Collective Intelligence (ASWC-CI 2008) organized as a part of the 3rd Asian Semantic Web Conference (ASWC 2008), Bangkok, Thailand, February 2-5, 2009.
bib pdf Ontologies aim to capture knowledge about things and their relationships. Publishing ontologies on the Semantic Web enables people and organizations to use shared ontologies in annotating e.g. photographs, videos, music, and other types of cultural objects. Search engines also use relationships provided by ontologies in semantic search, e.g. for query expansion or for view-based search. However, building ontologies is a time-consuming process, and it should be helped by automatic finding of interesting, possible relationships. Finding the correct concept for annotation purposes is helped by subsumption and partonomy hierarchies and associative relationships. In this paper we show how an analysis of co-occurrences of concepts in annotations can be used to provide interesting relationships for enriching ontological structures. We use association rule mining techniques and test the idea using a set of annotations of cultural objects in CULTURESAMPO portal and the Finnish General Upper Ontology YSO. The results are visualized in the ONKI SKOS browser to give an additional layer on top of the original relationships of the YSO ontology. An analysis shows that best ranked relationships should also be included in the ontology as subclassof or associative relationships.
2008
Tuukka Ruotsalo, Katri Seppälä, Kim Viljanen, Eetu Mäkelä, Jussi Kurki, Olli Alm, Tomi Kauppinen, Jouni Tuominen, Matias Frosterus, Reetta Sinkkilä and Eero Hyvönen:
Ontology-based Approach for Interoperability of Digital Collections. Signum, no. 5, 2008.
bib pdf This paper presents solutions and lessons learned in FinnONTO project carried out in Finland in 2003–2007. The paper focuses on three aspects of interoperability of digital collections. First, transforming thesauri to ontologies. Second, publishing ontologies for the use of indexers and content providers. Third, ontology based methods for improving end user access to digital collections. The first aspect is analysed through case studies done with Finnish thesauri. The second is discussed by presenting the ONKI ontology server. The last aspect is demonstrated in the scope of the semantic portal CultureSampo for publishing cultural heritrage on the Semantic Web.
Eero Hyvönen, Kim Viljanen, Jouni Tuominen, Katri Seppälä, Tomi Kauppinen, Matias Frosterus, Reetta Sinkkilä, Jussi Kurki, Olli Alm, Eetu Mäkelä and Joonas Laitio:
National Ontology Infrastructure Service ONKI. Oct 1, 2008.
bib pdf This paper presents the national level cross-domain ontology and ontology service infrastructure ONKI used in Finland. The novelty of ONKI is based on two ideas. First, the core ontologies are developed collaboratively by experts transforming thesauri into mutually aligned lightweight ontologies, based on a large top ontology that is extended by various domain specific ontologies. Second, the National Ontology Service ONKI has been implemented for publishing ontologies cost-efficiently as ready to use services. ONKI provides legacy and other applications with ready to use functionalities for using ontologies on the HTML level by Ajax and semantic widgets. ONKI has been used in various applications for creating mash-up applications in a way analogous to using Google Maps, but in our case external applications are mashed-up with ontology support for indexing and information retrieval.
Eero Hyvönen, Eetu Mäkelä, Tomi Kauppinen, Olli Alm, Jussi Kurki, Tuukka Ruotsalo, Katri Seppälä Kim Viljanen, Jouni Tuominen, Tuomas Palonen, Matias Frosterus, Reetta Sinkkilä, Panu Paakkarinen, Joonas Laitio, Katariina Nyberg:
CultureSampo - A Collective Memory of Finnish Cultural Heritage on the Semantic Web 2.0.
Semantic Computing Research Group, Helsinki University of Technology and University of Helsinki, Sept 29, 2008.
bib pdf This paper presents the Semantic Web 2.0 application CULTURESAMPO, an ambitious system of creating a collective semantic memory of the cultural heritage of a nation on the Semantic Web 2.0, combining ideas underlying the Semantic Web and the Web 2.0. The system addresses the semantic challenge of aggregating highly heterogeneous, cross-domain cultural heritage into a semantically rich intelligent system for human and machine users. At the same time, CULTURESAMPO is an approach to solve the social and practical Web 2.0 challenge of organizing the underlying collaborative ontology development and content creation work of memory organizations and citizens.
Jouni Tuominen, Matias Frosterus, Kim Viljanen and Eero Hyvönen:
ONKI-SKOS - Publishing and Utilizing Thesauri in the Semantic Web.
AI and Machine Consciousness - Proceedings of the 13th Finnish Artificial Intelligence Conference STeP 2008, Espoo, Finland, August 20-22, 2008.
bib pdf Thesauri and other controlled vocabularies act as building blocks of the Semantic Web by providing shared terminology for facilitating information retrieval, data exchange and integration. Representation and publishing methods are needed for utilizing thesauri efficiently, e.g., in content indexing and searching. W3C has provided the Simple Knowledge Organization System (SKOS) data model for expressing concept schemes, such as thesauri. A standard representation format for thesauri eliminates the need for implementing thesaurus specific rules or applications for processing them. However, there do not exist general tools which provide out of the box support for publishing and utilizing SKOS vocabularies in applications, without needing to implement application specific user interfaces for end users. For solving this problem the ONKI-SKOS server is presented.
Kim Viljanen, Jouni Tuominen and Eero Hyvönen:
Publishing and Using Ontologies as Mash-Up Services.
Proceedings of the 4th Workshop on Scripting for the Semantic Web (SFSW2008), 5th European Semantic Web Conference 2008 (ESWC 2008), CEUR Workshop Proceedings, Vol. 368, Tenerife, Spain, June 1-5, 2008.
bib pdf link The Semantic Web is based on using ontologies for enabling semantically disambiguated data exchange between distributed systems on the web. This requires efficient means for publishing ontologies on the web to ensure the availability, sharing and acceptance of the ontologies. Support services are needed for utilizing ontologies easily and cost-effectively in applications and legacy systems lacking ontology support. To address these vital needs, this paper presents the ONKI ontology service which provides ready-to-use mash-up functionalities, such as semantic disambiguation, concept finding and concept fetching as ready-to-use web widgets for adding ontology support to e.g. HTML forms using JavaScript. Two implementations of the ONKI Server are presented: ONKI-SKOS for ontologies presented in the Simple Knowledge Organization System (SKOS) language and ONKI-Geo for geographical ontologies with a map interface. The presented ONKI systems are operational on the web, used in the National Finnish Ontology Service. They have been successfully used in several pilot applications.
Eero Hyvönen, Kim Viljanen, Jouni Tuominen and Katri Seppälä:
Building a National Semantic Web Ontology and Ontology Service Infrastructure - The FinnONTO Approach.
Proceedings of the European Semantic Web Conference ESWC 2008, pp. 95-109, Springer, Tenerife, Spain, June, 2008.
bib pdf This article presents the vision and results of creating a national level cross-domain ontology service infrastructure in Finland in the FinnONTO project. The novelty of the infrastructure is based on two ideas. First, a system of open source core ontologies is being developed by transforming thesauri into mutually aligned lightweight ontologies, including a top ontology of 20,000 concepts that is extended by various domain specific ontologies. Second, the ONKI Ontology Server framework for publishing ontologies as ready to use services has been designed and implemented. ONKI provides legacy and other applications with ready to use functionalities for using ontologies on the user interface level as semantic widgets. The idea is to use ONKI for creating mash-up applications in a way analogous to using Google or Yahoo Maps, but in our case external applications are mashed-up with ontology support. The ontology framework presented is operational on the web and is being used in creating the application demonstrations.
Kim Viljanen, Jouni Tuominen, Teppo Känsälä and Eero Hyvönen:
Distributed Semantic Content Creation and Publication for Cultural Heritage Legacy Systems.
Proceedings of the 2008 IEEE International Conference on Distributed Human-Machine Systems, IEEE Press, Athens, Greece, March 9-12, 2008.
bib pdf Cultural heritage is by nature strongly interlinked, e.g. thematically and historically, but at the same time distributed in heterogeneous collections of different memory organizations at different locations. In order to provide the end-users with aggregated homogeneous views to distributed heterogeneous contents, semantic portals have been created successfully based on metadata and shared (or aligned) ontologies. This paper discusses two problems encountered in such a distributed semantic content creation environment. First, during the content creation work, how could a publisher start using shared ontologies in legacy cataloguing and annotation systems that do not support ontologies. Second, during content publication, how could a publisher re-use the aggregated content in its own legacy publication system, e.g., on the ordinary web pages of a museum or in a collection browser. As a solution, we present the ONKI Ontology Server for adding shared ontological annotation functionalities to legacy cataloguing systems in a practical, cost-efficient and lightweight way. For distributed publishing of the aggregated semantic portal services, we introduce the lightweight mash-up web widget components called floatlets . A major idea behind both the ONKI functionalities and floatlets is that they can be easily integrated with legacy systems on the user interface level, in the same spirit as e.g. Google Maps.
2007
Eetu Mäkelä, Kim Viljanen, Olli Alm, Jouni Tuominen, Onni Valkeapää, Tomi Kauppinen, Jussi Kurki, Reetta Sinkkilä, Teppo Känsälä, Robin Lindroos, Osma Suominen, Tuukka Ruotsalo and Eero Hyvönen:
Enabling the Semantic Web with Ready-to-Use Web Widgets.
Proceedings of the First Industrial Results of Semantic Technologies Workshop, ISWC2007, pp. 56-69, CEUR Workshop Proceedings, Vol. 293, November 11, 2007.
bib pdf link A lot of functionality is needed when an application, such as a museum cataloguing system, is extended with semantic capabilities, for example ontological indexing functionality or multi-facet search. To avoid duplicate work and to enable easy and cost-efficient integration of information systems with the Semantic Web, we propose a web widget approach. Here, data sources are combined with functionality into readyto-use software components that allow adding semantic functionality to systems with just a few lines of code. As a proof of the concept, we present a collection of general semantic web widgets and case applications that use them, such as the ontology server ONKI, the annotation editor SAHA and the culture portal CultureSampo.
Kim Viljanen, Jouni Tuominen, Eero Hyvönen, Eetu Mäkelä and Osma Suominen:
Extending Content Management Systems with Ontological Annotation Capabilities.
Poster proceedings of the 6th International Semantic Web Conference, Busan, Korea, November 11-15, 2007.
bib pdf Producing semantic metadata requires efficient methods, e.g., concept finding, for accessing and using ontologies. To add such functionalities to metadata applications such as cataloging systems in museums, we propose a \emphmash-up approach where ready-to-use user interface components for using specific ontologies are made available to be integrated into applications. As a proof-of-concept, we present the \emphOntology Service ONKI wich implements semantic autocompletion concept search and concept browsing for ontologies as shared mash-up components.
Kim Viljanen, Jouni Tuominen and Eero Hyvönen:
ONKI Ontology Server--Extending Legacy Systems with Ontology Mash-up Services. November, 2007. Draft paper.
bib pdf The Semantic Web is based on using shared ontologies for enabling semantically disambiguated data exchange between distributed systems on the web. This requires, from the ontology publisher s viewpoint, efficient means for publishing ontologies on the web to ensure the availability and acceptance of the ontologies. From the ontology user s viewpoint, support services are needed for utilizing ontologies easily and cost-effectively in the users own systems that are typically legacy systems without ontology support. This paper presents the ONKI ontology server for addressing these vital needs. For the publisher, ONKI provides a server and a Simple Knowledge Organization (SKOS) compatible light-weight ontology browser with ready-made web interfaces for making ontologies available both for human and machine users. For external legacy and other applications, ONKI provides centralized ontology services for semantic disambiguation, concept finding, and concept fetching. A major contribution of ONKI is to provide these services as ready-to-use functionalities for creating mash-up applications very cost-efficiently. Two prototypes of the system---ONKI-SKOS for all kinds of ontologies and ONKI-Geo for geographical ontologies with a map mash-up interface---are operational on the web and are currently being successfully used in several pilot applications.
Eero Hyvönen, Kim Viljanen, Eetu Mäkelä, Tomi Kauppinen, Tuukka Ruotsalo, Onni Valkeapää, Katri Seppälä, Osma Suominen, Olli Alm, Robin Lindroos, Teppo Känsälä, Riikka Henriksson, Matias Frosterus, Jouni Tuominen, Reetta Sinkkilä and Jussi Kurki:
Elements of a National Semantic Web Infrastructure - Case Study Finland on the Semantic Web (Invited paper).
Proceedings of the First International Semantic Computing Conference (IEEE ICSC 2007), Irvine, California, September, 2007. IEEE Press.
bib pdf This article presents the vision and results of creating the basis for a national semantic web content infrastructure in Finland in 2003-2007. The main elements of the infrastructure are shared and open metadata schemas, core ontologies, and public ontology services. Several practical applications testing and demonstrating the usefulness of the infrastructure are overviewed in the fields of eCulture, eHealth, eGovernment, eLearning, and eCommerce.
Kim Viljanen, Eero Hyvönen, Eetu Mäkelä, Osma Suominen and Jouni Tuominen:
Mash-up Ontology Services for the Semantic Web. Demo track at the European Semantic Web Conference ESWC 2007, Innsbruck, Austria, June 4-5, 2007.
bib pdf We present ONKI ontology server, a mash-up approach for integrating ontology library services with semantic web applications. The idea of ONKI is to provide applications with ready-to-use ontology service functionalities, such as semantic autocompletion, browsing, and annotation support, at the user interface level using AJAX mash-up technologies. The system is being integrated with various semantic web applications.
(total: 149 publications)