- FIN-CLARIAH Research Infrastructure
 A new national research infrastructure initiative FIN-CLARIAH for...
 8.12.2021 8:12 by eahyvone
- WarMemoirSampo published on December 3, 2021
 A new “Sampo” application, “WarMemoirSampo”...
 8.12.2021 8:04 by eahyvone
- Five new SeCo papers accepted for the ISWC 2021
 The 20th International Semantic Web Conference (ISWC 2021), the...
 2.8.2021 6:53 by eahyvone
- Annastiina Ahola, Lilli Peura, Rafael Leal, Heikki Rantala and Eero Hyvönen: Using generative AI and LLMs to enrich art collection metadata for searching, browsing, and studying art history in Digital Humanities
- Eero Hyvönen, Petri Leskinen, Henna Poikkimäki, Heikki Rantala, Jouni Tuominen, Senka Drobac, Ossi Koho, Ilona Pikkanen and Hanna-Leena Paloposki: LetterSampo Finland (1809–1917) Data Service and Portal: Searching, Exploring, and Analyzing Historical Letters and Their Underlying Networks
- Michael Lewis, Eljas Oksanen, Frida Ehrnsten, Heikki Rantala, Jouni Tuominen and Eero Hyvönen: The Impact of Human Decision-making on the Research Value of Archaeological Data
- Tomaž Erjavec, Matyáš Kopp, Nikola Ljubešić, Taja Kuzman, Paul Rayson, Petya Osenova, Maciej Ogrodniczuk, Çağrı Çöltekin, Danijel Koržinek, Katja Meden, Jure Skubic, Peter Rupnik, Tommaso Agnoloni, José Aires, Starkaður Barkarson, Roberto Bartolini, Núria Bel, María Calzada Pérez, Roberts Darģis, Sascha Diwersy, Maria Gavriilidou, Ruben van Heusden, Mikel Iruskieta, Neeme Kahusk, Anna Kryvenko, Noémi Ligeti-Nagy, Carmen Magariños, Martin Mölder, Costanza Navarretta, Kiril Simov, Lars Magne Tungland, Jouni Tuominen, John Vidler, Adina Ioana Vladu, Tanja Wissik, Väinö Yrjänäinen and and Darja Fišer: ParlaMint II: Advancing Comparable Parliamentary Corpora Across Europe
OUTDATED INFORMATION: THIS IS A HISTORICAL PAGE OF A FORMER SECO MEMBER
Minna Tamper 
phone: +358 50 431 6071 (office)
email: firstnames.lastname@aalto.fi
room: 3171 @ Department of Computer Science, Maarintie 8, Espoo 
postal address: Department of Computer Science, P.O. Box 15500, FI-00076 Aalto, Finland
Currently working in the ParliamentSampo and InTaVia projects.
Also worked in the following projects:
- COST Action European network for Web-centred linguistic data science
- Anoppi
- Severi
- Semantic Finlex
- WarSampo
See see citation indices in Google Scholar.
Minna Tamper is a doctoral candidate at Aalto University and University of Helsinki. She received her M.Sc. (Tech.) at Aalto University in 2016. Her research interests include linked data, natural language processing and data analysis. She has published over 20 research articles since 2016, and has received national and international awards. She is currently a substitute member of COST NexusLinguarum management committee, an organiser of Aalto Digi Platform Digital Humanities Pizza Seminar, and as a reviewer on scientific conferences and journals. She has collaborated with libraries, archives, and Finnish ministries on cultural heritage data analysis since 2015.
Publications
2023
Minna Tamper, Laura Sinikallio, Jouni Tuominen and Eero Hyvönen: Transforming Linguistically Annotated Finnish Parliamentary Debates Into the Parla-CLARIN Format.   Digital Humanities in the Nordic and Baltic Countries Seventh Conference (DHNB 2023), Book of Abstracts (Sofie Gilbert and Annika Rockenberger (eds.)),        pp. 118,  University of Oslo Library, Oslo, Norway,    March, 2023. bib link 
Minna Tamper: From Text to Knowledge: Methods, Tools, and Applications for Digital Humanities Based on Linked Data.  Dissertation (in English),          Aalto University, Department of Computer Science,  February, 2023. bib pdf link 
Minna Tamper, Petri Leskinen, Eero Hyvönen, Risto Valjus and Kirsi Keravuori: Analyzing Biography Collection Historiographically as Linked Data: Case National Biography of Finland.        Semantic Web – Interoperability, Usability, Applicability, vol. 14,  no. 2,  pp. 385-419,  IOS Press,     2023. bib pdf link 
2022
Paul Groth, Anisa Rula, Jodi Schneider, Ilaria Tiddi, Elena Simperl, Panos Alexopoulos, Rinke Hoekstra, Mehwish Alam, Anastasia Dimou and Minna Tamper (eds.): The Semantic Web: ESWC 2022 Satellite Events - Hersonissos, Crete, Greece, May 29 - June 2, 2022, Proceedings.       Lecture Notes in Computer Science,  vol. 13384,    Springer,     2022. bib pdf link 
Paul Groth, Maria-Esther Vidal, Fabian M. Suchanek, Pedro A. Szekely, Pavan Kapanipathi, Catia Pesquita, Hala Skaf-Molli and Minna Tamper (eds.): The Semantic Web - 19th International Conference, ESWC 2022, Hersonissos, Crete, Greece, May 29 - June 2, 2022, Proceedings.       Lecture Notes in Computer Science,  vol. 13261,    Springer,     2022. bib pdf link 
Arttu Oksanen, Eero Hyvönen, Minna Tamper, Jouni Tuominen, Henna Ylimaa, Katja Löytynoja, Matti Kokkonen and Aki Hietanen: An Anonymization Tool for Open Data Publication of Legal Documents.   AI4LEGAL-KGSUM 2022: Artificial Intelligence Technologies for Legal Documents and Knowledge Graph Summarization 2022,       vol. 3257,   pp. 12-21,  CEUR Workshop Proceedings,    August, 2022. bib pdf link 
Eero Hyvönen, Minna Tamper, Esko Ikkala, Mikko Koho, Rafael Leal, Joonas Kesäniemi, Arttu Oksanen,  Jouni Tuominen and Aki Hietanen: LawSampo Portal and Data Service for Publishing and Using Legislation and Case Law as Linked Open Data on the Semantic Web.   AI4LEGAL-KGSUM 2022: Artificial Intelligence Technologies for Legal Documents and Knowledge Graph Summarization 2022,       vol. 3257,   pp. 41-50,  CEUR Workshop Proceedings,    August, 2022. bib pdf link 
Henna Poikkimäki, Petri Leskinen, Minna Tamper and Eero Hyvönen: Analyses of Networks of Politicians Based on Linked Data: Case ParliamentSampo - Parliament of Finland on the Semantic Web.   New Trends in Database and Information Systems,         pp. 585-592,  Springer International Publishing,    August, 2022. bib pdf link 
Eero Hyvönen, Laura Sinikallio, Petri Leskinen, Matti La Mela, Jouni Tuominen, Kimmo Elo, Senka Drobac, Mikko Koho, Esko Ikkala, Minna Tamper, Rafael Leal and Joonas Kesäniemi: Linked Data Approach for Studying Parliamentary Speeches and Networks of Politicians in Finland 1907-2021 (long paper).   Digital Humanities 2022, Conference Abstracts, July 25-29, 2022 Online, Tokyo. Japan, University of Tokyo,         pp. 254-257,  ADHO,    July, 2022. bib link 
Eero Hyvönen, Esko Ikkala, Mikko Koho, and Rafael Leal, Heikki Rantala and Minna Tamper: How to Search and Contextualize Scenes inside Videos for Enriched Watching Experience: Case Stories of the Second World War Veterans.   The Semantic Web: ESWC 2022 Satellite Events,     Lecture Notes in Computer Science,  vol. 13384,   pp. 163-167,  Springer,    July, 2022. bib pdf link 
Mikko Koho, Rafael Leal, Esko Ikkala, Minna Tamper, Heikki Rantala and Eero Hyvönen: Building Lightweight Ontologies for Faceted Search with Named Entity Recognition: Case WarMemoirSampo.   Proceedings of the 1st International Workshop on Knowledge Graph Generation From Text and the 1st International Workshop on Modular Knowledge co-located with 19th Extended Semantic Conference (ESWC 2022) (Sanju Tiwari, Nandana Mihindukulasooriya, Francesco Osborne, Dimitris Kontokostas, Jennifer D’Souza and Mayank Kejriwal (eds.)),      vol. 3184,   pp. 19-35,  CEUR Workshop Proceedings,    May, 2022. International Knowledge Graph Generation From Text (TEXT2KG).  bib pdf link 
Eero Hyvönen, Laura Sinikallio, Petri Leskinen, Matti La Mela, Jouni Tuominen, Kimmo Elo, Senka Drobac, Mikko Koho, Esko Ikkala, Minna Tamper, Rafael Leal and Joonas Kesäniemi: Finnish Parliament on the Semantic Web:  Using ParliamentSampo Data Service and Semantic Portal for Studying Political Culture and Language.   Digital Parliamentary data in Action (DiPaDA 2022), Workshop at the 6th Digital Humanities in Nordic and Baltic Countries Conference, long paper,         pp. 69-85,  CEUR Workshop Proceedings, Vol. 3133,    May, 2022. bib pdf link 
Minna Tamper, Rafael Leal, Laura Sinikallio, Petri Leskinen, Jouni Tuominen and Eero Hyvönen: Extracting Knowledge from Parliamentary Debates for Studying Political Culture and Language.   Proceedings of the 1st International Workshop on Knowledge Graph Generation From Text and the 1st International Workshop on Modular Knowledge co-located with 19th Extended Semantic Conference (ESWC 2022) (Sanju Tiwari, Nandana Mihindukulasooriya, Francesco Osborne, Dimitris Kontokostas, Jennifer D’Souza and Mayank Kejriwal (eds.)),      vol. 3184,   pp. 70-79,  CEUR WS,    May, 2022. International Workshop on Knowledge Graph Generation from Text (TEXT2KG 2022).  bib pdf link 
Arttu Oksanen, Minna Tamper, Jouni Tuominen, Aki Hietanen and Eero Hyvönen: A Tool for Pseudonymization of Textual Documents for Digital Humanities Research and Publication.   6th Digital Humanities in Nordic and Baltic Countries Conference, poster paper, book of abstracts,         pp. 107-108,      March, 2022. bib pdf link 
Minna Tamper, Jouni Tuominen and Eero Hyvönen: Extending the Finnish Linked Data Infrastructure with Natural Language Processing Services in FIN-CLARIAH.   DHNB 2022 The 6th Digital Humanities in Nordic and Baltic Countries Conference,         pp. 443-446,  CEUR Workshop Proceedings, Vol. 3232,     2022. bib pdf link 
Eero Hyvönen, Petri Leskinen, Minna Tamper, Heikki Rantala, Esko Ikkala, Jouni Tuominen and Kirsi Keravuori: Linked Data – A Paradigm Shift for Publishing and Using Biography Collections on the Semantic Web.   Proceedings of the Third Conference on Biographical Data in a Digital World (BD 2019),         pp. 16-23,  CEUR-WS Proceedings, vol. 3152,     2022. bib pdf link 
2021
Eero Hyvönen, Petri Leskinen, Minna Tamper, Heikki Rantala, Esko Ikkala, Jouni Tuominen and  Kirsi Keravuori: Biografiasampo yhdistää ja rikastaa suomalaiset elämäkerrat linkitettynä datana semanttisessa webissä  (Biographysampo links and enriches Finnish biographies as linked data on the Semantic Web.        Informaatiotutkimus, vol. 40,  no. 3,  pp. 346-368,      November, 2021. bib pdf link 
Eero Hyvönen, Laura Sinikallio, Petri Leskinen, Senka Drobac, Jouni Tuominen, Kimmo Elo, Matti La Mela, Mikko Koho, Esko Ikkala, Minna Tamper, Rafael Leal and Joonas Kesäniemi: Parlamenttisampo: eduskunnan aineistojen linkitetyn avoimen datan palvelu ja sen käyttömahdollisuudet.        Informaatiotutkimus, vol. 40,  no. 3,  pp. 216-244,      November, 2021. bib pdf link 
Minna Tamper, Eero Hyvönen and Petri Leskinen: Visualizing and Analyzing Networks of Named Entities in Biographical Dictionaries for Digital Humanities Research.   Proceedings of the 20th International Conference on Computational Linguistics and Intelligent Text Processing (CICling 2019),          Springer-Verlag,    October, 2021. Forth-coming.  bib pdf 
This paper shows how named entity extraction and networkanalysis can be used to examine biographies individually and in groupsto aid historians in biographical and prosopographical research. For this purpose a reference network of 13 100 biographies in the collections ofthe Biographical Centre of the Finnish Literature Society was created, based on links between the biographies as well as automatically extracted named entities found in the texts. The data was published in a SPARQL endpoint as a Linked Data knowledge graph on top of which network analytic tools were created and analysis were done showing the usefulness of the approach in Digital Humanities. The reference graph has been utilized for network analysis to examine egocentric networks of individual persons as well as networks among groups of people in prosopography. The data and tools presented are in use since autumn 2018 in the semantic portal BiographySampo that has had tens of thousands of users.
Laura Sinikallio, Senka Drobac, Minna Tamper, Rafael Leal, Mikko Koho, Jouni Tuominen, Matti La Mela and Eero Hyvönen: Plenary Debates of the Parliament of Finland as Linked Open Data and in Parla-CLARIN Markup.   3rd Conference on Language, Data and Knowledge, LDK 2021,     Open Access Series in Informatics (OASIcs),  vol. 93,   pp. 8:1-8:17,  Schloss Dagstuhl - Leibniz-Zentrum für Informatik GmbH,   Zaragoza, Spain, August, 2021. bib pdf link 
Mikko Koho, Esko Ikkala, Petri Leskinen, Minna Tamper, Jouni Tuominen and Eero Hyvönen: WarSampo Knowledge Graph: Finland in the Second World War as Linked Open Data.        Semantic Web – Interoperability, Usability, Applicability, vol. 12,  no. 2,  pp. 265-278,      January, 2021. bib pdf link 
The Second World War (WW2) is arguably the most devastating catastrophe of human history, a topic of great interest to not only researchers but the general public. However, data about the Second World War is heterogeneous and distributed in various organizations and countries making it hard to utilize. In order to create aggregated global views of the war, a shared ontology and data infrastructure is needed to harmonize information in various data silos. This makes it possible to share data between publishers and application developers, to support data analysis in Digital Humanities research, and to develop data-driven intelligent applications. As a first step towards these goals, this article presents the WarSampo knowledge graph (KG), a shared semantic infrastructure, and a Linked Open Data (LOD) service for publishing data about WW2, with a focus on Finnish military history. The shared semantic infrastructure is based on the idea of representing war as a spatio-temporal sequence of events that soldiers, military units, and other actors participate in. The used metadata schema is an extension of CIDOC CRM, supplemented by various military historical domain ontologies. With an infrastructure containing shared ontologies, maintaining the interlinked data brings upon new challenges, as one change in an ontology can propagate across several datasets that use it. To support sustainability, a repeatable automatic data transformation and linking pipeline has been created for rebuilding the whole WarSampo KG from the individual source datasets. The WarSampo KG is hosted on a data service based on W3C Semantic Web standards and best practices, including content negotiation, SPARQL API, download, automatic documentation, and other services supporting the reuse of the data. The WarSampo KG, a part of the international LOD Cloud and totalling ca. 14 million triples, is in use in nine end-user application views of the WarSampo portal, which has had over 400 000 end users since its opening in 2015.
2020
Minna Tamper, Arttu Oksanen, Jouni Tuominen, Aki Hietanen and Eero Hyvönen: Automatic Annotation Service APPI: Named Entity Linking in Legal Domain.   The Semantic Web: ESWC 2020 Satellite Events (Harth, Andreas, Presutti, Valentina, Troncy, Raphaël, Acosta, Maribel, Polleres, Axel, Fernández, Javier D., Xavier Parreira, Josiane, Hartig, Olaf, Hose, Katja and Cochez, Michael (eds.)),    Lecture Notes in Computer Science,  vol. 12124,   pp. 208-213,  Springer-Verlag,     2020. bib pdf link 
Eero Hyvönen, Minna Tamper, Esko Ikkala, Sami Sarsa, Arttu Oksanen,  Jouni Tuominen and Aki Hietanen: Publishing and Using Legislation and Case Law as Linked Open Data on the Semantic Web.   The Semantic Web: ESWC 2020 Satellite Events (Harth, Andreas, Presutti, Valentina, Troncy, Raphaël, Acosta, Maribel, Polleres, Axel, Fernández, Javier D., Xavier Parreira, Josiane, Hartig, Olaf, Hose, Katja and Cochez, Michael (eds.)),    Lecture Notes in Computer Science,  vol. 12124,   pp. 110-114,  Springer-Verlag,     2020. bib pdf link 
Minna Tamper, Petri Leskinen, Jouni Tuominen and Eero Hyvönen: Modeling and Publishing Finnish Person Names as a Linked Open Data Ontology.   3rd Workshop on Humanities in the Semantic Web (WHiSe 2020),         pp. 3-14,  CEUR Workshop Proceedings, vol. 2695,    June, 2020. bib pdf link 
2019
Arttu Oksanen, Jouni Tuominen, Eetu Mäkelä, Minna Tamper, Aki Hietanen and Eero Hyvönen: Semantic Finlex: Transforming, Publishing, and Using Finnish Legislation and Case Law As Linked Open Data on the Web.   Knowledge of the Law in the Big Data Age (G. Peruginelli and S. Faro (eds.)),    Frontiers in Artificial Intelligence and Applications,  vol. 317,   pp. 212-228,  IOS Press,     2019. ISBN 978-1-61499-984-3 (print); ISBN 978-1-61499-985-0 (online).  bib pdf link 
Governments publish legislation and case law widely in print and on the Web. Such legal information is provided for human consumption, but the information is usually not available as data for algorithmic analysis and applications to use. However, this would be beneficial in many use cases, such as building more intelligent juridical online services and conducting research into legislation and legal practice. To address these needs, this Chapter presents Semantic Finlex, a national in-use data resource and service for publishing Finnish legislation and related case law as Linked Open Data for legal applications to use. The system transforms and interlinks on a regular basis data from the legacy legal database Finlex of the Ministry of Justice into Linked Open Data, based on the European standards ECLI and ELI. The published data is hosted on the  7-star  Linked Data Finland service and SPARQL endpoint with a variety of related services available that ease data re-use. Rich Internet Applications using SPARQL for data access are presented as application demonstrators of the data service. In addition, this Chapter presents methods and tools under development to automatically annotate legal texts and to anonymize case law documents prior to their publication on the Web. Anonymization is necessary due to issues of data protection and privacy, and annotation is needed for semantic search and interlinking the documents. The automated approaches could significantly speed up the process and minimize costs of publishing legal documents as Linked Open Data.
Arttu Oksanen, Minna Tamper, Jouni Tuominen, Aki Hietanen and Eero Hyvönen: Anoppi: A Pseudonymization Service for Finnish Court Documents.   Legal Knowledge and Information Systems. JURIX 2019: The Thirty-second Annual Conference (Araszkiewicz, M. and Rodríguez-Doncel, V. (eds.)),        pp. 251-254,  IOS Press,    December, 2019. bib pdf link 
Mikko Koho, Erkki Heino, Petri Leskinen, Esko Ikkala, Minna Tamper, Kasper Apajalahti, Jouni Tuominen, Eetu Mäkelä and Eero Hyvönen: WarSampo Knowledge Graph.            Zenodo,    October, 2019. Dataset.  bib link 
WarSampo Knowledge Graph includes harmonized data of different kinds concerning the Second World War in Finland, separated in different subgraphs representing events, actors, places, photographs, and other aspects and documentation of the war. The data covers the Winter War 1939-1940 against the Soviet attack, the Continuation War 1941-1944 where the occupied areas of the Winter War were temporarily regained, and the Lapland War 1944-1945, where the Finns pushed the German troops away from Lapland.
Minna Tamper, Arttu Oksanen, Jouni Tuominen, Aki Hietanen and Eero Hyvönen: Automatic Annotation Service: Utilizing a Named Entity Linking Tool in Legal Domain.                September, 2019. Submitted.  bib pdf 
Eero Hyvönen, Minna Tamper, Esko Ikkala, Sami Sarsa, Arttu Oksanen, Jouni Tuominen and Aki Hietanen: LawSampo: A Semantic Portal on a Linked Open Data Service for Finnish Legislation and Case Law.                September, 2019. Submitted.  bib pdf 
Eero Hyvönen, Petri Leskinen, Minna Tamper, Heikki Rantala, Esko Ikkala, Jouni Tuominen and Kirsi Keravuori: BiographySampo - Publishing and Enriching Biographies on the Semantic Web for Digital Humanities Research.   The Semantic Web. ESWC 2019 (Pascal Hitzler, Miriam Fernández, Krzysztof Janowicz, Amrapali Zaveri, Alasdair J.G. Gray, Vanessa Lopez, Armin Haller and Karl Hammar (eds.)),        pp. 574-589,  Springer-Verlag,    June, 2019. bib pdf link 
Eero Hyvönen, Petri Leskinen, Minna Tamper, Heikki Rantala, Esko Ikkala, Jouni Tuominen and Kirsi Keravuori: Demonstrating BiographySampo in Solving Digital Humanities Research Problems in Biography and Prosopography.   The Fourth Digital Humanities in the Nordic Countries 2019 (DHN2019), Book of Abstracts,          University of Copenhagen,   Copenhagen, Denmark, March, 2019. bib pdf link 
Matti La Mela, Minna Tamper and Kimmo Kettunen: Finding Nineteenth-century Berry Spots: Recognizing and Linking Place Names in a Historical Newspaper Berry-picking Corpus.   The Fourth Digital Humanities in the Nordic Countries 2019 (DHN2019),          CEUR Workshop Proceedings,   Copenhagen, Denmark, March, 2019. bib pdf link 
2018
Minna Tamper, Petri Leskinen, Kasper Apajalahti and Eero Hyvönen: Using Biographical Texts as Linked Data for Prosopographical Research and Applications.   Digital Heritage. Progress in Cultural Heritage: Documentation, Preservation, and Protection. 7th International Conference, EuroMed 2018, Nicosia, Cyprus (Marinos Ioannides, Eleanor Fink, Raffaella Brumana, Petros Patias, Anastasios Doulamis, João Martins and Manolis Wallace (eds.)),        pp. 125-137,  Springer-Verlag,    November, 2018. bib pdf link 
Eero Hyvönen, Petri Leskinen, Minna Tamper, Heikki Rantala, Esko Ikkala, Jouni Tuominen and Kirsi Keravuori: Biografiasammon tekoäly yhdistää ja rikastaa suomalaiset elämäkerrat semanttisessa webissä.            Aalto-yliopisto, Semanttisen laskennan tutkimusryhmä (SeCo),    Nov, 2018. bib pdf 
Biografiasampo-järjestelmä käynnistää uuden aikakauden elämäkertakokoelmien julkaisemisessa ja käyttämisessä verkossa. Järjestelmän ydinaineistona on Kansallisbiografia ja muut Suomalaisen Kirjallisuuden Seuran (SKS) ja tieteellisten seurojen toimittamat pienoiselämäkerrat, yhteensä 13 100 elämäntarinaa, joita on kirjoittanut 900 suomalaista tutkijaa. Biografiasammon innovaationa on luoda kieliteknologian, tekoälyn ja semanttisen webin teknologioiden avulla elämäkertojen teksteistä ja niihin eri lähteissä liittyvistä tiedoista tietämysverkko (knowledge graph) ja kansallinen tietoinfrastruktuuri, joka koostuu miljoonista tietojen välisistä yhteyksistä. Tietämysverkko on julkaistu linkitetyn datan palvelussa, jonka varaan on toteutettu seitsemästä sovellusnäkymästä koostuva älykäs, kaikille avoin ja maksuton verkkopalvelu biografiasampo.fi kansalaisten ja digitaalisten ihmistieteiden tutkijoiden käytettäväksi.
Arttu Oksanen, Jouni Tuominen, Eetu Mäkelä, Minna Tamper, Aki Hietanen, and Eero Hyvönen: Semantic Finlex: Finnish Legislation and Case Law as a Linked Open Data Service.   Proceedings of Law via the Internet 2018 (LVI 2018), Knowledge of the Law in the Big Data Age, abstracts,             Florence, Italy, October, 2018. bib pdf 
Minna Tamper, Arttu Oksanen, Jouni Tuominen, Eero Hyvönen and Aki Hietanen: Anonymization Service for Finnish Case Law: Opening Data without Sacrificing Data Protection and Privacy of Citizens.   Proceedings of Law via the Internet 2018 (LVI 2018), Knowledge of the Law in the Big Data Age, abstracts,             Florence, Italy, October, 2018. bib pdf 
Minna Tamper, Arttu Oksanen, Eero Hyvönen: Schema.org - hakukonejättien semanttinen web (Schema.org - The Semantic Web of Search Engine Giants).        Tietojohtaminen,        April, 2018. bib pdf 
Eero Hyvönen, Petri Leskinen, Minna Tamper, Jouni Tuominen and Kirsi Keravuori: Semantic National Biography of Finland.   Proceedings of the Digital Humanities in the Nordic Countries 3rd Conference (DHN 2018),         pp. 372-385,  CEUR Workshop Proceedings, Vol-2084,   Helsinki, Finland, March, 2018. bib pdf link 
This paper presents the vision of publishing and utilizing textual biographies as Linked (Open) Data on the Semantic Web. As a case study, we publish the live stories of the National Biography of Finland, created by the Finnish Literature Society, as semantic, i.e., machine “understandable” metadata in a SPARQL endpoint using the Linked Data Finland (LDF.fi) service. On top of the data service various Digital Humanities applications are built. The applications include searching and studying individual personal histories as well as historical research of groups of persons using methods of prosopography. The biographical data is enriched by extracting events from unstructured and semi-structured texts, and by linking entities internally and to external data sources. A faceted semantic search engine is provided for filtering groups of people from the data for prosopographical research. An extension of the event-based CIDOC CRM ontology is used as the underlying data model, where lives are seen as chains of interlinked events populated from the data of the biographies and additional data sources, such as museum collections, library databases, and archives.
2017
Petri Leskinen, Mikko Koho, Erkki Heino, Minna Tamper, Esko Ikkala, Jouni Tuominen, Eetu Mäkelä and Eero Hyvönen: Modeling and Using an Actor Ontology of Second World War Military Units and Personnel.   Proceedings of the 16th International Semantic Web Conference (ISWC 2017) (Claudia d Amato, Miriam Fernandez, Valentina Tamma, Freddy Lecue, Philippe Cudré-Mauroux, Juan Sequeda, Christoph Lange and Jeff Heflin (eds.)),        pp. 280-296,  Springer-Verlag,   Vienna, Austria, October, 2017. bib pdf link 
This paper presents a model for representing historical military personnel and army units, based on large datasets about World War II in Finland. The model is in use in WarSampo data service and semantic portal, which has had tens of thousands of distinct visitors. A key challenge is how to represent ontological changes, since the ranks and units of military personnel, as well as the names and structures of army units change rapidly in wars. This leads to serious problems in both search as well as data linking due to ambiguity and homonymy of names. In our solution, actors are represented in terms of the events they participated in, which facilitates disambiguation of personnel and units in different spatio-temporal contexts. The linked data in the WarSampo Linked Open Data cloud and service has ca. 9 million triples, including actor datasets of ca. 100 000 soldiers and ca. 16 100 army units. To test the model in practice, an application for semantic search and recommending based on data linking was created, where the spatio-temporal life stories of individual soldiers can be reassembled dynamically by linking data from different datasets. An evaluation is presented showing promising results in terms of linking precision.
Eero Hyvönen, Erkki Heino, Petri Leskinen, Esko Ikkala, Mikko Koho, Minna Tamper, Jouni Tuominen and Eetu Mäkelä: WarSampo: Publishing and Using Linked Open Data about the Second World War.        EuropeanaTech Insight,  no. 7,   Europeana,    September, 2017. bib pdf link 
The article overviews the system WarSampo – Finnish World War 2 on the Semantic Web, the winner of the LODLAM Challenge 2017 Open Data Prize on June 29 in Venice, Italy.
Erkki Heino, Minna Tamper, Eetu Mäkelä, Petri Leskinen, Esko Ikkala, Jouni Tuominen, Mikko Koho and Eero Hyvönen: Named Entity Linking in a Complex Domain: Case Second World War History.   Proceedings, Language, Data and Knowledge (LDK 2017),         pp. 120-133,  Springer-Verlag,   Galway, Ireland, June, 2017. bib pdf link 
This paper discusses the challenges of applying named entity linking in a rich, complex domain – specifically, the linking of 1) military units, 2) places and 3) people in the context of rich Second World War data. Multiple sub-scenarios are discussed in detail through concrete evaluations, analyzing the problems faced, and the solutions developed. A key contribution of this work is to highlight the heterogeneity of problems and approaches needed even inside a single domain, depending on both the source data as well as the target authority.
Minna Tamper, Petri Leskinen, Esko Ikkala, Arttu Oksanen, Eetu Mäkelä, Erkki Heino, Jouni Tuominen, Mikko Koho and Eero Hyvönen: AATOS – a Configurable Tool for Automatic Annotation.   Proceedings, Language, Data and Knowledge (LDK 2017),         pp. 276-289,  Springer-Verlag,   Galway, Ireland, June, 2017. bib pdf link 
This paper presents an automatic annotation tool AATOS for providing documents with semantic annotations. The tool links entities found from the texts to ontologies defined by the user. The application is highly configurable and can be used with different natural language Finnish texts. The application was developed as a part of WarSampo and Semantic Finlex projects and tested using Kansa Taisteli magazine articles and consolidated Finnish legislation of Semantic Finlex. The quality of the automatic annotation was evaluated by measuring precision and recall against existing manual annotations. The results showed that the quality of the input text, as well as the selection and configuration of the ontologies impacted the results.
Eero Hyvönen, Arttu Oksanen, Jouni Tuominen, Eetu Mäkelä and Minna Tamper: Semanttinen Finlex. Laki ja oikeus avoimena linkitettynä datana. (Semantic Finlex. Law and Justice as Linked Open Data.).        Oikeus-lehti, vol. 46,  no. 1,  pp. 107-115,      March, 2017. bib pdf link 
2016
Minna Tamper: Extraction of Entities and Concepts from Finnish Texts.  MSc Thesis (in English),          Aalto University, School of Science, Degree Programme in Computer Science and Engineering,  Dec, 2016. bib pdf 
Keywords are used in many document databases to improve search. The process of assigning keywords from controlled vocabularies to a document is called subject indexing. If the controlled vocabulary used for indexing is an ontology, with semantic relations and descriptions of concepts, the process is also called semantic annotation. In this thesis an automatic annotation tool was created to provide the documents with semantic annotations. The application links entities found from the texts to ontologies defined by the user. The application is highly configurable and can be used with different Finnish texts. The application was developed as a part of WarSampo and Semantic Finlex projects and tested using Kansa Taisteli magazine articles and consolidated legislation of Finnish legislation. The quality of the automatic annotation was evaluated by measuring precision and recall against existing manual annotations. The results showed that the quality of the input text, as well as the selection and configuration of the ontologies impacted the results.
Eero Hyvönen, Erkki Heino, Petri Leskinen, Esko Ikkala, Mikko Koho, Minna Tamper, Jouni Tuominen and Eetu Mäkelä: Publishing Second World War History as Linked Data Events on the Semantic Web.   Proceedings of Digital Humanities 2016, short papers,         pp. 571-573,     Kraków, Poland, July, 2016. bib pdf link 
Data about wars is typically heterogeneous, distributed in the data silos of the fighting parties, multilingual, and often controversial depending on the political point of view. It is therefore hard for the historians to get a global picture of what has actually happened, to whom, where, when, and how. We argue that Semantic Web and Linked Data technologies are a very promising approach for modeling, harmonizing, and aggregating data about war history. Our goal is to make it possible, for both historians and laymen, to study history in a contextualized way where linked datasets enrich each other. The paper presents the in-use WarSampo 1 system, where massive collections of heterogeneous data about the (Finnish) history of the Second World War are harmonized using an event-based approach, and provided as a Linked Open Data service for applications to use. As a use case, a semantic portal WarSampo providing six different perspectives to the war based on events is presented.
Eero Hyvönen, Erkki Heino, Petri Leskinen, Esko Ikkala, Mikko Koho, Minna Tamper, Jouni Tuominen and Eetu Mäkelä: WarSampo Data Service and Semantic Portal for Publishing Linked Open Data about the Second World War History.   The Semantic Web – Latest Advances and New Domains (ESWC 2016) (Harald Sack, Eva Blomqvist, Mathieu d Aquin, Chiara Ghidini, Simone Paolo Ponzetto and Christoph Lange (eds.)),        pp. 758-773,  Springer-Verlag,    May, 2016. bib pdf link 
This paper presents the WarSampo system for publishing collections of heterogeneous, distributed data about the Second World War on the Semantic Web. WarSampo is based on harmonizing massive datasets using event-based modeling, which makes it possible to enrich datasets semantically with each others’ contents. WarSampo has two components: First, a Linked Open Data (LOD) service WarSampo Data for Digital Humanities (DH) research and for creating applications related to war history. Second, a semanticWarSampo Portal has been created to test and demonstrate the usability of the data service. The WarSampo Portal allows both historians and laymen to study war history and destinies of their family members in the war from different interlinked perspectives. Published in November 2015, theWarSampo Portal had some 20,000 distinct visitors during the first three days, showing that the public has a great interest in this kind of applications.
(total: 46 publications)


