- FIN-CLARIAH Research Infrastructure
A new national research infrastructure initiative FIN-CLARIAH for...
8.12.2021 8:12 by eahyvone - WarMemoirSampo published on December 3, 2021
A new “Sampo” application, “WarMemoirSampo”...
8.12.2021 8:04 by eahyvone - Five new SeCo papers accepted for the ISWC 2021
The 20th International Semantic Web Conference (ISWC 2021), the...
2.8.2021 6:53 by eahyvone
- Eero Hyvönen, Annastiina Ahola, Petri Leskinen and Jouni Tuominen: Aggregating and Aligning Knowledge Graphs into a Global Service: SampoSampo System for Cross-cultural Data Search, Exploration, and Analysis
- Eero Hyvönen, Petri Leskinen, Henna Poikkimäki, Heikki Rantala, Jouni Tuominen, Senka Drobac, Ossi Koho, Ilona Pikkanen and Hanna-Leena Paloposki: Searching, exploring, and analyzing historical letters and the underlying networks: LetterSampo Finland (1809–1917) data service and semantic portal
- Eljas Oksanen, Frida Ehrnsten, Heikki Rantala and Eero Hyvönen: Semantic Solutions for Democratising Archaeological and Numismatic Data Analysis
- Annastiina Ahola, Lilli Peura, Rafael Leal, Heikki Rantala and Eero Hyvönen: Using generative AI and LLMs to enrich art collection metadata for searching, browsing, and studying art history in Digital Humanities
Publications
2025
Eero Hyvönen, Annastiina Ahola, Petri Leskinen and Jouni Tuominen: Aggregating and Aligning Knowledge Graphs into a Global Service: SampoSampo System for Cross-cultural Data Search, Exploration, and Analysis. 2025. Abstract, submitted for peer review. bib pdf
Eero Hyvönen, Petri Leskinen, Henna Poikkimäki, Heikki Rantala, Jouni Tuominen, Senka Drobac, Ossi Koho, Ilona Pikkanen and Hanna-Leena Paloposki: Searching, exploring, and analyzing historical letters and the underlying networks: LetterSampo Finland (1809–1917) data service and semantic portal. 2025. Abstract, submitted for peer review. bib pdf
2024
Eljas Oksanen, Frida Ehrnsten, Heikki Rantala and Eero Hyvönen: Semantic Solutions for Democratising Archaeological and Numismatic Data Analysis. ACM Journal of Computing and Cultural Heritage, vol. 16, no. 4, Association for Computing Machinery, 2024. bib pdf link
Annastiina Ahola, Lilli Peura, Rafael Leal, Heikki Rantala and Eero Hyvönen: Using generative AI and LLMs to enrich art collection metadata for searching, browsing, and studying art history in Digital Humanities. Proceedings, 2nd International Conference on Data & Digital Humanities Generative Artificial Intelligence for Text and Multimodal Data 12th - 13th December 2024, University of Minho, Braga, Portugal, November, 2024. Accepted, forth-coming. bib pdf
Eero Hyvönen, Patrik Boman, Heikki Rantala, Annastiiina Ahola and Petri Leskinen: ConfermentSampo - A Knowledge Graph, Data Service, and Semantic Portal for Intangible Academic Cultural Heritage 1643-2023 in Finland. Proceedings of the 6th International Knowledge Graph and Semantic Web Conference, Dec 11-13. 2024, Paris, France, Springer-Verlag, October, 2024. Accepted. bib pdf
Petri Leskinen: Modeling and Using Biographical Linked Data for Prosopographical Data Analysis. Dissertation, Aalto University, School of Science, Department of Computer Science, October, 2024. bib pdf
Eero Hyvönen, Laura Sinikallio, Petri Leskinen, Senka Drobac, Rafael Leal, Matti La Mela, Jouni Tuominen, Henna Poikkimäki and Heikki Rantala: Publishing and Using Parliamentary Linked Data on the Semantic Web: ParliamentSampo System for Parliament of Finland. Semantic Web, October, 2024. In print. bib pdf
Aija Valleala: Suomalaisten säädösten muutoshistorian kuvaaminen ja käyttö Lakisampo-järjestelmässä. MSc Thesis (in Finnish), Aalto University, Department of Computer Science, October, 2024. bib pdf
Eero Hyvönen and Jouni Tuominen: 8-star Linked Open Data Model: Extending the 5-star Model for Better Reuse, Quality, and Trust of Data. Posters, Demos, Workshops, and Tutorials of the 20th International Conference on Semantic Systems (SEMANTiCS 2024), vol. 3759, CEUR Workshop Proceedings, September, 2024. bib pdf link
Annastiina Ahola, Eero Hyvönen, Heikki Rantala and Anne Kauppala: Historical Opera and Music Theatre Performances on the Semantic Web: OperaSampo 1830-1960. SEMANTiCS 2024, 20th International Conference on Semantic Systems, proceedings, IOS Press, September, 2024. Accepted. bib pdf
Eero Hyvönen, Hien Cao, Rafael Leal, Heikki Rantala and Aki Hietanen: A Model and Case Study for Searching and Reading Cross-border Multilingual Legislation on the Semantic Web. SEMANTiCS 2024, 20th International Conference on Semantic Systems, proceedings, IOS Press, September, 2024. Accepted. bib pdf
Heikki Rantala, Petri Leskinen, Lilli Peura and Eero Hyvönen: Representing and searching associations in cultural heritage knowledge graphs using faceted search. Knowledge Graphs in the Age of Language Models and Neuro-Symbolic AI. Proceedings of the 20th International Conference on Semantic Systems, 17–19 September 2024, Amsterdam, The Netherlands, pp. 420-435, IOS Press, September, 2024. bib link
Heikki Rantala, Petri Leskinen, Lilli Peura and Eero Hyvönen: Representing and searching associations in cultural heritage knowledge graphs using faceted search. Knowledge Graphs in the Age of Language Models and Neuro-Symbolic AI. Proceedings of the 20th International Conference on Semantic Systems, 17–19 September 2024, Amsterdam, The Netherlands, pp. 420-435, IOS Press, September, 2024. bib pdf link
Henna Poikkimäki, Petri Leskinen, Eero Hyvönen: Using Network Analysis for Studying Cultural Heritage Knowledge Graphs – Case Correspondence Networks in Grand Duchy of Finland 1809–1917. August, 2024. Under review. bib pdf
Henna Poikkimäki, Kati Katajisto, Petri Leskinen and Eero Hyvönen: Applying Network and Bibliometric Analyses to Mentions of Politicians in Plenary Speeches: Case ParliamentSampo - Parliament of Finland on the Semantic Web. July, 2024. Submitted for evaluation. bib pdf
Eero Hyvönen: Military History on the Semantic Web: Lessons Learned from Developing Three In-use Linked Open Data Services and Semantic Portals for Digital Humanities. Intelligent Computing for Cultural Heritage: Global Developments and China´s Innovations, Francis & Taylor, Routledge, July, 2024. Book chapter. bib pdf link
Michael Lewis, Eljas Oksanen, Frida Ehrnsten, Heikki Rantala, Jouni Tuominen and Eero Hyvönen: The Impact of Human Decision-making on the Research Value of Archaeological Data. June, 2024. Submitted for evaluation. bib pdf
Rafael Leal, Annastiina Ahola and Eero Hyvönen: Using LLMs for Enriching Metadata with Links to KOS and Knowledge Graphs: Case Finnish Named Entity Linking. NKOS Workshop on AI and KOS at DCMI-2024 in Toronto, June, 2024. Accepted, https://www.dublincore.org/news/2024/06-29_nkos-workshop/. bib pdf
Annastiina Ahola, Heikki Rantala and Eero Hyvönen: ArtSampo - Finnish art on the Semantic Web. The Semantic Web: ESWC 2024 Satellite Events, Hersonissos, Crete, Greece, May 26 - 30, 2024, Proceedings, Springer, May, 2024. bib pdf
Patrik Boman: Promootiosampo: Järjestelmä suomalaisen promootioperinteen kuvaamiseen, julkaisemiseen ja tutkimiseen semanttisessa webissä (ConfermentSampo: A System for modeling, publishing, and studying conferment traditions on the Semantic Web. MSc Thesis (in Finnish), Aalto University, Department of Computer Science, May, 2024. bib pdf
Heikki Rantala, Eljas Oksanen, Frida Ehrnsten and Eero Hyvönen: Publishing Numismatic Public Finds on the Semantic Web for Digital Humanities Research – CoinSampo Linked Open Data Service and Semantic Portal. SemDH 2024, First International Workshop of Semantic Digital Humanities, co-located with ESWC 2024, Hersonissos, Greece, Proceedings, CEUR Workshop Proceedings, Vol. 3724, May, 2024. bib pdf link
Heikki Rantala, Eljas Oksanen, Frida Ehrnsten and Eero Hyvönen: Searching and Analyzing Coin Finds with a Linked Data Based Web Application. The Semantic Web: ESWC 2024 Satellite Events, Hersonissos, Crete, Greece, May 26 - 30, 2024, Proceedings, Springer, May, 2024. bib pdf
Eero Hyvönen, Hien Cao, Rafael Leal, Heikki Rantala and Aki Hietanen: Searching and analyzing cross-border multilingual legislation on the Semantic Web. The Semantic Web: ESWC 2024 Satellite Events, Hersonissos, Crete, Greece, May 26 - 30, 2024, Proceedings, Springer, May, 2024. bib pdf
Annastiina Ahola, Telma Peura, Eero Hyvönen: Using Linked Data for Data Analytic Literary Research: Case BookSampo - Finnish Fiction Literature on the Semantic Web. May, 2024. Submitted for review. bib pdf link
Tomaž Erjavec, Matyáš Kopp, Nikola Ljubešić, Taja Kuzman, Paul Rayson, Petya Osenova, Maciej Ogrodniczuk, Çağrı Çöltekin, Danijel Koržinek, Katja Meden, Jure Skubic, Peter Rupnik, Tommaso Agnoloni, José Aires, Starkaður Barkarson, Roberto Bartolini, Núria Bel, María Calzada Pérez, Roberts Darģis, Sascha Diwersy, Maria Gavriilidou, Ruben van Heusden, Mikel Iruskieta, Neeme Kahusk, Anna Kryvenko, Noémi Ligeti-Nagy, Carmen Magariños, Martin Mölder, Costanza Navarretta, Kiril Simov, Lars Magne Tungland, Jouni Tuominen, John Vidler, Adina Ioana Vladu, Tanja Wissik, Väinö Yrjänäinen and and Darja Fišer: ParlaMint II: Advancing Comparable Parliamentary Corpora Across Europe. March, 2024. Submitted. bib link
Eero Hyvönen, Mikko Koho, Angel Daza and Gregor Pobežin (eds.): BD2022 Proceedings of the BD2022 Biographical Data in a Digital World 2022 Conference. Institute of Cultural History, ZRC SAZUs, Ljubljana, January, 2024. bib link
Petri Leskinen and Eero Hyvönen: Biographical and Prosopographical Analyses of Finnish Academic People 1640–1899 Based on Linked Open Data. Proceedings of the Biographical Data in a Digital World 2022 (BD 2022), Tokyo, Institute of Cultural History, ZRC SAZU, Ljubljana, Slovenia, January, 2024. bib pdf link
Eero Hyvönen: Creating and Using Biographical Dictionaries for Digital Humanities Based on Linked Data: A Survey of Web Services in Use in Finland. Proceedings of the Biographical Data in a Digital World 2022 (BD 2022), Tokyo, Institute of Cultural History, ZRC SAZU, Ljubljana, Slovenia, January, 2024. bib pdf link
Eero Hyvönen, Mikko Koho, Angel Daza and Gregor Pobežin: Editorial of the Proceedings of the BD2022 Biographical Data in a Digital World 2022 Conference. Proceedings of the Biographical Data in a Digital World 2022 (BD 2022), Tokyo, Institute of Cultural History, ZRC SAZU, Ljubljana, Slovenia, January, 2024. bib pdf link
Mikko Koho and Eero Hyvönen: Studying Occupations and Social Measures of Perished Soldiers in WarSampo Linked Open Data. Proceedings of the Biographical Data in a Digital World 2022 (BD 2022), Tokyo, Institute of Cultural History, ZRC SAZU, Ljubljana, Slovenia, January, 2024. bib pdf link
Eero Hyvönen: How to Create a National Cross-domain Ontology and Linked Data Infrastructure and Use It on the Semantic Web. Semantic Web - Interoperability, Usability, Applicability, IOS Press, 2024. DOI: 10.3233/SW-243468. bib pdf link
Eero Hyvönen, Eljas Oksanen, Heikki Rantala and Jouni Tuominen: Kansalaistiedettä Helsingin yliopistossa - Löytösampo kokoaa kansalaisten arkeologiset löydöt semanttisessa webissä. 2024. Blogi-kirjoitus. bib pdf link
Petri Leskinen, Javier Ureña-Carrion, Jouni Tuominen, Mikko Kivelä, Eero Hyvönen: Knowledge Graphs and Data Services for Studying Historical Epistolary Data in Network Science on the Semantic Web. Semantic Web, IOS Press, 2024. Under open review. bib pdf link
Eero Hyvönen: Sampo-järjestelmien verkosto avaa linkitettyä kulttuuridataa tutkijoille ja kansalaisille semanttisessa webissä. Tieteessä tapahtuu, no. 2, 2024. bib pdf
2023
Senka Drobac, Johanna Enqvist, Petri Leskinen, Muhammad Faiz Wahjoe, Heikki Rantala, Mikko Koho, Ilona Pikkanen, Iida Jauhiainen, Jouni Tuominen, Hanna-Leena Paloposki, Matti La Mela and Eero Hyvönen: The Laborious Cleaning: Acquiring and Transforming 19th-Century Epistolary Metadata. Digital Humanities in the Nordic and Baltic Countries Publication, DHNB2023 Conference Proceeding, vol. 5, no. 1, pp. 248-262, University of Oslo Library, Norway, 2023. bib pdf link
Senka Drobac, Laura Sinikallio and Eero Hyvönen: An OCR Pipeline for Transforming Parliamentary Debates into Linked Data: Case ParliamentSampo – Parliament of Finland on the Semantic Web. Digital Humanities in the Nordic and Baltic Countries Publication, DHNB2023 Conference Proceedings, vol. 5, no. 1, pp. 287-296, University of Oslo Library, Norway, 2023. bib pdf link
Mehwish Alam, Victor de Boer, Enrico Daga, Marieke van Erp, Eero Hyvönen and Albert Meroño-Peñuela: Editorial of Special Issue on Cultural Heritage and Semantic Web. Semantic Web - Interoperability, Usability, Applicability, vol. 14, no. 2, pp. 155-158, IOS Press, 2023. bib link
Eljas Oksanen and Michael Lewis: Evaluating Transformations in Small Metal Finds Following the Black Death. Medieval Archaeology, vol. 67, no. 1, pp. 159-186, 2023. bib link
Heikki Rantala, Eero Hyvönen and Petri Leskinen: Finding and explaining relations in a biographical knowledge graph based on life events: Case BiographySampo. ESWC 2023 Workshops and tutorials joint proceedings, CEUR Workshop Proceedings, 2023. Forth-coming. bib pdf
Heikki Rantala, Eero Hyvönen and Petri Leskinen: Finding relations between entities in a knowledge graph: Case artists of the Getty Union List of Artist Names (ULAN). 2023. Submitted for review. bib pdf
Eero Hyvönen, Petri Leskinen and Jouni Tuominen: LetterSampo – Historical Letters on the Semantic Web: A Framework and Its Application to Publishing and Using Epistolary Data of the Republic of Letters. Journal on Computing and Cultural Heritage, vol. 16, no. 1, 2023. bib pdf link
Eljas Oksanen and Johanna Roiha: Methodological Perspectives for Applying Spatial Point Pattern Analyses to Finnish Iron Age Remote Sending Data. Moving Northward. Professor olker Heyd’s Festschrift as he turn 60, pp. 426-444, The Archaeological Society of Finland, 2023. bib pdf
Senka Drobac, Petri Leskinen and Muhammad Faiz Wahjoe: Navigating the Challenges of Deduplicating Actors in Historical Letter Exchanges. Proceedings of the 24th European Conference on Knowledge Management, vol. 24, no. 2, pp. 1694-1697, Academic Conferences International Limited, 2023. bib link
Eero Hyvönen: Parlamenttisampo avaa eduskunnan miljoona puhetta ja kansanedustajien verkostot kaikkien tutkittaviksi. Tieteessä tapahtuu, vol. 41, no. 1, Tieteellisten seurain valtuuskunta (TSV), 2023. bib pdf link
Mikko Koho, L. P. Coladangelo, Lynn Ramson and Doug Emery: Wikibase Model for Premodern Manuscript Metadata Harmonization, Linked Data Integration, and Discovery. ACM Journal of Computing and Cultural Heritage, vol. 16, no. 3, pp. 1-25, 2023. bib pdf link
Matthias Schlögl, Jouni Tuominen, Joonas Kesäniemi, Petri Leskinen, Victor de Boer, Go Sugimoto and Joh Dokler: The InTaVia Knowledge Graph - Publishing European National Biographical and Cultural Heritage Object Data. December, 2023. Submitted for review. bib pdf
Eero Hyvönen, Petri Leskinen and Jouni Tuominen: A Data-driven Approach to Create an Ontology of Parliamentary Work: Case Parliament of Finland on the Semantic Web. Proceedings of SWODCH 2023. Semantic Web and Ontology Design for Cultural Heritage. Co-located with the 22nd International Semantic Web Conference (ISWC 2023) in Athens, Greece, CEUR Workshop Proceedings, Vol-3540, November, 2023. bib pdf link
Eero Hyvönen, Annastiina Ahola, Heikki Rantala and Anne Kauppala: OperaSampo – Opera and Music Theatre Performances in Finland 1830–1960 on the Semantic Web. Proceedings of the ISWC 2023 Posters, Demos and Industry Tracks: From Novel Ideas to Industrial Practice co-located with 21st International Semantic Web Conference (ISWC 2023) (Irini Fundulaki, Kouji Kozaki, Jose Manuel Gomez-Perez and Daniel Garijo (eds.)), CEUR Workshop Proceedings, November, 2023. bib pdf
Annastiina Ahola, Eero Hyvönen, Heikki Rantala and Anne Kauppala: Publishing and studying historical opera and music theatre performances on the Semantic Web: case OperaSampo 1830–1960. Proceedings of SWODCH 2023. Semantic Web and Ontology Design for Cultural Heritage. Co-located with the 22nd International Semantic Web Conference (ISWC 2023) in Athens, Greece, CEUR Workshop Proceedings, Vol-3540, November, 2023. bib pdf link
Heikki Rantala, Annastiina Ahola, Esko Ikkala and Eero Hyvönen: How to create easily a data analytic semantic portal on top of a SPARQL endpoint: introducing the configurable Sampo-UI framework. VOILA! 2023 Visualization and Interaction for Ontologies, Linked Data and Knowledge Graphs 2023, CEUR Workshop Proceedings, Vol. 3508, October, 2023. bib pdf link
Eero Hyvönen, Patrik Boman, Heikki Rantala, Annastiina Ahola and Petri Leskinen: Promootiosampo - Helsingin yliopiston filosofisen tiedekunnan 100 promootiota 1643-2023 semanttisessa webissä. Aalto-yliopisto ja Helsingin yliopisto, Semanttisen laskennan tutkimusryhmä (SeCo), October, 2023. Artikkelin käsikirjoitus, arvioitavana. bib pdf
Muhammad Faiz Wahjoe: Advancing Disambiguation of Actors Against Multiple Linked Open Data Sources. MSc Thesis (in English), Aalto University, Department of Computer Science, September, 2023. bib link
Petri Leskinen and Eero Hyvönen: Biographical and Prosopographical Analyses of Finnish Academic People 1640–1899 Based on Linked Open Data. Biographical Data in a Digital World 2022 (BD 2022), Tokyo, Proceedings, accepted, August, 2023. Forth-coming. bib pdf
Eero Hyvönen, Laura Sinikallio, Petri Leskinen, Senka Drobac, Rafael Leal, Matti La Mela, Jouni Tuominen, Henna Poikkimäki and Heikki Rantala: Plenary Speeches of the Parliament of Finland as Linked Open Data and Data Services. Joint Proceedings of the Second International Workshop on Knowledge Graph Generation From Text and the First International BiKE Challenge co-located with 20th Extended Semantic Conference (ESWC 2023), pp. 1-20, CEUR Workshop Proceedings, Vol. 3447, August, 2023. bib pdf link
Mikko Koho and Eero Hyvönen: Studying Occupations and Social Measures of Perished Soldiers in WarSampo Linked Open Data. Biographical Data in a Digital World 2022 (BD 2022), Tokyo, CEUR Workshop Sproceedings, August, 2023. bib pdf
Annastiina Ahola and Eero Hyvönen: Visualizing Literary Linked Data for Public Library Users in the New User Interface for BookSampo – Finnish Fiction Literature on the Semantic Web. VOILA! 2023 Visualization and Interaction for Ontologies, Linked Data and Knowledge Graphs 2023, CEUR Workshop Proceedings, Vol. 3508, July, 2023. bib pdf link
Eero Hyvönen: Creating and Using a Linked Open Ontology and Data Infrastructure for Digital Humanities in Finland: Lessons Learned 2003-2023. June, 2023. Under review. bib pdf
Eero Hyvönen: Creating and Using a National Linked Open Data Infrastructure for Cultural Heritage Applications and Digital Humanities Research: Lessons Learned. DARIAH Annual Event 2023, Budapest, Hungary, abstracts of papers, DARIAH-EU, June, 2023. bib link
Eero Hyvönen, Petri Leskinen and Heikki Rantala: Integrating Faceted Search with Data Analytic Tools in the User Interface of ParliamentSampo - Parliament of Finland on the Semantic Web. The Semantic Web: ESWC 2023 Satellite Events, pp. 16-21, Sringer-Verlag, June, 2023. bib pdf
Annastiina Ahola, Telma Peura and Eero Hyvönen: Interfacing the BookSampo Knowledge Graph of Finnish Literature for Data Analyses in Digital Humanities. DARIAH Annual Event 2023, poster paper, DARIAH-EU, June, 2023. bib link
Annastiina Ahola, Eero Hyvönen and Heikki Rantala: A User Interface Model for Digital Humanities Research: Case BookSampo – Finnish Fiction Literature on the Semantic Web. Proceedings of ESWC 2023, poster and demo papers, Springer-Verlag, June, 2023. bib pdf
Annastiina Ahola, Eero Hyvönen and Heikki Rantala: A User Interface Model for Digital Humanities Research: Case BookSampo – Finnish Fiction Literature on the Semantic Web. Proceedings of ESWC 2023, poster and demo papers, Springer-Verlag, June, 2023. bib
Matias Frosterus: Building Ontology and Data Infrastructure for Semantic Web Applications. Dissertation, Aalto University, School of Science, Department of Computer Science, April, 2023. bib link
Frida Erhnsten, Eljas Oksanen, Heikki Rantala and Eero Hyvönen: DigiNUMA ja Rahasampo – uusi digitaalinen palvelu rahalöydöistä kiinnostuneille. Numismaattinen aikakausilehti, April, 2023. bib pdf link
Matthias Schlögl, Joonas Kesäniemi, Jouni Tuominen, Victor de Boer, Go Sugimoto and Carla Ebel: Dos and Don’ts of Building a Pan-European Biographical Knowledge Graph: Statistical Analysis of the InTaVia-Platform. Digital Humanities in the Nordic and Baltic Countries Seventh Conference (DHNB 2023), Book of Abstracts (Sofie Gilbert and Annika Rockenberger (eds.)), pp. 106, University of Oslo Library, Oslo, Norway, March, 2023. bib link
Eero Hyvönen: How to Create a National Cross-domain Ontology and Linked Data Infrastructure and Use It on the Semantic Web. Programming and Data Infrastructure in Digital Humanities, Book of Abstracts, pp. 7, High Performance Computing Centre, University of Évora, Portugal, March, 2023. bib link
Joonas Kesäniemi, Matthias Schlögl, Jouni Tuominen, Victor de Boer and Go Sugimoto: Towards Reusable Aggregated Biographical Research Data: Provenance and Versioning in the InTaVia Knowledge Graph. Digital Humanities in the Nordic and Baltic Countries Seventh Conference (DHNB 2023), Book of Abstracts (Sofie Gilbert and Annika Rockenberger (eds.)), pp. 117, University of Oslo Library, Oslo, Norway, March, 2023. bib link
Minna Tamper, Laura Sinikallio, Jouni Tuominen and Eero Hyvönen: Transforming Linguistically Annotated Finnish Parliamentary Debates Into the Parla-CLARIN Format. Digital Humanities in the Nordic and Baltic Countries Seventh Conference (DHNB 2023), Book of Abstracts (Sofie Gilbert and Annika Rockenberger (eds.)), pp. 118, University of Oslo Library, Oslo, Norway, March, 2023. bib link
Annastiina Ahola: Developing a tool for information retrieval and research purposes utilizing BookSampo data. MSc Thesis (in English), Aalto University, Department of Computer Science, February, 2023. bib pdf link
Henna Poikkimäki: Eduskunnan täysistuntojen puheenvuorojen henkilömainintoihin perustuvien verkostoiden analyysi. MSc Thesis (in Finnish), Aalto University, Department of Computer Science, February, 2023. bib pdf
Minna Tamper: From Text to Knowledge: Methods, Tools, and Applications for Digital Humanities Based on Linked Data. Dissertation (in English), Aalto University, Department of Computer Science, February, 2023. bib pdf link
Eero Hyvönen, Petri Leskinen, Laura Sinikallio, Senka Drobac, Rafael Leal, Matti La Mela, Jouni Tuominen, Henna Poikkimäki and Heikki Rantala: ParliamentSampo Infrastructure for Publishing the Plenary Speeches and Networks of Politicians of the Parliament of Finland as Open Data Services. Aalto University, Dept. of Computer Science, February, 2023. Paper published at the publication event of the ParliamentSampo data service and portal. bib pdf
Telma Peura: Suomeksi yli rajojen. Kvantitatiivinen tutkimus suomenkielisten romaanien monimuotoisuudesta 1970-2020. MSc Thesis (in Finnish), University of Helsinki, Department of Digital Humanities, Helsinki Centre for Digital Humanities (HELDIG), January, 2023. bib pdf link
Minna Tamper, Petri Leskinen, Eero Hyvönen, Risto Valjus and Kirsi Keravuori: Analyzing Biography Collection Historiographically as Linked Data: Case National Biography of Finland. Semantic Web – Interoperability, Usability, Applicability, vol. 14, no. 2, pp. 385-419, IOS Press, 2023. bib pdf link
Eero Hyvönen: Digital Humanities on the Semantic Web: Sampo Model and Portal Series. Semantic Web – Interoperability, Usability, Applicability, vol. 14, no. 4, pp. 729-744, IOS Press, 2023. bib pdf link
2022
Eljas Oksanen, Heikki Rantala, Jouni Tuominen, Michael Lewis, David Wigg-Wolf, Frida Ehrnsten and Eero Hyvönen: Digital Humanities Solutions for Pan-European Numismatic and Archaeological Heritage Based on Linked Open Data. DHNB 2022 The 6th Digital Humanities in Nordic and Baltic Countries Conference, pp. 352-360, CEUR Workshop Proceedings, Vol. 3232, 2022. bib pdf link
Paul Groth, Maria-Esther Vidal, Fabian M. Suchanek, Pedro A. Szekely, Pavan Kapanipathi, Catia Pesquita, Hala Skaf-Molli and Minna Tamper (eds.): The Semantic Web - 19th International Conference, ESWC 2022, Hersonissos, Crete, Greece, May 29 - June 2, 2022, Proceedings. Lecture Notes in Computer Science, vol. 13261, Springer, 2022. bib pdf link
Paul Groth, Anisa Rula, Jodi Schneider, Ilaria Tiddi, Elena Simperl, Panos Alexopoulos, Rinke Hoekstra, Mehwish Alam, Anastasia Dimou and Minna Tamper (eds.): The Semantic Web: ESWC 2022 Satellite Events - Hersonissos, Crete, Greece, May 29 - June 2, 2022, Proceedings. Lecture Notes in Computer Science, vol. 13384, Springer, 2022. bib pdf link
Toby Burrows, Laura Cleaver, Doug Emery, Eero Hyvönen, Mikko Koho, Lynn Ransom, Emma Thomson and Hanno Wijsman: Medieval manuscripts and their migrations: Using SPARQL to investigate the research potential of an aggregated Knowledge Graph. Digital Medievalist, vol. 15, 2022. bib pdf link
Anna Wessman and Eljas Oksanen: Metal-detecting data as citizen science archaeology. Odes to Mika. Professor Mika Lavento s Festschrift as he turns 60 years old (Petri Halinen, Volker Heyd and Kristiina Mannermaa (eds.)), pp. 293-302, The Archaeological Society of Finland, 2022. bib
Telma Peura, Petri Leskinen and Eero Hyvönen: What Linked Data Can Tell about Geographical Trends in Finnish Fiction Literature - Using the BookSampo Knowledge Graph in Digital Humanities. 2022. Abstract under peer review. bib
Bernardo S. Buarque, Aline Deicke, Malte Doehne, Martin Düring, Heiner Fangerau, Catherine Herfeld, Charles van den Heuvel, Eero Hyvönen, Roberto Lalli, Malte Vogl, Lea Weiß, Dirk Wintergrün: White Paper of the ModelSEN Workshop (April 2022). October, 2022. bib link
Arttu Oksanen, Eero Hyvönen, Minna Tamper, Jouni Tuominen, Henna Ylimaa, Katja Löytynoja, Matti Kokkonen and Aki Hietanen: An Anonymization Tool for Open Data Publication of Legal Documents. AI4LEGAL-KGSUM 2022: Artificial Intelligence Technologies for Legal Documents and Knowledge Graph Summarization 2022, vol. 3257, pp. 12-21, CEUR Workshop Proceedings, August, 2022. bib pdf link
Henna Poikkimäki, Petri Leskinen, Minna Tamper and Eero Hyvönen: Analyses of Networks of Politicians Based on Linked Data: Case ParliamentSampo - Parliament of Finland on the Semantic Web. New Trends in Database and Information Systems, pp. 585-592, Springer International Publishing, August, 2022. bib pdf link
Eero Hyvönen, Minna Tamper, Esko Ikkala, Mikko Koho, Rafael Leal, Joonas Kesäniemi, Arttu Oksanen, Jouni Tuominen and Aki Hietanen: LawSampo Portal and Data Service for Publishing and Using Legislation and Case Law as Linked Open Data on the Semantic Web. AI4LEGAL-KGSUM 2022: Artificial Intelligence Technologies for Legal Documents and Knowledge Graph Summarization 2022, vol. 3257, pp. 41-50, CEUR Workshop Proceedings, August, 2022. bib pdf link
Joonas Kesäniemi, Mikko Koho, Esko Ikkala and Eero Hyvönen: Using Wikibase for Managing Cultural Heritage Linked Open Data Based on CIDOC CRM. New Trends in Database and Information Systems, pp. 542-549, Springer International Publishing, August, 2022. bib pdf link
Mikko Koho, L. P. Coladangelo, Lynn Ransom and Doug Emery: A Wikibase Model for Premodern Manuscript Metadata Harmonization, Linked Data Integration, and Discovery. August, 2022. Submitted. bib
Angel Daza, Antske Fokkens, Richard Hadden, Eero Hyvönen, Mikko Koho and Eveline Wandl-Vogt: Biographical Data in a Digital World 2022 (BD 2022) Workshop. Digital Humanities 2022, Conference Abstracts, July 25-29, 2022 Online, Tokyo. Japan, University of Tokyo, pp. 39-42, ADHO, July, 2022. bib link
Heikki Rantala, Eljas Oksanen and Eero Hyvönen: Harmonizing and Using Numismatic Linked Data in Digital Humanities Research and Application Development: Case DigiNUMA. The Semantic Web: ESWC 2022 Satellite Events, Lecture Notes in Computer Science, vol. 13384, pp. 26-30, Springer, July, 2022. bib pdf link
Eero Hyvönen, Esko Ikkala, Mikko Koho, and Rafael Leal, Heikki Rantala and Minna Tamper: How to Search and Contextualize Scenes inside Videos for Enriched Watching Experience: Case Stories of the Second World War Veterans. The Semantic Web: ESWC 2022 Satellite Events, Lecture Notes in Computer Science, vol. 13384, pp. 163-167, Springer, July, 2022. bib pdf link
Eero Hyvönen, Laura Sinikallio, Petri Leskinen, Matti La Mela, Jouni Tuominen, Kimmo Elo, Senka Drobac, Mikko Koho, Esko Ikkala, Minna Tamper, Rafael Leal and Joonas Kesäniemi: Linked Data Approach for Studying Parliamentary Speeches and Networks of Politicians in Finland 1907-2021 (long paper). Digital Humanities 2022, Conference Abstracts, July 25-29, 2022 Online, Tokyo. Japan, University of Tokyo, pp. 254-257, ADHO, July, 2022. bib link
Mikko Koho, Esko Ikkala and Eero Hyvönen: Reassembling the Lives of Finnish Prisoners of the Second World War on the Semantic Web. Proceedings of the Third Conference on Biographical Data in a Digital World (BD 2019), pp. 31-39, CEUR Workshop Proceedings, June, 2022. bib pdf link
This paper presents first results of a new, ninth application perspective for the semantic portal WarSampo - Finnish WW2 on the Semantic Web, based on a database of ca. 4450 Finnish prisoners of war in the Soviet Union. Our key idea is to reassemble the life of each prisoner of war by using Linked Data, based on information about the person in different data sources. Using the enriched aggregated data, a biographical global home page for each prisoner of war can be created, that is more complete than information in individual data sources. The application perspective is targeted to researchers of military history, to study and analyze the data in order to form new research questions or hypotheses, as well as to public in the large looking for information e.g., about their relatives that were captured as prisoners of war. Employing the faceted search of the application perspective, prosopographical research on subgroups of prisoners is possible.
Mikko Koho, Rafael Leal, Esko Ikkala, Minna Tamper, Heikki Rantala and Eero Hyvönen: Building Lightweight Ontologies for Faceted Search with Named Entity Recognition: Case WarMemoirSampo. Proceedings of the 1st International Workshop on Knowledge Graph Generation From Text and the 1st International Workshop on Modular Knowledge co-located with 19th Extended Semantic Conference (ESWC 2022) (Sanju Tiwari, Nandana Mihindukulasooriya, Francesco Osborne, Dimitris Kontokostas, Jennifer D’Souza and Mayank Kejriwal (eds.)), vol. 3184, pp. 19-35, CEUR Workshop Proceedings, May, 2022. International Knowledge Graph Generation From Text (TEXT2KG). bib pdf link
Javier Ureña-Carrion, Petri Leskinen, Jouni Tuominen, Charles van den Heuvel, Eero Hyvönen and Mikko Kivelä: Communication Now and Then: Analyzing the Republic of Letters as a Communication Network. Applied Network Science, vol. 7, May, 2022. bib pdf link
Matti La Mela, Fredrik Norén and Eero Hyvönen: Digital Parliamentary Data in Action (DiPaDA 2022): Introduction. Proceedings of the Digital Parliamentary Data in Action (DiPaDA 2022) Workshop, CEUR Workshop Proceedings, Vol. 3133, May, 2022. bib pdf link
Minna Tamper, Rafael Leal, Laura Sinikallio, Petri Leskinen, Jouni Tuominen and Eero Hyvönen: Extracting Knowledge from Parliamentary Debates for Studying Political Culture and Language. Proceedings of the 1st International Workshop on Knowledge Graph Generation From Text and the 1st International Workshop on Modular Knowledge co-located with 19th Extended Semantic Conference (ESWC 2022) (Sanju Tiwari, Nandana Mihindukulasooriya, Francesco Osborne, Dimitris Kontokostas, Jennifer D’Souza and Mayank Kejriwal (eds.)), vol. 3184, pp. 70-79, CEUR WS, May, 2022. International Workshop on Knowledge Graph Generation from Text (TEXT2KG 2022). bib pdf link
Eero Hyvönen, Laura Sinikallio, Petri Leskinen, Matti La Mela, Jouni Tuominen, Kimmo Elo, Senka Drobac, Mikko Koho, Esko Ikkala, Minna Tamper, Rafael Leal and Joonas Kesäniemi: Finnish Parliament on the Semantic Web: Using ParliamentSampo Data Service and Semantic Portal for Studying Political Culture and Language. Digital Parliamentary data in Action (DiPaDA 2022), Workshop at the 6th Digital Humanities in Nordic and Baltic Countries Conference, long paper, pp. 69-85, CEUR Workshop Proceedings, Vol. 3133, May, 2022. bib pdf link
Petri Leskinen, Javier Ureña-Carrion, Petri Leskinen, Jouni Tuominen, Mikko Kivelä and Eero Hyvönen: Knowledge Graphs and Data Services for Studying Historical Epistolary Data in Network Science on the Semantic Web. May, 2022. Submitted for review. bib pdf
Matti La Mela, Fredrik Norén and Eero Hyvönen (eds.): Proceedings of the Digital Parliamentary Data in Action (DiPaDA 2022) Workshop. CEUR Workshop Proceedings, vol. 3133, May, 2022. bib link
Heikki Rantala and Eero Hyvönen: Who is Related to What and How? Using Biographical Knowledge Graphs for Explainable Relational Search in BiographySampo. May, 2022. Submitted for review. bib pdf
Petri Leskinen, Heikki Rantala and Eero Hyvönen: Analyzing the Lives of Finnish Academic People 1640–1899 in Nordic and Baltic Countries: AcademySampo Data Service and Portal. DHNB 2022 The 6th Digital Humanities in Nordic and Baltic Countries Conference, CEUR Workshop Proceedings, long papers, Vol. 3232, March, 2022. bib pdf link
Jouni Tuominen, Mikko Koho, Ilona Pikkanen, Senka Drobac, Johanna Enqvist, Eero Hyvönen, Matti La Mela, Petri Leskinen, Hanna-Leena Paloposki and Heikki Rantala: Constellations of Correspondence: a Linked Data Service and Portal for Studying Large and Small Networks of Epistolary Exchange in the Grand Duchy of Finland. DHNB 2022 The 6th Digital Humanities in Nordic and Baltic Countries Conference, pp. 415-423, CEUR Workshop Proceedings, Vol. 3232, March, 2022. bib pdf link
Mikko Koho, Heikki Rantala and Eero Hyvönen: Digital Humanities and Military History: Analyzing Casualties of the WarSampo Knowledge Graph. DHNB 2022 The 6th Digital Humanities in Nordic and Baltic Countries Conference (Karl Berglund, Matti La Mela and Inge Zwart (eds.)), vol. 3232, CEUR Workshop Proceedings, Uppsala, Sweden, March, 2022. bib pdf link
Heikki Rantala, Esko Ikkala, Jouni Tuominen, Eero Hyvönen, Ville Rohiola, Eljas Oksanen and Mikko Koho: FindSampo: A Linked Data Based Service for Analyzing and Disseminating Archaeological Finds. 6th Digital Humanities in Nordic and Baltic Countries Conference, poster paper, book of abstracts, pp. 118-119, March, 2022. bib link
Pere Brunet, Livio de Luca, Eero Hyvönen, Adeline Joffres, Peter Plassmayer, Martijn Pronk, Roberto Scopigno and Gabor Sonkoly: Report on a European Collaborative Cloud for Cultural Heritage. Ex-ante Impact Assessment. European Commission, Directorate-general for Research and Innovation, March, 2022. 108 pp. bib pdf link
Arttu Oksanen, Minna Tamper, Jouni Tuominen, Aki Hietanen and Eero Hyvönen: A Tool for Pseudonymization of Textual Documents for Digital Humanities Research and Publication. 6th Digital Humanities in Nordic and Baltic Countries Conference, poster paper, book of abstracts, pp. 107-108, March, 2022. bib pdf
Joonas Kesäniemi, Mikko Koho, Esko Ikkala and Eero Hyvönen: Using Wikibase for Managing Cultural Heritage Linked Open Data Based on CIDOC CRM. 6th Digital Humanities in Nordic and Baltic Countries Conference, poster paper, pp. 74-75, March, 2022. Book of Abstracts. bib link
Rafael Leal, Heikki Rantala, Mikko Koho, Esko Ikkala, Markus Merenmies and Eero Hyvönen: WarMemoirSampo: A Semantic Portal for War Veteran Interview Videos. DHNB 2022 The 6th Digital Humanities in Nordic and Baltic Countries Conference, CEUR Workshop Proceedings, long papers, Vol. 3232, March, 2022. bib pdf link
Laura Sinikallio: Eduskunnan täysistuntojen pöytäkirjojen muuntaminen semanttiseksi dataksi ja julkaiseminen verkkopalveluna (Transformation of the Debates of the Parliament of Finland into Semantic Data and a Data Service. (in Finnish), University of Helsinki, Department of Computer Science, February, 2022. MSc Thesis. bib pdf link
Toby Burrows, Laura Cleaver, Doug Emery, Mikko Koho, Lynn Ransom and Emma Thomson: Exploring a large graph of historical objects: the Mapping Manuscript Migrations knowledge graph. February, 2022. Graphs and Networks in the Humanities 2022 conference, extended abstract. bib pdf
Heikki Rantala, Ilkka Jokipii, Esko Ikkala and Eero Hyvönen: WarVictimSampo 1914–1922: a National War Memorial on the Semantic Web for Digital Humanities Research and Applications. ACM Journal on Computing and Cultural Heritage, vol. 15, no. 1, ACM, Assoc. of Computing Machinery, February, 2022. bib pdf link
Esko Ikkala, Eero Hyvönen, Heikki Rantala and Mikko Koho: Sampo-UI: A Full Stack JavaScript Framework for Developing Semantic Portal User Interfaces. Semantic Web – Interoperability, Usability, Applicability, vol. 13, no. 1, pp. 69-84, January, 2022. Online version published in 2021, print version in 2022. bib pdf link
Eero Hyvönen, Annastiina Ahola and Esko Ikkala: BookSampo Fiction Literature Knowledge Graph Revisited: Building a Faceted Search Interface with Seamlessly Integrated Data-analytic Tools. Theory and Practice of Digital Libraries (TDPL 2022), Accelerating Innovations Track, Padova, Italy, pp. 506–511, Springer, 2022. bib pdf link
Minna Tamper, Jouni Tuominen and Eero Hyvönen: Extending the Finnish Linked Data Infrastructure with Natural Language Processing Services in FIN-CLARIAH. DHNB 2022 The 6th Digital Humanities in Nordic and Baltic Countries Conference, pp. 443-446, CEUR Workshop Proceedings, Vol. 3232, 2022. bib pdf link
Heikki Rantala, Esko Ikkala, Ville Rohiola, Mikko Koho, Jouni Tuominen, Eljas Oksanen, Anna Wessman and Eero Hyvönen: FindSampo: A Linked Data Based Portal and Data Service for Analyzing and Disseminating Archaeological Object Finds. The Semantic Web: ESWC 2022, Lecture Notes in Computer Science, vol. 13261, pp. 478-494, Springer, 2022. bib pdf link
Eero Hyvönen, Petri Leskinen, Minna Tamper, Heikki Rantala, Esko Ikkala, Jouni Tuominen and Kirsi Keravuori: Linked Data – A Paradigm Shift for Publishing and Using Biography Collections on the Semantic Web. Proceedings of the Third Conference on Biographical Data in a Digital World (BD 2019), pp. 16-23, CEUR-WS Proceedings, vol. 3152, 2022. bib pdf link
List of publications of SeCo Group in BibTex. 2022. bib
2021
Eero Hyvönen, Esko Ikkala, Mikko Koho, Jouni Tuominen, Toby Burrows, Lynn Ransom and Hanno Wijsman: Mapping Manuscript Migrations on the Semantic Web: A Semantic Portal and Linked Open Data Service for Premodern Manuscript Research. The Semantic Web - ISWC 2021, Lecture Notes in Computer Science, vol. 12922, pp. 615-630, Springer, 2021. bib pdf link
Rafael Leal, Joonas Kesäniemi, Mikko Koho and Eero Hyvönen: Relevance Feedback Search Based on Automatic Annotation and Classification of Texts. 3rd Conference on Language, Data and Knowledge (LDK 2021) (Dagmar Gromann, Gilles Sérasset, Thierry Declerck, John P. McCrae, Jorge Gracia, Julia Bosque-Gil, Fernando Bobillo and Barbara Heinisch (eds.)), Open Access Series in Informatics (OASIcs), vol. 93, pp. 18:1-18:15, Schloss Dagstuhl - Leibniz-Zentrum für Informatik, 2021. bib pdf link
Eero Hyvönen, Petri Leskinen, Minna Tamper, Heikki Rantala, Esko Ikkala, Jouni Tuominen and Kirsi Keravuori: Biografiasampo yhdistää ja rikastaa suomalaiset elämäkerrat linkitettynä datana semanttisessa webissä (Biographysampo links and enriches Finnish biographies as linked data on the Semantic Web. Informaatiotutkimus, vol. 40, no. 3, pp. 346-368, November, 2021. bib pdf link
Eero Hyvönen, Laura Sinikallio, Petri Leskinen, Senka Drobac, Jouni Tuominen, Kimmo Elo, Matti La Mela, Mikko Koho, Esko Ikkala, Minna Tamper, Rafael Leal and Joonas Kesäniemi: Parlamenttisampo: eduskunnan aineistojen linkitetyn avoimen datan palvelu ja sen käyttömahdollisuudet. Informaatiotutkimus, vol. 40, no. 3, pp. 216-244, November, 2021. bib pdf link
Eero Hyvönen: How to Create a National Cross-domain Ontology and Linked Data Infrastructure and Use It on the Semantic Web. Oct, 2021. Keynote presentation for the DCMI 2021 conference. bib pdf
The vision behind the Semantic Wed is to build a global Web of Data (Giant Global Graph, GGG) for machines to use: based on this an interoperable and intelligent transnational WWW for humans can be created cost-efficiently. This keynote presentation for the DCMI 2001 conference addresses this grand challenge on a national level, as in practice much of the data available are often related to each other within national cultures, borders, organizations, and are represented using national languages, metadata models, vocabularies, and local conventions. This presentation overviews and discusses the vision and lessons learned in Finland on developing and deploying a cross-domain national ontology service infrastructure and Linked Open Data (LOD) publishing framework, extending the classic 5-star model to a 7-star model for better data re-usability (6. star) and quality (7. star). To test and demonstrate the infrastructure, a series of semantic portals and LOD services have been created using the Sampo model that has evolved gradually in 2002--2021 through lessons learned when developing and publishing the Sampo series of systems, including MuseumFinland (2004), HealthFinland (2009), CultureSampo (2009), BookSampo (2011), WarSampo (2015), BiographySampo (2018), NameSampo (2019), WarWictimSampo (2019), Mapping Manuscript Migrations (2020), AcademySampo (2021), as well as FindSampo, Law\-Sampo, and ParliamentSampo underway. These systems cover a wide range of application domains and have attracted up to millions of users on the Semantic Web depending on the application, suggesting feasibility of the proposed model. This work shows a shift of focus in research on semantic portals from data aggregation and exploration systems (1. generation systems) to systems supporting research with data analytic tools (2. generation systems), and finally to automatic knowledge discovery and Artificial Intelligence (3. generation systems).
Petri Leskinen and Eero Hyvönen: Reconciling and Using Historical Person Registers as Linked Open Data in the AcademySampo Knowledge Graph. Proceedings of the 20th International Semantic Web Conference (ISWC 2021), Springer, October, 2021. bib pdf link
Petri Leskinen, Eero Hyvönen and Jouni Tuominen: Sparql2GraphServer: a Server-side Tool for Extracting Networks from Linked Data for Data Analysis. ISWC-Posters-Demos-Industry 2021 International Semantic Web Conference (ISWC) 2021: Posters, Demos, and Industry Tracks, CEUR Workshop Proceedings, Oct, 2021. bib pdf link
Heikki Rantala, Esko Ikkala, Mikko Koho, Jouni Tuominen, Ville Rohiola and Eero Hyvönen: Using FindSampo Linked Open Data Service and Portal for Spatio-temporal Data Analysis of Archaeological Finds in Digital Humanities. ISWC-Posters-Demos-Industry 2021 International Semantic Web Conference (ISWC) 2021: Posters, Demos, and Industry Tracks, CEUR Workshop Proceedings, Oct, 2021. bib pdf link
Petri Leskinen and Eero Hyvönen: Using the AcademySampo Portal and Data Service for Biographical and Prosopographical Research in Digital Humanities. ISWC-Posters-Demos-Industry 2021 International Semantic Web Conference (ISWC) 2021: Posters, Demos, and Industry Tracks, CEUR Workshop Proceedings, Oct, 2021. bib pdf link
Minna Tamper, Eero Hyvönen and Petri Leskinen: Visualizing and Analyzing Networks of Named Entities in Biographical Dictionaries for Digital Humanities Research. Proceedings of the 20th International Conference on Computational Linguistics and Intelligent Text Processing (CICling 2019), Springer-Verlag, October, 2021. Forth-coming. bib pdf
This paper shows how named entity extraction and networkanalysis can be used to examine biographies individually and in groupsto aid historians in biographical and prosopographical research. For this purpose a reference network of 13 100 biographies in the collections ofthe Biographical Centre of the Finnish Literature Society was created, based on links between the biographies as well as automatically extracted named entities found in the texts. The data was published in a SPARQL endpoint as a Linked Data knowledge graph on top of which network analytic tools were created and analysis were done showing the usefulness of the approach in Digital Humanities. The reference graph has been utilized for network analysis to examine egocentric networks of individual persons as well as networks among groups of people in prosopography. The data and tools presented are in use since autumn 2018 in the semantic portal BiographySampo that has had tens of thousands of users.
Marcia Zeng, Chris Sula, Karen Gracy, Eero Hyvönen and Vania Mara Alves Lima: JASIST Special Issue on Digital Humanities (DH) Introduction. Journal of the Association for Information Scinece and Technology (JASIST), pp. 1-5, September, 2021. bib link
Marcia Zeng, Chris Sula, Karen Gracy, Eero Hyvönen and Vania Mara Alves Lima (eds.): Special Issue on Digital Humanities. Journal of the Association for Information Scinece and Technology (JASIST), September, 2021. Forth-coming. bib link
Eero Hyvönen, Heikki Rantala, Esko Ikkala, Mikko Koho, Jouni Tuominen, Babatunde Anafi, Suzie Thomas, Anna Wessman, Eljas Oksanen, Ville Rohiola, Jutta Kuitunen and Minna Ryyppö: Citizen Science Archaeological Finds on the Semantic Web: The FindSampo Framework. Antiquity, A Review of World Archaeology, vol. 95, no. 382, pp. e24, Cambridge University Press, August, 2021. bib pdf link
Laura Sinikallio, Senka Drobac, Minna Tamper, Rafael Leal, Mikko Koho, Jouni Tuominen, Matti La Mela and Eero Hyvönen: Plenary Debates of the Parliament of Finland as Linked Open Data and in Parla-CLARIN Markup. 3rd Conference on Language, Data and Knowledge, LDK 2021, Open Access Series in Informatics (OASIcs), vol. 93, pp. 8:1-8:17, Schloss Dagstuhl - Leibniz-Zentrum für Informatik GmbH, Zaragoza, Spain, August, 2021. bib pdf link
Toby Burrows, Doug Emery, Arthur Mitchell Fraas, Eero Hyvönen, Esko Ikkala, Mikko Koho, David Lewis, Andrew Morrison, Kevin Page, Lynn Ransom, Emma Cawfield Thomson, Jouni Tuominen, Athanasios Velios and Hanno Wijsman: A New Model for Manuscript Provenance Research: The Mapping Manuscripts Migrations Project. Manuscript Studies, vol. 6, no. 1, pp. 131-144, The University of Pennsylvania Press, Philadelphia, US, July, 2021. bib pdf link
Eero Hyvönen: Sammon taontaa semanttisessa webissä (Forging Sampos on the Semantic Web). Tekniikan Waiheita, vol. 39, no. 2, pp. 87-105, Tekniikan Historian Seura ry, July, 2021. bib pdf link
Toby Burrows, Laura Cleaver, Doug Emery, Mikko Koho, Lynn Ransom and Emma Thomson: Using SPARQL to investigate the research potential of an aggregated Linked Open Data dataset for the Mapping Manuscript Migrations project. July, 2021. Association for Computers and the Humanities (ACH) conference 2021 abstract. bib
Toby Burrows, Mikko Koho, Jouni Tuominen, Eero Hyvönen, Kevin Page, David Lewis, Doug Emery, Hanno Wijsman, Lynn Ransom and Emma Cawlfield Thomson: Modelling the History of Medieval and Renaissance Manuscripts for the Mapping Manuscript Migrations Portal. Data for History 2021: Modelling Time, Places, Agents, June, 2021. Abstract. bib link
Toby Burrows, Laura Cleaver, Doug Emery, Mikko Koho, Lynn Ransom and Emma Thomson: Using SPARQL to investigate the research potential of an aggregated Linked Open Data dataset: the Mapping Manuscript Migrations project. DH Benelux 2021 abstract, June, 2021. bib pdf link
Eero Hyvönen, Petri Leskinen, Heikki Rantala, Esko Ikkala and Jouni Tuominen: Akatemiasampo-portaali ja -datapalvelu henkilöiden ja henkilöryhmien historialliseen tutkimukseen (AcademySampo Portal and Data Service for Biographical and Prosopographical Research). Informaatiotutkimus, vol. 40, no. 2, pp. 28-56, May, 2021. bib pdf link
Heikki Rantala, Eero Hyvönen and Esko Ikkala: Creating the HISTO Ontology of Finnish History Events. Data for History 2021: Modelling Time, Places, Agents, May, 2021. Abstract. bib pdf link
Mikko Koho, Toby Burrows, Eero Hyvönen, Esko Ikkala, Kevin Page, Lynn Ransom, Jouni Tuominen, Doug Emery, Mitch Fraas, Benjamin Heller, David Lewis, Andrew Morrison, Guillaume Porte, Emma Thomson, Athanasios Velios and Hanno Wijsman: Harmonizing and Publishing Heterogeneous Pre-Modern Manuscript Metadata as Linked Open Data. Journal of the Association for Information Science and Technology (JASIST), vol. 73, no. 2, pp. 240-257, May, 2021. bib pdf link
Manuscripts are a crucial form of evidence for research into all aspects of premodern European history and culture, and there are numerous databases devoted to describing them in detail. This descriptive information, however, is typically available only in separate data silos based on incompatible data models and user interfaces. As a result, it has been difficult to study manuscripts comprehensively across these various platforms. To address this challenge, a team of manuscript scholars and computer scientists worked to create “Mapping Manuscript Migrations” (MMM), a semantic portal, and a Linked Open Data service. MMM stands as a successful proof of concept for integrating distinct manuscript datasets into a shared platform for research and discovery with the potential for future expansion. This paper will discuss the major products of the MMM project: a unified data model, a repeatable data transformation pipeline, a Linked Open Data knowledge graph, and a Semantic Web portal. It will also examine the crucial importance of an iterative process of multidisciplinary collaboration embedded throughout the project, enabling humanities researchers to shape the development of a digital platform and tools, while also enabling the same researchers to ask more sophisticated and comprehensive research questions of the aggregated data.
Eero Hyvönen: Löytösampo: kansalaisten arkeologiset löydöt semanttisessa webissä (FindSampo: Archaeological Finds on the Semantic Web). May, 2021. Järjestelmän lyhyt esittely. bib pdf
Kimmo Kettunen and Matti La Mela: Semantic tagging and the Nordic tradition of Everyman’s rights. Digital Scholarship in the Humanities (DSH), pp. preprint, Oxford University Press, April, 2021. Accepted. bib pdf
This article uses semantic tagging to analyse the Nordic concept of everyman’s rights (a right of public access to nature) in protocols of the Finnish parliament. In the analysis, we use a novel tool, a lexical semantic tagger for Finnish (FiST), which is used to tag key discussions about everyman’s rights in the Finnish parliament. The article has two contributions: first, it presents a method which combines semantic tagging and similarity analysis of corpora (keyness) for studying the formation of political concepts in large textual data. Second, it sheds light on the Nordic access rights and the underlying customary everyman’s rights. Despite its central role in public debate, the history of the concept has not been well researched. Our analysis shows that the legislative context could be clearly detected with our approach, and that the method allowed us to describe shifts in the meaning of everyman’s rights in the legislative discussion.
Eero Hyvönen and Heikki Rantala: Knowledge-based Relational Search in Cultural Heritage Linked Data. Digital Scholarship in the Humanities (DSH), vol. 16, pp. ii155-ii164, Oxford University Press, March, 2021. bib pdf link
This paper presents a new knowledge-based approach for finding serendipitous semantic relations between resources in a knowledge graph. The idea is to characterize the notion of “interesting connection” in terms of generic ontological explanation patterns that are applied to an underlying linked data repository to instantiate connections. In this way, 1) semantically uninteresting connections can be ruled out effectively, and 2) natural language explanations about the connections can be created for the end-user. The idea has been implemented and tested based on a knowledge graph of biographical data extracted from the life stories of 13,144 prominent historical persons in Finland, enriched by data linking to collection databases of museums, libraries, and archives. The demonstrator is in use as part of the BiographySampo portal of interlinked biographies.
Babatunde Anafi: Representing and Using Temporal Linked Data in Semantic Cultural Heritage Portals. MSc Thesis (in English), University of Helsinki, Department of Computer Science, March, 2021. bib link
Eero Hyvönen: Digitaalisten ihmistieteiden keskus HELDIG profiloi Helsingin yliopiston humanistisia aloja (Helsinki Centre for Digital Humanitites HELDIG Profiled Areas in Humanities at the University of Helsinki). Tieteessä tapahtuu, no. 1, Tieteellisten seurain valtuuskunta, February, 2021. bib pdf link
Antonis Bikakis, Eero Hyvönen, Stéphane Jean, Béatrice Markhoff and Alessandro Mosca (eds.): Special Issue on Semantic Web for Cultural Heritage. Semantic Web – Interoperability, Usability, Applicability, vol. 12, no. 2, January, 2021. bib pdf link
Mikko Koho, Esko Ikkala, Petri Leskinen, Minna Tamper, Jouni Tuominen and Eero Hyvönen: WarSampo Knowledge Graph: Finland in the Second World War as Linked Open Data. Semantic Web – Interoperability, Usability, Applicability, vol. 12, no. 2, pp. 265-278, January, 2021. bib pdf link
The Second World War (WW2) is arguably the most devastating catastrophe of human history, a topic of great interest to not only researchers but the general public. However, data about the Second World War is heterogeneous and distributed in various organizations and countries making it hard to utilize. In order to create aggregated global views of the war, a shared ontology and data infrastructure is needed to harmonize information in various data silos. This makes it possible to share data between publishers and application developers, to support data analysis in Digital Humanities research, and to develop data-driven intelligent applications. As a first step towards these goals, this article presents the WarSampo knowledge graph (KG), a shared semantic infrastructure, and a Linked Open Data (LOD) service for publishing data about WW2, with a focus on Finnish military history. The shared semantic infrastructure is based on the idea of representing war as a spatio-temporal sequence of events that soldiers, military units, and other actors participate in. The used metadata schema is an extension of CIDOC CRM, supplemented by various military historical domain ontologies. With an infrastructure containing shared ontologies, maintaining the interlinked data brings upon new challenges, as one change in an ontology can propagate across several datasets that use it. To support sustainability, a repeatable automatic data transformation and linking pipeline has been created for rebuilding the whole WarSampo KG from the individual source datasets. The WarSampo KG is hosted on a data service based on W3C Semantic Web standards and best practices, including content negotiation, SPARQL API, download, automatic documentation, and other services supporting the reuse of the data. The WarSampo KG, a part of the international LOD Cloud and totalling ca. 14 million triples, is in use in nine end-user application views of the WarSampo portal, which has had over 400 000 end users since its opening in 2015.
Heikki Rantala and Eero Hyvönen: Knowledge-based Approach to Relational Search in Knowledge Graphs with Explanations: Case BiographySampo – Biographies on the Semantic. 2021. Submitted for evaluation. bib pdf
Toby Burrows, Mitch Fraas, Eero Hyvönen, Esko Ikkala, Mikko Koho, David Lewis, Andrew Morrison, Kevin Page, Lynn Ransom, Emma Thomson, Jouni Tuominen, Athanasios Velios and Hanno Wijsman: Linking Data to Explore the History of Medieval and Renaissance Manuscripts: the Mapping Manuscript Migrations Project. Book of abstracts: 2nd International Conference of the European Association for Digital Humanities (EADH), Krasnoyarsk, Russia, 21 - 25 September 2021, 2021. bib pdf
Petri Leskinen, Eero Hyvönen and Jouni Tuominen: Members of Parliament in Finland Knowledge Graph and Its Linked Open Data Service. Further with Knowledge Graphs. Proceedings of the 17th International Conference on Semantic Systems, 6-9 September 2021, Amsterdam, The Netherlands, pp. 255-269, IOS Press, 2021. bib pdf link
2020
Minna Tamper, Arttu Oksanen, Jouni Tuominen, Aki Hietanen and Eero Hyvönen: Automatic Annotation Service APPI: Named Entity Linking in Legal Domain. The Semantic Web: ESWC 2020 Satellite Events (Harth, Andreas, Presutti, Valentina, Troncy, Raphaël, Acosta, Maribel, Polleres, Axel, Fernández, Javier D., Xavier Parreira, Josiane, Hartig, Olaf, Hose, Katja and Cochez, Michael (eds.)), Lecture Notes in Computer Science, vol. 12124, pp. 208-213, Springer-Verlag, 2020. bib pdf link
Eero Hyvönen, Minna Tamper, Esko Ikkala, Sami Sarsa, Arttu Oksanen, Jouni Tuominen and Aki Hietanen: Publishing and Using Legislation and Case Law as Linked Open Data on the Semantic Web. The Semantic Web: ESWC 2020 Satellite Events (Harth, Andreas, Presutti, Valentina, Troncy, Raphaël, Acosta, Maribel, Polleres, Axel, Fernández, Javier D., Xavier Parreira, Josiane, Hartig, Olaf, Hose, Katja and Cochez, Michael (eds.)), Lecture Notes in Computer Science, vol. 12124, pp. 110-114, Springer-Verlag, 2020. bib pdf link
Sami Sarsa and Eero Hyvönen: Searching Case Law Judgements by Using Other Judgements as a Query. Artificial Intelligence and Natural Language. 9th Conference, AINL 2020, Helsinki, Finland, October 7–9, 2020 (Filchenkov A., Kauttonen J. and Pivovarova L. (eds.)), pp. 145-157, Springer-Verlag, 2020. bib pdf link
Eero Hyvönen: Semantic Sampo Portals for Digital Humanities Based on a National Linked Open Data Infrastructure. 2020. White paper, Aalto University, Semantic Computing Research Group (SeCo). bib pdf
Matti La Mela: Tracing the Emergence of Nordic Allemansrätten through Digitised Parliamentary Sources. Digital histories: Emergent approaches within the new digital history (Fridlund, Mats, Oiva, Mila and Paju, Petri (eds.)), pp. 181-197, Helsinki University Press, 2020. bib pdf link
Heikki Rantala, Esko Ikkala, Ilkka Jokipii, Mikko Koho, Jouni Tuominen and Eero Hyvönen: WarVictimSampo 1914–1922: A Semantic Portal and Linked Data Service for Digital Humanities Research on War History. The Semantic Web: ESWC 2020 Satellite Events (Harth, Andreas, Presutti, Valentina, Troncy, Raphaël, Acosta, Maribel, Polleres, Axel, Fernández, Javier D., Xavier Parreira, Josiane, Hartig, Olaf, Hose, Katja and Cochez, Michael (eds.)), Lecture Notes in Computer Science, vol. 12124, pp. 191-196, Springer-Verlag, 2020. bib pdf link
Rafael Leal: Unsupervised zero-shot classification of Finnish documents using pre-trained language models. (in English), University of Helsinki, Department of Digital Humanities, Helsinki Centre for Digital Humanities (HELDIG), December, 2020. MSc Thesis. bib pdf link
Babatunde Anafi, Mikko Koho and Eero Hyvönen: Temporal Visualization and Data Analysis of Archaeological Finds: Case FindSampo. Conference on Cultural Heritage and New Technologies (CHNT 25), Museum Stadt Archäologie Wien, Nov, 2020. Posters. bib pdf
Heikki Rantala, Ilkka Jokipii, Mikko Koho, Esko Ikkala, Jouni Tuominen and Eero Hyvönen: Building a Linked Open Data Portal of War Victims in Finland 1914-1922. DHN 2020 Digital Humanities in the Nordic Countries. Proceedings of the Digital Humanities in the Nordic Countries 5th Conference, pp. 310-317, CEUR Workshop Proceedings, vol. 2612, Riga, Latvia, October, 2020. bib pdf link
Kimmo Kettunen and Matti La Mela: Digging Deeper into the Finnish Parliamentary Protocols – Using a Lexical Semantic Tagger for Studying Meaning Change of Everyman’s Rights (allemansrätten). DHN 2020 Digital Humanities in the Nordic Countries. Proceedings of the Digital Humanities in the Nordic Countries 5th Conference, pp. 63-80, CEUR Workshop Proceedings, vol. 2612, Riga, Latvia, October, 2020. bib pdf link
Chris A. Sula, Kalani Craig, Michelle Dalmu, Alex Humphreys, Eero Hyvönen, Hannah L. Jacobs, Humphrey Keah, Joseph Kiplangat, Thea Lindquist, Nicholas Weber, Scott B. Weingart: Infrastructures of Digital Humanities. 83rd Association for Information Science and Technology (ASIS&T) Annual Meeting, proceedings, Association for Information Science and Technology, Silver Spring, Maryland, USA, October, 2020. bib pdf link
Mikko Koho, Petri Leskinen and Eero Hyvönen: Integrating Historical Person Registers as Linked Open Data in the WarSampo Knowledge Graph. Semantic Systems. In the Era of Knowledge Graphs. SEMANTiCS 2020 (Eva Blomqvist, Paul Groth, Victor de Boer, Tassilo Pellegrini, Mehwish Alam, Tobias Käfer, Peter Kieseberg, Sabrina Kirrane, Albert Meroño-Peñuela and Harshvardhan J. Pandit (eds.)), Lecture Notes in Computer Science, vol. 12378, pp. 118-126, Springer, Cham, Amsterdam, The Netherlands, October, 2020. bib pdf link
Semantic data integration from heterogeneous, distributed data silos enables Digital Humanities research and application development employing a larger, mutually enriched and interlinked knowledge graph. However, data integration is challenging, involving aligning the data models and reconciling the concepts and named entities, such as persons and places. This paper presents a record linkage process to reconcile person references in different military historical person registers with structured metadata. The information about persons is aggregated into a single knowledge graph. The process was applied to reconcile three person registers of the popular semantic portal WarSampo -- Finnish World War 2 on the Semantic Web . The registers contain detailed information about some 100,000 people and are individually maintained by domain experts. Thus, the integration process needs to be automatic and adaptable to changes in the registers. An evaluation of the record linkage results is promising and provides some insight into military person register reconciliation in general.
Eero Hyvönen: Linked Open Data Infrastructure for Digital Humanities in Finland. DHN 2020 Digital Humanities in the Nordic Countries. Proceedings of the Digital Humanities in the Nordic Countries 5th Conference, pp. 254-259, CEUR Workshop Proceedings, vol. 2612, Riga, Latvia, October, 2020. bib pdf link
Petri Leskinen and Eero Hyvönen: Linked Open Data Service about Historical Finnish Academic People in 1640–1899. DHN 2020 Digital Humanities in the Nordic Countries. Proceedings of the Digital Humanities in the Nordic Countries 5th Conference, pp. 284-292, CEUR Workshop Proceedings, vol. 2612, Riga, Latvia, October, 2020. bib pdf link
Toby Burrows, Antoine Brix, Douglas Emery, Arthur Mitchell Fraas, Eero Hyvönen, Esko Ikkala, Mikko Koho, David Lewis, Synnove Myking, Kevin Page, Lynn Ransom, Emma Cawlfield Thomson, Jouni Tuominen, Hanno Wijsman and Pip Wilcox: Linked Open Data Vocabularies and Identifiers for Medieval Studies. DHN 2020 Digital Humanities in the Nordic Countries. Proceedings of the Digital Humanities in the Nordic Countries 5th Conference, pp. 211-218, CEUR Workshop Proceedings, vol. 2612, Riga, Latvia, October, 2020. bib pdf link
Eero Hyvönen: Sampo Model and Semantic Portals for Digital Humanities on the Semantic Web. DHN 2020 Digital Humanities in the Nordic Countries. Proceedings of the Digital Humanities in the Nordic Countries 5th Conference, pp. 373-378, CEUR Workshop Proceedings, vol. 2612, Riga, Latvia, October, 2020. bib pdf link
Toby Burrows, Douglas Emery, Mitch Fraas, Eero Hyvönen, Esko Ikkala, Mikko Koho, David Lewis, Andrew Morrison, Kevin Page, Lynn Ransom, Emma Thomson, Jouni Tuominen, Athanasios Velios and Hanno Wijsman: Mapping Manuscript Migrations: Digging into Data for Researching the History and Provenance of Medieval and Renaissance Manuscripts: White Paper. August, 2020. bib pdf link
Pejam Hassanzadeh, Eero Hyvönen, Esko Ikkala, Jouni Tuominen, Suzie Thomas, Anna Wessman and Ville Rohiola: FindSampo Platform for Reporting and Studying Archaeological Finds Using Citizen Science. 3rd Workshop on Humanities in the Semantic Web (WHiSe 2020), pp. 33-40, CEUR Workshop Proceedings, vol. 2695, June, 2020. bib pdf link
Alex Kourijoki: Linkitetyn datan validointi ja korjaus. MSc Thesis (in Finnish), Aalto University, School of Science, Master’s Programme in Computer, Communication and Information Sciences, June, 2020. bib link
Toby Burrows, Douglas Emery, Arthur Mitchell Fraas, Eero Hyvönen, Esko Ikkala, Mikko Koho, David Lewis, Andrew Morrison, Kevin Page, Lynn Ransom, Emma Cawlfield Thomson, Jouni Tuominen, Athanasios Velios, and Hanno Wijsman: Mapping Manuscript Migrations Knowledge Graph: Data for Tracing the History and Provenance of Medieval and Renaissance Manuscripts. Journal of Open Humanities Data, vol. 6, pp. 3, June, 2020. bib pdf link
Minna Tamper, Petri Leskinen, Jouni Tuominen and Eero Hyvönen: Modeling and Publishing Finnish Person Names as a Linked Open Data Ontology. 3rd Workshop on Humanities in the Semantic Web (WHiSe 2020), pp. 3-14, CEUR Workshop Proceedings, vol. 2695, June, 2020. bib pdf link
Mikko Koho: Representing, Using, and Maintaining Military Historical Linked Data on the Semantic Web. Dissertation, Aalto University, School of Science, Department of Computer Science, May, 2020. bib pdf link
Eero Hyvönen: Tekoäly ja semanttinen web tarjoavat huikeat mahdollisuudet suurten aineistojen hallintaan. Kansallisarkiston strategia 2025, näkökulmia tulevaan (Jussi Nuorteva, Päivi Happonen (ed.)), pp. 20-21, Kansallisarkisto, Helsinki, May, 2020. bib link
Lora Aroyo, Franciska de Jong, Eero Hyvönen and Sara Tonelli: Web Semantics for Digital Humanities (editorial for a special issue). Web Semantics: Science, Services and Agents on the World Wide Web, Elsevier, May, 2020. bib pdf
Eero Hyvönen: Building and Using a National Linked Open Data Infrastructure for Digital Humanities: The Finnish Approach. Proceedings of the conferenve: Data for History 2020. Modelling Time, Places, Agents, Berlin, 2020. Accepted, conference postponed to 2021. bib pdf link
Eero Hyvönen: Using the Semantic Web in Digital Humanities: Shift from Data Publishing to Data-analysis and Serendipitous Knowledge Discovery. Semantic Web, vol. 11, no. 1, pp. 187-193, 2020. bib pdf link
2019
Howard Hotson, Thomas Wallnig, Jouni Tuominen, Eetu Mäkelä, and Eero Hyvönen: People. Reassembling the Republic of Letters in the Digital Age (H. Hotson and T. Wallnig (eds.)), pp. 119-136, Göttingen University Press, 2019. bib link
Eero Hyvönen, Ruth Ahnert, Sebastian E. Ahnert, Jouni Tuominen, Eetu Mäkelä, Miranda Lewis and Gertjan Filarski: Reconciling metadata. Reassembling the Republic of Letters in the Digital Age (H. Hotson and T. Wallnig (eds.)), pp. 223-235, Göttingen University Press, 2019. bib link
Arttu Oksanen, Jouni Tuominen, Eetu Mäkelä, Minna Tamper, Aki Hietanen and Eero Hyvönen: Semantic Finlex: Transforming, Publishing, and Using Finnish Legislation and Case Law As Linked Open Data on the Web. Knowledge of the Law in the Big Data Age (G. Peruginelli and S. Faro (eds.)), Frontiers in Artificial Intelligence and Applications, vol. 317, pp. 212-228, IOS Press, 2019. ISBN 978-1-61499-984-3 (print); ISBN 978-1-61499-985-0 (online). bib pdf link
Governments publish legislation and case law widely in print and on the Web. Such legal information is provided for human consumption, but the information is usually not available as data for algorithmic analysis and applications to use. However, this would be beneficial in many use cases, such as building more intelligent juridical online services and conducting research into legislation and legal practice. To address these needs, this Chapter presents Semantic Finlex, a national in-use data resource and service for publishing Finnish legislation and related case law as Linked Open Data for legal applications to use. The system transforms and interlinks on a regular basis data from the legacy legal database Finlex of the Ministry of Justice into Linked Open Data, based on the European standards ECLI and ELI. The published data is hosted on the 7-star Linked Data Finland service and SPARQL endpoint with a variety of related services available that ease data re-use. Rich Internet Applications using SPARQL for data access are presented as application demonstrators of the data service. In addition, this Chapter presents methods and tools under development to automatically annotate legal texts and to anonymize case law documents prior to their publication on the Web. Anonymization is necessary due to issues of data protection and privacy, and annotation is needed for semantic search and interlinking the documents. The automated approaches could significantly speed up the process and minimize costs of publishing legal documents as Linked Open Data.
Howard Hotson and Eero Hyvönen: Topics. Reassembling the Republic of Letters in the Digital Age (H. Hotson and T. Wallnig (eds.)), pp. 137-148, Göttingen University Press, 2019. bib link
Suzie Thomas, Anna Wessman, Esko Ikkala, Jouni Tuominen, Mikko Koho and Eero Hyvönen: (co-)Creating a Sustainable Platform for Finland’s Archaeological Chance Finds: The Story of SuALT. Digital Heritage and Archaeology in Practice (Ethan Watrall and Lynne Goldstein (eds.)), University Press of Florida, December, 2019. Accepted. bib pdf
Arttu Oksanen, Minna Tamper, Jouni Tuominen, Aki Hietanen and Eero Hyvönen: Anoppi: A Pseudonymization Service for Finnish Court Documents. Legal Knowledge and Information Systems. JURIX 2019: The Thirty-second Annual Conference (Araszkiewicz, M. and Rodríguez-Doncel, V. (eds.)), pp. 251-254, IOS Press, December, 2019. bib pdf
Pejam Hassanzadeh: FindSampo: A Citizen Science Platform for Archaeological Finds on the Semantic Web. MSc Thesis, Aalto University, School of Science, Finland, December, 2019. bib link
Kasper Apajalahti: Contributions to Self-Organizing Networks and Network Measurement Data Management. Dissertation, Aalto University, School of Science, Espoo, October, 2019. Aalto University publication series, Doctoral Disserations 193/2019. bib pdf link
Mikko Koho, Erkki Heino, Petri Leskinen, Esko Ikkala, Minna Tamper, Kasper Apajalahti, Jouni Tuominen, Eetu Mäkelä and Eero Hyvönen: WarSampo Knowledge Graph. Zenodo, October, 2019. Dataset. bib link
WarSampo Knowledge Graph includes harmonized data of different kinds concerning the Second World War in Finland, separated in different subgraphs representing events, actors, places, photographs, and other aspects and documentation of the war. The data covers the Winter War 1939-1940 against the Soviet attack, the Continuation War 1941-1944 where the occupied areas of the Winter War were temporarily regained, and the Lapland War 1944-1945, where the Finns pushed the German troops away from Lapland.
Minna Tamper, Arttu Oksanen, Jouni Tuominen, Aki Hietanen and Eero Hyvönen: Automatic Annotation Service: Utilizing a Named Entity Linking Tool in Legal Domain. September, 2019. Submitted. bib pdf
Eero Hyvönen: Helsinki Centre for Digital Humanities (HELDIG): Developing the Digital World Together. EuropaNow, Council for European Studies (CES), Columbia University, September, 2019. bib pdf link
Eero Hyvönen, Minna Tamper, Esko Ikkala, Sami Sarsa, Arttu Oksanen, Jouni Tuominen and Aki Hietanen: LawSampo: A Semantic Portal on a Linked Open Data Service for Finnish Legislation and Case Law. September, 2019. Submitted. bib pdf
Eero Hyvönen: Linked Data in Use: Sampo Portals on the Semantic Web. EuropaNow, Council for European Studies (CES), Columbia University, September, 2019. bib pdf link
Eero Hyvönen: National Linked Open Data Infrastructure for Digital Humanities. EuropaNow, Council for European Studies (CES), Columbia University, September, 2019. bib pdf link
Sami Sarsa: Information Retrieval with Finnish Case Law Embeddings. MSc Thesis (in Finnish), University of Helsinki, Department of Computer Science, August, 2019. bib pdf link
Anna Wessman, Suzie Thomas, Ville Rohiola, Mikko Koho, Esko Ikkala, Jouni Tuominen, Eero Hyvönen, Jutta Kuitunen, Helinä Parviainen, and Marianna Niukkanen: Citizen Science in Archaeology: Developing a Collaborative Web Service for Archaeological Finds in Finland. Transforming Heritage Practice in the 21st Century: Contributions from Community Archaeology (John Jameson and Sergiu Musteață (eds.)), pp. 337-352, Springer, July, 2019. bib pdf link
Agata Dominowska, Elsi Hyttinen, Peter Ivanics, Mikko Koho, Ilona Pikkanen and Risto Turunen: Hiding in Plain Sight: Poetry in Newspapers and How to Approach it. Human IT: Journal for Information Technology Studies as a Human Science, vol. 14, no. 2, pp. 145-171, University of Borås, July, 2019. bib link
Mikko Koho, Lia Gasbarra, Jouni Tuominen, Heikki Rantala, Ilkka Jokipii and Eero Hyvönen: AMMO Ontology of Finnish Historical Occupations. Proceedings of the First International Workshop on Open Data and Ontologies for Cultural Heritage (ODOCH 19) (Antonella Poggi (ed.)), vol. 2375, pp. 91-96, CEUR Workshop Proceedings, Rome, Italy, June, 2019. bib pdf link
This paper introduces AMMO Ontology of Finnish Historical Occupations. AMMO is based on thousands of occupation labels extracted from three Finnish military historical datasets of the early 20th century: the first consists of the ca. 40 000 war-related death records around the time of the Finnish Civil War (1914–1922); the second consists of the ca. 95 000 death records of Finnish soldiers in the Winter War and Continuation War (1939–1944); the third contains the ca. 4500 records of Finnish prisoners of war in the Soviet Union during the WW2. Our goal from a Digital Humanities perspective is to use AMMO to study military history and these datasets based on the occupation and social status of the soldiers. AMMO will also be used as a component for faceted search and semantic recommendation in two semantic portals for Finnish military history. AMMO is aligned with the international historical occupation classification HISCO and with a modern Finnish occupational classification for international and national interoperability. The ontology is published as Linked Open Data in an ontology service.
Lia Gasbarra, Mikko Koho, Ilkka Jokipii, Heikki Rantala and Eero Hyvönen: An Ontology of Finnish Historical Occupations. The Semantic Web: ESWC 2019 Satellite Events (Hitzler, Pascal, Kirrane, Sabrina, Hartig, Olaf, de Boer, Victor, Vidal, Maria-Esther, Maleshkova, Maria, Schlobach, Stefan, Hammar, Karl, Lasierra, Nelia, Stadtmüller, Steffen, Hose, Katja and Verborgh, Ruben (eds.)), Lecture Notes in Computer Science, pp. 64-68, Springer, Cham, Portoroz, Slovenia, June, 2019. bib pdf link
Historical datasets often impose the need to study groups of people based on occupation or social status. This paper presents first results in creating an ontology of historical Finnish occupations, AMMO, that enables selection of groups of people based on their occupation, occupational groups, or socioeconomic class. AMMO is linked to the international historical occupation classification HISCO and to a modern Finnish occupational classification for interoperability. AMMO will be used as a component in two semantic portals for Finnish war history.
Eero Hyvönen, Petri Leskinen, Minna Tamper, Heikki Rantala, Esko Ikkala, Jouni Tuominen and Kirsi Keravuori: BiographySampo - Publishing and Enriching Biographies on the Semantic Web for Digital Humanities Research. The Semantic Web. ESWC 2019 (Pascal Hitzler, Miriam Fernández, Krzysztof Janowicz, Amrapali Zaveri, Alasdair J.G. Gray, Vanessa Lopez, Armin Haller and Karl Hammar (eds.)), pp. 574-589, Springer-Verlag, June, 2019. bib pdf link
Petri Leskinen and Eero Hyvönen: Extracting Genealogical Networks of Linked Data from Biographical Texts. The Semantic Web: ESWC 2019 Satellite Events (Hitzler, P., Kirrane, S., Hartig, O., de Boer, V., Vidal, M.-E., Maleshkova, M., Schlobach, S., Hammar, K., Lasierra, N., Stadtmüller, S., Hose, K., Verborgh, R. (ed.)), pp. 121-125, Springer, June, 2019. bib pdf
Anna Wessman, Suzie Thomas, Ville Rohiola, Jutta Kuitunen, Esko Ikkala, Jouni Tuominen, Mikko Koho and Eero Hyvönen: A Citizen Science Approach to Archaeology: Finnish Archaeological Finds Recording Linked Open Database (SuALT). DHN 2019 Digital Humanities in Nordic Countries. Proceedings of the Digital Humanities in the Nordic Countries 4th Conference, pp. 469-478, CEUR Workshop Proceedings, Vol-2364, Copenhagen, Denmark, March, 2019. bib pdf link
Eero Hyvönen, Petri Leskinen, Minna Tamper, Heikki Rantala, Esko Ikkala, Jouni Tuominen and Kirsi Keravuori: Demonstrating BiographySampo in Solving Digital Humanities Research Problems in Biography and Prosopography. The Fourth Digital Humanities in the Nordic Countries 2019 (DHN2019), Book of Abstracts, University of Copenhagen, Copenhagen, Denmark, March, 2019. bib pdf link
Matti La Mela, Minna Tamper and Kimmo Kettunen: Finding Nineteenth-century Berry Spots: Recognizing and Linking Place Names in a Historical Newspaper Berry-picking Corpus. The Fourth Digital Humanities in the Nordic Countries 2019 (DHN2019), CEUR Workshop Proceedings, Copenhagen, Denmark, March, 2019. bib pdf link
Eero Hyvönen and Heikki Rantala: Knowledge-based Relation Discovery in Cultural Heritage Knowledge Graphs. DHN 2019 Digital Humanities in Nordic Countries. Proceedings of the Digital Humanities in the Nordic Countries 4th Conference, pp. 230-239, CEUR Workshop Proceedings, Vol-2364, Copenhagen, Denmark, March, 2019. bib pdf link
Eero Hyvönen, Esko Ikkala, Jouni Tuominen, Mikko Koho, Toby Burrows, Lynn Ransom and Hanno Wijsman: A Linked Open Data Service and Portal for Pre-modern Manuscript Research. DHN 2019 Digital Humanities in Nordic Countries. Proceedings of the Digital Humanities in the Nordic Countries 4th Conference, pp. 220-229, CEUR Workshop Proceedings, Vol-2364, Copenhagen, Denmark, March, 2019. bib pdf link
Eero Hyvönen: Historiallinen paikkatieto semanttisessa webissä: Biografiasampo. Positio, no. 1, Maanmittauslaitos, February, 2019. bib pdf
Heikki Rantala: Yhteyshaku semanttisessa webissä. MSc Thesis (in Finnish), University of Helsinki, Department of Computer Science, January, 2019. bib pdf
Tavallisessa haussa etsitään yksilöitä, kuten henkilöitä tai paikkoja. Joissain tilanteissa esimerkiksi historian tutkija voi olla kiinnostunut myös etsimään yhteyksiä henkilöiden ja paikkojen välillä. Tässä työssä esitetään metodi tällaisen yhteyshaun toteuttamiseksi käyttäen semanttisen webin sisältämää avointa dataa. Työssä muodostettiin graafi, joka sisältää kuvauksia Suomen kulttuu- rihistorian henkilöiden ja paikkojen välisistä kiinnostaviksi arvioiduista yhteyksistä. Graafi luotiin SPARQL CONSTRUCT -kyselyillä. Yhteyksien hakemista varten luotiin web-sovellus, joka hyödyntää fasettihakua. Tarvittavien SPARQL CONSTRUCT -kyselyjen luominen ei osoittautunut erityisen hankalaksi, mutta niiden soveltaminen yleisemmin eri aineistoihin vaatii jonkin verran työtä. Yhteyksien fasettihaku osoittautui mielenkiintoiseksi. Fasettihaku mahdollistaa haun tarkentamisen askel kerrallaan. Lisäksi yhteyksien suhteellisia määriä on mahdollista vertailla erilaisten rajausten mukaan. Tämä tarjoaa aineistoon uusia näkökulmia.
Eero Hyvönen and Heikki Rantala: Relational Search in Cultural Heritage Linked Data: A Knowledge-based Approach. Abstracts and Posters from the Digital Humanities 2019 conference, DataverseNL, Utrecht University, Utrecht, the Netherlands, 2019. bib link
2018
Petri Leskinen, Eero Hyvönen and Jouni Tuominen: Analyzing and Visualizing Prosopographical Linked Data Based on Biographies. Proceedings of the Second Conference on Biographical Data in a Digital World 2017 (BD2017), vol. 2119, pp. 39-44, CEUR Workshop Proceedings, Linz, Austria, 2018. bib pdf link
This paper shows how faceted search on biographical data can be utilized as a flexible basis for filtering target groups of people and, in particular, how generic data analysis and visualization tools can then be applied for solving prosopographical research questions based on the filtered data. This idea is demonstrated and evaluated in practice by presenting two application case studies: 1) linked data extracted from a printed registry of over 10 000 alumni (1867–1992) of the prominent Finnish high school Norssi, and 2) a knowledge graph extracted from 13 000 short biographies of significant Finnish people (from 3rd century to present times) in the National Biography of Finland. In both cases, the data is enriched by linking their entities with several other external datasets.
Jouni Tuominen, Eero Hyvönen and Petri Leskinen: Bio CRM: A Data Model for Representing Biographical Data for Prosopographical Research. Proceedings of the Second Conference on Biographical Data in a Digital World 2017 (BD2017), vol. 2119, pp. 59-66, CEUR Workshop Proceedings, Linz, Austria, 2018. bib pdf link
Biographies make a promising application case of Linked Data: they can be used, e.g., as a basis for Digital Humanities research in prosopography and as a key data and linking resource in semantic Cultural Heritage (CH) portals. In both use cases, a semantic data model for harmonizing and interlinking heterogeneous data from different sources is needed. This paper presents such a data model, Bio CRM, with the following key ideas: 1) The model is a domain specific extension of CIDOC CRM, making it applicable to not only biographical data but to other CH data, too. 2) The model makes a distinction between enduring unary roles of actors, their enduring binary relationships, and perduing events, where the participants can take different roles modeled as a role concept hierarchy. 3) The model can be used as a basis for semantic data validation and enrichment by reasoning. 4) The enriched data conforming to Bio CRM is targeted to be used by SPARQL queries in a flexible ways using a hierarchy of roles in which participants can be involved in events.
Toby Burrows, Eero Hyvönen, Lynn Ransom, Hanno Wijsman: Mapping Manuscript Migrations. Digging into Data for the History and Provenance of Medieval and Renaissance Manuscripts. Manuscript Studies. A Journal of the Schoenberg Institute for Manuscript Studies, vol. 3, no. 1, pp. 249-252, University of Pennsylvania Press, 2018. bib link
Eero Hyvönen, Petri Leskinen, Minna Tamper, Heikki Rantala, Esko Ikkala, Jouni Tuominen and Kirsi Keravuori: Biografiasammon tekoäly yhdistää ja rikastaa suomalaiset elämäkerrat semanttisessa webissä. Aalto-yliopisto, Semanttisen laskennan tutkimusryhmä (SeCo), Nov, 2018. bib pdf
Biografiasampo-järjestelmä käynnistää uuden aikakauden elämäkertakokoelmien julkaisemisessa ja käyttämisessä verkossa. Järjestelmän ydinaineistona on Kansallisbiografia ja muut Suomalaisen Kirjallisuuden Seuran (SKS) ja tieteellisten seurojen toimittamat pienoiselämäkerrat, yhteensä 13 100 elämäntarinaa, joita on kirjoittanut 900 suomalaista tutkijaa. Biografiasammon innovaationa on luoda kieliteknologian, tekoälyn ja semanttisen webin teknologioiden avulla elämäkertojen teksteistä ja niihin eri lähteissä liittyvistä tiedoista tietämysverkko (knowledge graph) ja kansallinen tietoinfrastruktuuri, joka koostuu miljoonista tietojen välisistä yhteyksistä. Tietämysverkko on julkaistu linkitetyn datan palvelussa, jonka varaan on toteutettu seitsemästä sovellusnäkymästä koostuva älykäs, kaikille avoin ja maksuton verkkopalvelu biografiasampo.fi kansalaisten ja digitaalisten ihmistieteiden tutkijoiden käytettäväksi.
Esko Ikkala, Jouni Tuominen, Jaakko Raunamaa, Tiina Aalto, Terhi Ainiala, Helinä Uusitalo and Eero Hyvönen: NameSampo: A Linked Open Data Infrastructure and Workbench for Toponomastic Research. Proceedings of the 2nd ACM SIGSPATIAL Workshop on Geospatial Humanities, GeoHumanities 18, pp. 2:1-2:9, ACM, Seattle, WA, USA, November, 2018. bib pdf link
This paper presents a series of projects where one of the main sources for toponomastic research in Finland, the corpora of place names in the Names Archive database of the Institute for the Languages of Finland, was digitized and how the resulting database was converted, enriched and published as Linked Open Data using a data processing pipeline. Utilizing the Linked Data infrastructure and various external data sources, a modern full-stack web application, NameSampo, was created in collaboration between toponomastic researchers and computer scientists for searching, analyzing, and visualizing digital toponomastic data sources.
Minna Tamper, Petri Leskinen, Kasper Apajalahti and Eero Hyvönen: Using Biographical Texts as Linked Data for Prosopographical Research and Applications. Digital Heritage. Progress in Cultural Heritage: Documentation, Preservation, and Protection. 7th International Conference, EuroMed 2018, Nicosia, Cyprus (Marinos Ioannides, Eleanor Fink, Raffaella Brumana, Petros Patias, Anastasios Doulamis, João Martins and Manolis Wallace (eds.)), pp. 125-137, Springer-Verlag, November, 2018. bib pdf link
Goki Miyakita, Petri Leskinen and Eero Hyvönen: Using Linked Data for Prosopographical Research of Historical Persons: Case U.S. Congress Legislators. Digital Heritage. Progress in Cultural Heritage: Documentation, Preservation, and Protection. 7th International Conference, EuroMed 2018, Nicosia, Cyprus, Springer-Verlag, November, 2018. bib pdf
Minna Tamper, Arttu Oksanen, Jouni Tuominen, Eero Hyvönen and Aki Hietanen: Anonymization Service for Finnish Case Law: Opening Data without Sacrificing Data Protection and Privacy of Citizens. Proceedings of Law via the Internet 2018 (LVI 2018), Knowledge of the Law in the Big Data Age, abstracts, Florence, Italy, October, 2018. bib pdf
Petri Leskinen, Goki Miyakita, Mikko Koho and Eero Hyvönen: Combining Faceted Search with Data-analytic Visualizations on Top of a SPARQL Endpoint. Proceedings of VOILA 2018, Monterey, California. CEUR Workshop Proceedings, Vol. 2187, October, 2018. bib pdf
Mikko Koho, Esko Ikkala and Eero Hyvönen: How to Maintain a Linked Data Cloud in a Deployed Semantic Portal. Proceedings of the ISWC 2018 Posters & Demonstrations, Industry and Blue Sky Ideas Tracks, CEUR Workshop Proceedings, Monterey, California, USA, October, 2018. Vol 2180. bib pdf link
Mikko Koho, Esko Ikkala, Erkki Heino and Eero Hyvönen: Maintaining a Linked Data Cloud and Data Service for Second World War History. Digital Heritage. Progress in Cultural Heritage: Documentation, Preservation, and Protection. 7th International Conference, EuroMed 2018, Nicosia, Cyprus, vol. 11196, Springer-Verlag, October-November, 2018. bib pdf link
Arttu Oksanen, Jouni Tuominen, Eetu Mäkelä, Minna Tamper, Aki Hietanen, and Eero Hyvönen: Semantic Finlex: Finnish Legislation and Case Law as a Linked Open Data Service. Proceedings of Law via the Internet 2018 (LVI 2018), Knowledge of the Law in the Big Data Age, abstracts, Florence, Italy, October, 2018. bib pdf
Mikko Koho, Erkki Heino, Arttu Oksanen and Eero Hyvönen: Toffee - Semantic Media Search Using Topic Modeling and Relevance Feedback. Proceedings of the ISWC 2018 Posters & Demonstrations, Industry and Blue Sky Ideas Tracks, CEUR Workshop Proceedings, Monterey, California, USA, October, 2018. Vol 2180. bib pdf link
Goki Miyakita, Petri Leskinen and Eero Hyvönen: U.S. Congress Prosopograher - A Tool for Prosopographical Research of Legislators. Proceedings of the ISWC 2018 Posters & Demonstrations, Industry and Blue Sky Ideas Tracks, CEUR Workshop Proceedings, Monterey, Califonia, USA, October, 2018. Vol 2180. bib pdf link
Kasper Apajalahti, Ermias Andargie Walelgne, Jukka Manner, Eero Hyvönen: Correlation-Based Feature Mapping of Crowdsourced LTE Data. 2018 IEEE 29th Annual International Symposium on Personal, Indoor, and Mobile Radio Communications (PIMRC), Bologna, Italy, September, 2018. bib pdf link
Kasper Apajalahti: Creating Time Series-Based Metadata for Semantic IoT Web Services. International Conference on Database and Expert Systems Applications, pp. 417-427, Springer, Regensburg, Germany, September, 2018. bib pdf link
Esko Ikkala, Eero Hyvönen and Jouni Tuominen: An Ontology of World War II Places for Linking and Enriching Heterogeneous Historical Data Sources. Abstracts, 17th International Conference of Historical Geographers (ICHG 2018), No. 194, Warsaw, Poland, July, 2018. bib pdf
Esko Ikkala, Eero Hyvönen and Jouni Tuominen: A Crowdsourced Old Map Service for Geocoding, Publishing, and Using Historical Places in Linked Data Applications. Abstracts, 17th International Conference of Historical Geographers (ICHG 2018), No. 195, Warsaw, Poland, July, 2018. bib pdf
Vilho Räisänen, Kasper Apajalahti: Reasoning in agent-based network management. NOMS 2018 - 2018 IEEE/IFIP Network Operations and Management Symposium, Taipei, Taiwan, April, 2018. bib pdf link
Minna Tamper, Arttu Oksanen, Eero Hyvönen: Schema.org - hakukonejättien semanttinen web (Schema.org - The Semantic Web of Search Engine Giants). Tietojohtaminen, April, 2018. bib pdf
Eero Hyvönen: Sotasampo - talvi- ja jatkosota semanttisessa webissä (WarSampo - Finnish WW2 on the Semantic Web). Tietoasiantuntija, April, 2018. bib pdf
Esko Ikkala, Eero Hyvönen and Jouni Tuominen: Geocoding, Publishing, and Using Historical Places and Old Maps in Linked Data Applications. Proceedings of the Digital Humanities in the Nordic Countries 3rd Conference, pp. 228-234, CEUR Workshop Proceedings, Vol 2084, Helsinki, Finland, March, 2018. bib pdf link
Mikko Koho, Erkki Heino, Esko Ikkala, Eero Hyvönen, Reijo Nikkilä, Tiia Moilanen, Katri Miettinen and Pertti Suominen: Integrating Prisoners of War Dataset into the WarSampo Linked Data Infrastructure. Proceedings of the Digital Humanities in the Nordic Countries 3rd Conference (DHN 2018), CEUR Workshop Proceedings, Helsinki, Finland, March, 2018. Vol 2084. bib pdf link
One of the great promises of Linked Data and the Semantic Web standards is to provide a shared data infrastructure into which more and more data can be imported and aligned, forming a sustainable, ever growing knowledge graph or linked data cloud, Web of Data. This paper studies and evaluates this idea in the context of the WarSampo Linked Data cloud, providing an infrastructure for data related to the Second World War in Finland. As a case study, a new database of prisoners of war with related contents is transformed into linked data and integrated into WarSampo. Lessons learned are discussed in relation to using traditional data publishing approaches.
Eetu Mäkelä, Mikko Tolonen and Jouni Tuominen (eds.): Proceedings of the Digital Humanities in the Nordic Countries 3rd Conference (DHN 2018). CEUR Workshop Proceedings, vol. 2084, Helsinki, Finland, March, 2018. bib link
Jouni Tuominen, Eetu Mäkelä, Eero Hyvönen, Arno Bosse, Miranda Lewis and Howard Hotson: Reassembling the Republic of Letters - A Linked Data Approach. Proceedings of the Digital Humanities in the Nordic Countries 3rd Conference (DHN 2018), pp. 76-88, CEUR Workshop Proceedings, vol. 2084, Helsinki, Finland, March, 2018. bib pdf link
Between 1500 and 1800, a revolution in postal communication allowed ordinary men and women to scatter letters across and beyond Europe. This exchange helped knit together what contemporaries called the respublica litteraria, Republic of Letters, a knowledge-based civil society, crucial to that era’s intellectual breakthroughs, and formative of many modern European values and institutions. To enable effective Digital Humanities research on the epistolary data distributed in different countries and collections, metadata about the letters have been aggregated, harmonised, and provided for the research community through the Early Modern Letters Online (EMLO) service. This paper discusses the idea and benefits of using Linked Data as a basis for the next digital framework of EMLO, and presents experiences of a first demonstrational implementation of such a system.
Eero Hyvönen, Petri Leskinen, Minna Tamper, Jouni Tuominen and Kirsi Keravuori: Semantic National Biography of Finland. Proceedings of the Digital Humanities in the Nordic Countries 3rd Conference (DHN 2018), pp. 372-385, CEUR Workshop Proceedings, Vol-2084, Helsinki, Finland, March, 2018. bib pdf link
This paper presents the vision of publishing and utilizing textual biographies as Linked (Open) Data on the Semantic Web. As a case study, we publish the live stories of the National Biography of Finland, created by the Finnish Literature Society, as semantic, i.e., machine “understandable” metadata in a SPARQL endpoint using the Linked Data Finland (LDF.fi) service. On top of the data service various Digital Humanities applications are built. The applications include searching and studying individual personal histories as well as historical research of groups of persons using methods of prosopography. The biographical data is enriched by extracting events from unstructured and semi-structured texts, and by linking entities internally and to external data sources. A faceted semantic search engine is provided for filtering groups of people from the data for prosopographical research. An extension of the event-based CIDOC CRM ontology is used as the underlying data model, where lives are seen as chains of interlinked events populated from the data of the biographies and additional data sources, such as museum collections, library databases, and archives.
Eero Hyvönen: Semanttinen web. Linkitetyn avoimen datan käsikirja (Semantic Web. Handbook of Linked Open Data). pp. 271, Gaudeamus, Helsinki, Finland, March, 2018. bib link
Suzie Thomas, Anna Wessman, Jouni Tuominen, Mikko Koho, Esko Ikkala, Eero Hyvönen, Ville Rohiola and Ulla Salmela: SuALT: Collaborative Research Infrastructure for Archaeological Finds and Public Engagement through Linked Open Data. Digital Humanities in the Nordic Countries 3rd Conference (DHN 2018), Book of Abstracts, Helsinki, Finland, March, 2018. bib pdf
2017
Kasper Apajalahti, Eero Hyvönen, Juha Niiranen, Vilho Räisänen: Combining ontological modelling and probabilistic reasoning for network management. Journal of Ambient Intelligence and Smart Environments, vol. 9, no. 1, pp. 63-76, IOS Press, 2017. bib pdf link
Advanced automation is needed in future mobile networks to provide adequate service quality economically and with high reliability. In this paper, a system is presented that takes into account the network context, analyses uncertain information, and infers network configurations by means of probabilistic reasoning. The system introduced in this paper is an experimental platform integrating a mobile network simulator, a Markov Logic Network (MLN) model, and an OWL 2 ontology into a runtime environment that can be monitored via a Resource Description Framework (RDF) -based user interface. In this approach, the OWL ontology contains a semantic representation of the relevant concepts, and the MLN model evaluates elements of uncertain information. Experiments based on a prototype implementation demonstrate the value of semantic modelling and probabilistic reasoning in network status characterization, optimization, and visualization.
Mikko Koho, Eero Hyvönen, Erkki Heino, Jouni Tuominen, Petri Leskinen and Eetu Mäkelä: Linked Death - Representing, Publishing, and Using Second World War Death Records as Linked Open Data. The Semantic Web: ESWC 2017 Satellite Events (Eva Blomqvist, Katja Hose, Heiko Paulheim, Agnieszka Ławrynowicz, Fabio Ciravegna and Olaf Hartig (eds.)), pp. 369-383, Springer, Cham, 2017. bib pdf link
War history of the Second World War (WW2), humankind’s largest disaster, is of great interest to both laymen and researchers. Most of us have ancestors and relatives who participated in the war, and in the worst case got killed. Researchers are eager to find out what actually happened then, and even more importantly why, so that future wars could perhaps be prevented. The darkest data of war history are casualty records—from such data we could perhaps learn most about the war. This paper presents a model and system for representing death records as linked data, so that 1) citizens could find out more easily what happened to their relatives during WW2 and 2) digital humanities (DH) researchers could (re)use the data easily for research.
Petri Leskinen, Eero Hyvönen and Jouni Tuominen: Analyzing and Visualizing Prosopographical Linked Data Based on Short Biographies. Biographical Data in a Digital World 2017 (BD2017), Linz, Austria, November, 2017. bib pdf link
Jouni Tuominen, Eero Hyvönen and Petri Leskinen: Bio CRM: A Data Model for Representing Biographical Data for Prosopographical Research. Biographical Data in a Digital World 2017 (BD2017), Linz, Austria, November, 2017. bib pdf link
Eetu Mäkelä, Juha Törnroos, Thea Lindquist and Eero Hyvönen: WW1LOD: An application of CIDOC-CRM to World War 1 linked data. International Journal on Digital Libraries, vol. 18, no. 4, pp. 333-343, Springer, nov, 2017. bib pdf link
The CIDOC-CRM standard indicates that common events, actors, places and timeframes are important in linking together cultural material, and provides a framework for describing them. However, merely describing entities in this way in two datasets does not yet interlink them. To do that, the identities of instances still need to be either reconciled, or be based on a shared vocabulary. The WW1LOD dataset presented in this paper was created to facilitate both of these approaches for collections dealing with the First World War. For this purpose, the dataset includes events, places, agents, times, keywords, and themes related to the war, based on over ten different authoritative data sources from providers such as the Imperial War Museum. The content is harmonized into RDF, and published as a Linked Open Data service. While generally basing on CIDOC-CRM, some modeling choices used also deviate from it where our experience dictated such. In the article, these deviations are discussed in the hope that they may serve as examples where CIDOC-CRM itself may warrant further examination. As a demonstration of use, the dataset and online service have been used to create a contextual reader application that is able link together and pull in information related to WW1 from e.g. 1914–1918 Online, Wikipedia, WW1 Discovery, Europeana and the Digital Public Library of America.
Petri Leskinen, Jouni Tuominen, Erkki Heino and Eero Hyvönen: An Ontology and Data Infrastructure for Publishing and Using Biographical Linked Data. Proceedings of the Workshop on Humanities in the Semantic Web (WHiSe II), pp. 15-26., CEUR Workshop Proceedings, Vol. 2014, Vienna, Austria, October, 2017. bib pdf link
This paper describes the ontology model and published datasets of a digitized biographical person register. The applied ontology model is designed to represent people via their enduring roles and perduring lifetime events. The model is designed to support 1) prosopographical Digital Humanities research, 2) linking to resources in semantic Cultural Heritage portals, and 3) semantic data validation and enrichment by using SPARQL queries. The linked data approach enables to enrich a person s biography by interlinking it with space and time related biographical events, persons relating by social contacts or family relations, historical events, and personal achievements.
Mikko Koho, Agata Dominowska, Elsi Hyttinen, Péter Ivanics, Elizabeth Oakes, Ilona Pikkanen, Leena Tulkki and Risto J. Turunen: Big data approach to 19th-century Finnish newspaper literature. HELDIG Digital Humanities Summit 2017, Helsinki, Finland, October, 2017. bib pdf link
Petri Leskinen, Mikko Koho, Erkki Heino, Minna Tamper, Esko Ikkala, Jouni Tuominen, Eetu Mäkelä and Eero Hyvönen: Modeling and Using an Actor Ontology of Second World War Military Units and Personnel. Proceedings of the 16th International Semantic Web Conference (ISWC 2017) (Claudia d Amato, Miriam Fernandez, Valentina Tamma, Freddy Lecue, Philippe Cudré-Mauroux, Juan Sequeda, Christoph Lange and Jeff Heflin (eds.)), pp. 280-296, Springer-Verlag, Vienna, Austria, October, 2017. bib pdf link
This paper presents a model for representing historical military personnel and army units, based on large datasets about World War II in Finland. The model is in use in WarSampo data service and semantic portal, which has had tens of thousands of distinct visitors. A key challenge is how to represent ontological changes, since the ranks and units of military personnel, as well as the names and structures of army units change rapidly in wars. This leads to serious problems in both search as well as data linking due to ambiguity and homonymy of names. In our solution, actors are represented in terms of the events they participated in, which facilitates disambiguation of personnel and units in different spatio-temporal contexts. The linked data in the WarSampo Linked Open Data cloud and service has ca. 9 million triples, including actor datasets of ca. 100 000 soldiers and ca. 16 100 army units. To test the model in practice, an application for semantic search and recommending based on data linking was created, where the spatio-temporal life stories of individual soldiers can be reassembled dynamically by linking data from different datasets. An evaluation is presented showing promising results in terms of linking precision.
Esko Ikkala, Mikko Koho, Erkki Heino, Petri Leskinen, Eero Hyvönen and Tomi Ahoranta: Prosopographical Views to Finnish WW2 Casualties Through Cemeteries and Linked Open Data. Proceedings of the Workshop on Humanities in the Semantic Web (WHiSe II), CEUR Workshop Proceedings, Vienna, Austria, October, 2017. bib pdf link
This paper presents an application for studying the death records of WW2 casualties from a prosopograhical perspective, provided by the various local military cemeteries where the dead were buried. The idea is to provide the end user with a global visual map view on the places in which the casualties were buried as well as with a local historical perspective on what happened to the casualties that lay within a particular cemetery of a village or town. Plenty of data exists about the Second World War (WW2), but the data is typically archived in unconnected, isolated silos in different organizations. This makes it difficult to track down, visualize, and study information that is contained within multiple distinct datasets. In our work, this problem is solved using aggregated Linked Open Data provided by the WarSampo Data Service and SPARQL endpoint.
Eero Hyvönen, Erkki Heino, Petri Leskinen, Esko Ikkala, Mikko Koho, Minna Tamper, Jouni Tuominen and Eetu Mäkelä: WarSampo: Publishing and Using Linked Open Data about the Second World War. EuropeanaTech Insight, no. 7, Europeana, September, 2017. bib pdf link
The article overviews the system WarSampo – Finnish World War 2 on the Semantic Web, the winner of the LODLAM Challenge 2017 Open Data Prize on June 29 in Venice, Italy.
Mikko Rinne: Event Processing Using Semantic Web Technologies. Dissertation, Aalto University, School of Science, Espoo, July, 2017. bib link
Minna Tamper, Petri Leskinen, Esko Ikkala, Arttu Oksanen, Eetu Mäkelä, Erkki Heino, Jouni Tuominen, Mikko Koho and Eero Hyvönen: AATOS – a Configurable Tool for Automatic Annotation. Proceedings, Language, Data and Knowledge (LDK 2017), pp. 276-289, Springer-Verlag, Galway, Ireland, June, 2017. bib pdf link
This paper presents an automatic annotation tool AATOS for providing documents with semantic annotations. The tool links entities found from the texts to ontologies defined by the user. The application is highly configurable and can be used with different natural language Finnish texts. The application was developed as a part of WarSampo and Semantic Finlex projects and tested using Kansa Taisteli magazine articles and consolidated Finnish legislation of Semantic Finlex. The quality of the automatic annotation was evaluated by measuring precision and recall against existing manual annotations. The results showed that the quality of the input text, as well as the selection and configuration of the ontologies impacted the results.
Erkki Heino, Minna Tamper, Eetu Mäkelä, Petri Leskinen, Esko Ikkala, Jouni Tuominen, Mikko Koho and Eero Hyvönen: Named Entity Linking in a Complex Domain: Case Second World War History. Proceedings, Language, Data and Knowledge (LDK 2017), pp. 120-133, Springer-Verlag, Galway, Ireland, June, 2017. bib pdf link
This paper discusses the challenges of applying named entity linking in a rich, complex domain – specifically, the linking of 1) military units, 2) places and 3) people in the context of rich Second World War data. Multiple sub-scenarios are discussed in detail through concrete evaluations, analyzing the problems faced, and the solutions developed. A key contribution of this work is to highlight the heterogeneity of problems and approaches needed even inside a single domain, depending on both the source data as well as the target authority.
Jouni Tuominen: Ontology Services for Knowledge Organization Systems. Dissertation, Aalto University, School of Science, Espoo, June, 2017. bib pdf link
Eero Hyvönen, Petri Leskinen, Erkki Heino, Jouni Tuominen and Laura Sirola: Reassembling and Enriching the Life Stories in Printed Biographical Registers: Norssi High School Alumni on the Semantic Web. Proceedings, Language, Data and Knowledge (LDK 2017), pp. 113-119, Springer-Verlag, Galway, Ireland, June, 2017. bib pdf link
This paper presents the idea to enrich printed biographical person registers with linked data related to events that took place after the register was published. By transforming printed historical documents into structured data, semantic search to written texts can be provided for the reader. Even more importantly, life stories of historical persons can be extended based on data linking by extracting semantic structures from printed texts, and by combining this data with external datasets and data services. Such linking provides an enriched context for prosopographical research on people in the register, as well as an enhanced reading experience for anyone interested in reading the biographies. As a concrete case study, a register 1867–1992 of over 10 000 alumni of the prominent Finnish high school “Norssi” was transformed into RDF, was enriched by data linking, was published as a linked data service, and is provided to end users via a faceted search engine and browser for studying lives of historical persons and for prosopographical research.
Erkki Heino: Sotahistorian kuvaaminen ja rikastaminen linkitettynä datana. MSc Thesis (in Finnish), University of Helsinki, Department of Computer Science, June, 2017. bib pdf link
Linkitetty data mahdollistaa erillisten aineistojen yhdistämisen, mistä syntyvä kokonaisuus mahdollistaa aineistojen tietojen paremman ymmärtämisen. Aineistojen välisten linkkien avulla voidaan päätellä uutta tietoa helpommin kuin tarkastelemalla aineistoja erikseen. Tutkielmassa käsitellään sotahistoriallisten aineistojen mallintamista ja julkaisua linkitettynä avoimena datana sekä aineistojen automaattista rikastamista muiden aineistojen avulla. Työn tavoitteena oli selvittää miten tällaisia aineistoja kannattaa mallintaa linkitettynä datana, miten niitä kannattaa yhdistää muihin aineistoihin, mitä lisäarvoa tästä saadaan ja miten aineistot kannattaa visualisoida. Aineistoina käytettiin tietokirjoista digitoituja tapahtumia sekä Sotamuseon SA-kuvapalvelun valokuvien metatietoja. Aineistot mallinnettiin käyttäen CIDOC CRM -standardia ja niitä rikastettiin linkittämällä niiden sisältämiä resursseja automaattisesti henkilö-, joukko-osasto- ja paikkaontologioiden avulla. CIDOC CRM:n määrittämä tapahtumakeskeinen mallinnustapa mahdollistaa aineistojen yhteentoimivuuden paitsi toistensa myös muiden historiallisten aineistojen kanssa. Automaattiseen rikastamiseen liittyi monia haasteita, sillä viittaukset toisiin aineistoihin oli poimittava suurelta osin tekstimuotoisista kuvauksista, jolloin ongelmaksi nousee nimettyjen entiteettien kuten henkilöiden ja paikkojen tunnistaminen ja yksilöinti tekstistä. Työssä käsitellään kyseisiä haasteita, esitellään käytetyt ratkaisut ja arvioidaan näiden toimivuutta. Aineistoja visualisoimaan toteutettiin myös JavaScript-sovellukset. Aineistot ja sovellukset on julkaistu osana Sotasampo-portaalia, joka muodostaa yhteenlinkitetyn kokonaisuuden erilaisista aineistoista liittyen toiseen maailmansotaan Suomessa. Portaali palvelee paitsi Suomen historiasta ja sodissa taistelleiden omaistensa liikkeistä kiinnostuneita kansalaisia, myös historian tutkijoita tarjoamalla aineistot vapaasti kyseltävässä rakenteisessa muodossa.
Kasper Apajalahti, Juha Niiranen, Shubhan Kapoor and Vilho Räisänen: Sharing Performance Measurement Events Across Domains. IFIP/IEEE International Symposium on Integrated Network Management, Proceedings, pp. 463-469, IEEE, Lisbon, Portugal, May, 2017. bib pdf link
Abstract—Network management activities, such as fault analysis and configuration management, are eventually related to changes in network measurements. Some measurement event might be either a trigger or objective of a management activity. We argue that sharing the semantics of performance data across networks provides a basis for more advanced automation. This paper presents an ontology-based system for sharing information about network measurements across network domains. The represented information contains correlations and human-defined mappings between network measurements and the system is based on semantic reasoning that identifies dependencies which arise by combining local and shared information.We demonstrate the usage of the system in a Long Term Evolution (LTE) network domain. Our experiments from an LTE simulator and LTE test network show that a combination of correlations, human defined mappings, and ontological reasoning produces useful cross-domain information that can be accessed with ontology queries.
Eero Hyvönen: Digitaalisten ihmistieteiden keskus HELDIG käynnisti toimintansa (Digital Humanities Centre HELDIG Started Operations). Tieteessä tapahtuu, vol. 35, no. 2, March, 2017. bib pdf link
Eero Hyvönen, Arttu Oksanen, Jouni Tuominen, Eetu Mäkelä and Minna Tamper: Semanttinen Finlex. Laki ja oikeus avoimena linkitettynä datana. (Semantic Finlex. Law and Justice as Linked Open Data.). Oikeus-lehti, vol. 46, no. 1, March, 2017. bib pdf
2016
Eero Hyvönen: Cultural Heritage Linked Data on the Semantic Web: Three Case Studies Using the Sampo Model. VIII Encounter of Documentation Centres of Contemporary Art: Open linked data and integral management of information in cultural centres, 2016. Artium, Vitoria-Gasteiz, Spain, October 19-20, 2016. bib pdf
A major challenge in publishing linked Cultural Heritage (CH) collections on the web is interoperability. This is due to the heterogeneity of CH contents and the distributed content creation model where publishers focus on their own data with little consideration on the others’ data. As a solution approach, the “Sampo” model is presented based on using domain independent modeling standards, on a model for aligning metadata models, and on sharing domain ontologies for populating the matadata models. The harmonized data is published for machines as a linked data service, to be used by applications for human users. To illustrate and evaluate the model, three online systems on the Web, Culture- Sampo, BookSampo, and WarSampo are presented.
Tuula Pääkkönen, Jukka Kervinen, Asko Nivala, Kimmo Kettunen and Eetu Mäkelä: Exporting Digitized Historical Newspaper Contents for Offline Use. D-Lib, vol. 22, no. 7/8, 2016. bib link
Digital collections of the National Library of Finland (NLF) contain over 10 million pages of historical newspapers, journals and some technical ephemera. The material ranges from the early Finnish newspapers from 1771 until the present day. The material up to 1910 can be viewed in the public web service, where as anything later is available at the six legal deposit libraries in Finland. A recent user study noticed that a different type of researcher use is one of the key uses of the collection. National Library of Finland has gotten several requests to provide the content of the digital collections as one offline bundle, where all the needed content is included. For this purpose we introduced a new format, which contains three different information sets: the full metadata of a publication page, the actual page content as ALTO XML, and the raw text content. We consider these formats most useful to be provided as raw data for the researchers. In this paper we will describe how the export format was created, how other parties have packaged the same data and what the benefits are of the current approach. We shall also briefly discuss word level quality of the content and show a real research scenario for the data.
Minna Tamper: Extraction of Entities and Concepts from Finnish Texts. MSc Thesis (in English), Aalto University, School of Science, Degree Programme in Computer Science and Engineering, Dec, 2016. bib pdf
Keywords are used in many document databases to improve search. The process of assigning keywords from controlled vocabularies to a document is called subject indexing. If the controlled vocabulary used for indexing is an ontology, with semantic relations and descriptions of concepts, the process is also called semantic annotation. In this thesis an automatic annotation tool was created to provide the documents with semantic annotations. The application links entities found from the texts to ontologies defined by the user. The application is highly configurable and can be used with different Finnish texts. The application was developed as a part of WarSampo and Semantic Finlex projects and tested using Kansa Taisteli magazine articles and consolidated legislation of Finnish legislation. The quality of the automatic annotation was evaluated by measuring precision and recall against existing manual annotations. The results showed that the quality of the input text, as well as the selection and configuration of the ontologies impacted the results.
Eero Hyvönen, Mikko Tolonen, Arto Mustajoki and Hanna Snellman: Heldig - Helsinki Centre for Digital Humanities. Digital Humanities Centres: Experiences and Perspectives. Warsaw, December 8-9, 2016, abstracts, Warsaw, Poland, Dec, 2016. bib pdf
Petri Leskinen: Sotilashenkilöiden ja joukko-osastojen mallintaminen ja käyttö toimijaontologiana. MSc Thesis (in Finnish), Aalto University, School of Science, Degree Programme in Computer Science and Engineering, Dec, 2016. bib pdf
Toimijaontologia mallintaa henkilöitä ja henkilöryhmiä linkitetyssä avoimessa datassa. Toimijaontologiamallin tarkoitus on mahdollistaa eri lähteiden aineistojen kokoaminen yhteen ja sen julkaisu yhdenmukaisessa formaatissa, jotta tietoa voidaan hyödyntää niin digitaalisten ihmistieteiden tutkimuksessa kuin tarjoamalla käyttöliittymiä aineiston selaamiseen visuaalisessa muodossa. Laadittu ontologia noudattaa toimija–tapahtuma-mallia. Siinä toimija mallinnetaan häneen liittyvien elämäkerrallisten tapahtumien summana. Ratkaisujen perustana käytettiin CIDOC CRM -standardia, millä haluttiin taata mallin helppo laajennettavuus sekä noudattaa kulttuurihistorialliselle datalle yhdenmukaista julkaisukäytäntöä. Työ on tehty osana laajempaa Sotasampo-projektia, johon kerättiin kattava tietokanta toisen maailmansodan aikaista aineistoa Suomen osalta. Oma osuuteni tässä työssä oli toimijaontologiamallin laatiminen sekä sen populointi sotilashenkilöillä ja -osastoilla. Aineisto on julkaistu avoimena datana (http://www.ldf.fi/dataset/warsa) ja on selattavissa Sotasampo-portaalissa (http://www.sotasampo.fi).
Haitao Tang, Kaj Stenberg, Kasper Apajalahti, Juha Niiranen and Vilho Räisänen: Automatic Definition and Application of Similarity Measures for Self-Operation of Network. Proc. 8th EAI International Conference on Mobile Networks and Management, Springer-Verlag, Abu Dhabi, Oct, 2016. bib pdf
Arttu Oksanen: Lainsäädännön ja oikeuskäytännön mallintaminen ja julkaiseminen linkitettynä avoimena datana. MSc Thesis (in Finnish), Aalto University, School of Electrical Engineering, Degree Programme in Electronics and Electrical Engineering, October, 2016. bib pdf
Viranomaiset julkaisevat oikeudellista tietoa verkossa avoimesti usein ihmisluettavissa PDF- ja HTML-muodoissa. Kuitenkin tiedonhaun tehostamiseksi ja tiedon ymmärtämisen helpottamiseksi tarvitaan älykkäitä palveluita sekä älykästä tietomallinnusta. Lisäksi lainsäädännön kansainvälistyessä eri organisaatioilla on tarve edistää oikeudellista tiedonvaihtoa yli kansallisten rajojen, mikä edellyttää aineistojen esitystavan yhtenäistämistä. Tässä diplomityössä tutkitaan, miten linkitetyn datan teknologioilla voidaan mallintaa ja julkaista lainsäädäntö sekä oikeuskäytäntö siten, että julkaisu palvelee laajasti eri oikeudellisen tiedon käyttötapauksia. Työ sisälsi RDF-tietomallien ja datamuunnoksen kehittämisen, datan rikastamisen sekä ohjelmointirajapintojen ja sovellusprototyyppien toteuttamisen. Lopputuloksena syntyi Semanttinen Finlex -palvelu, jossa Suomen lainsäädäntö sekä korkeimman oikeuden ja korkeimman hallinto-oikeuden ratkaisut on julkaistu keskeisiltä osin linkitettynä avoimena datana noudattaen eurooppalaisia tunniste- ja metatietostandardeja.
Eetu Mäkelä: LAS: an integrated language analysis tool for multiple languages. The Journal of Open Source Software, vol. 1, no. 6, The Open Journal, oct, 2016. bib link
LAS is a command-line tool for lemmatizing, morphological analysis, inflected form generation, hyphenation and language identification of multiple languages.
Kimmo Kettunen, Eetu Mäkelä, Juha Kuokkala, Teemu Ruokolainen and Jyrki Niemi: Modern Tools for Old Content - in Search of Named Entities in a Finnish OCRed Historical Newspaper Collection 1771-1910. Proceedings of LWDA 2016, Potsdam, Germany, September, 2016. bib pdf
Named entity recognition (NER), search, classification and tagging of names and name like frequent informational elements in texts, has become a standard information extraction procedure for textual data. NER has been applied to many types of texts and different types of entities: newspapers, fiction, historical records, persons, locations, chemical compounds, protein families, animals etc. In general a NER system’s performance is genre and domain dependent and also used entity categories vary. The most general set of named entities is usually some version of three partite categorization of locations, persons and organizations. In this paper we report first trials and evaluation of NER with data out of a digitized Finnish historical newspaper collection Digi. Digi collection contains 1 960 921 pages of newspaper material from years 1771– 1910 both in Finnish and Swedish. We use only material of Finnish documents in our evaluation. The OCRed newspaper collection has lots of OCR errors; its estimated word level correctness is about 74–75 %. Our principal NER tagger is a rule-based tagger of Finnish, FiNER, provided by the FIN-CLARIN consortium. We show also results of limited category semantic tagging with tools of the Semantic Computing Research Group (SeCo) of the Aalto University. FiNER is able to achieve up to 60.0 F-score with named entities in the evaluation data. Seco’s tools achieve 30.0–60.0 F-score with locations and persons. Performance of FiNER and SeCo’s tools with the data shows that at best about half of named entities can be recognized even in a quite erroneous OCRed text
Esko Ikkala: Suomalainen historiallisten paikkojen ja karttojen ontologiapalvelu. MSc Thesis (in Finnish), Aalto University, School of Electrical Engineering, Degree Programme of Automation and Systems Technology, August, 2016. bib pdf link
Historiallinen paikkatieto on keskeisessä asemassa muistiorganisaatioiden kokoelmien hallinnassa ja hyödyntämisessä sekä digitaalisten ihmistieteiden tutkimuksessa. Paikkatiedon käsitteleminen muissa kuin erikoistuneissa paikkatietojärjestelmissä sekä paikkatiedon ajallinen ulottuvuus tuovat mukanaan lukuisia haasteita, joihin linkitetyn datan teknologiat ovat tarjonneet lupaavia ratkaisuja. Tässä työssä esitellään kulttuurialan organisaatioiden tarpeeseen kehitetty uusi linkitetyn datan teknologioihin perustuva historiallisten paikkojen ja karttojen palvelumalli, HIPLA. HIPLA-palvelumallin tavoitteena on tarjota yhteinen näkymä eri organisaatioiden hallinnoimaan paikkatietoon ja mahdollistaa hajautettujen paikkatietoaineistojen yhteisöllinen täydentäminen, haku ja selailu sekä nykyisillä että historiallisilla kartoilla. Lisäksi työssä toteutettiin HIPLA-palvelumallin etuja havainnollistava prototyyppisovellus Hipla.fi, jota pilotoitiin osana talvi- ja jatkosodan aineistoja linkitettynä avoimena datana julkaisevaa Sotasampo-projektia. Pilotoinnin tuloksena syntyi talvi- ja jatkosodan paikkaontologia, joka tarjoaa työkalun sotiin liittyvien aineistojen automaattiselle linkitykselle ja aineistojen maantieteelliselle visualisoimiselle.
Esko Ikkala, Jouni Tuominen and Eero Hyvönen: Contextualizing Historical Places in a Gazetteer by Using Historical Maps and Linked Data. Proceedings of Digital Humanities 2016, short papers, pp. 573-577, Kraków, Poland, July, 2016. bib pdf link
Eetu Mäkelä, Thea Lindquist and Eero Hyvönen: CORE - A Contextual Reader based on Linked Data. Proceedings of Digital Humanities 2016, long papers, pp. 267-269, Kraków, Poland, July, 2016. bib pdf link
CORE is a contextual reader application intended to improve user close reading experience, particularly with regard to material in an unfamiliar domain. CORE works by utilizing Linked Data reference vocabularies and datasets to identify entities in any PDF file or web page. For each discovered entity, pertinent information such as short descriptions, pictures, or maps are sourced and presented on a mouse-over, to allow users to familiarize themselves with any unfamiliar concepts, places, etc in the texts they are reading. If further information is needed, an entity can be clicked to open a full context pane, which supports deeper contextualization (also visually, e.g. by displaying interactive timelines or maps). Here, CORE also facilitates serendipitous discovery of further related knowledge, by being able to bring in and suggest related resources from various repositories. Clicking on any such resource loads it into the contextual reader for endless further browsing.
Eetu Mäkelä, Tanja Säily and Terttu Nevalainen: Khepri - a Modular View-Based Tool for Exploring (Historical Sociolinguistic) Data. Proceedings of Digital Humanities 2016, long papers, pp. 269-272, Kraków, Poland, July, 2016. bib pdf link
Digital humanities needs tools that better support the core processes of humanistic inquiry. This includes support for handling uncertainty and incompleteness in the data, for interactive exploration, and for fluidly moving between close and distant reading. The Khepri tool presented here is part of a user-centered project to develop a modular set of components that take these requirements into account, and can be connected and configured to respond to the needs of a particular humanities task and data. Here, the configuration presented is one for the field of historical sociolinguistics, developed in collaboration between computer scientists and sociolinguistic researchers.
Eero Hyvönen, Erkki Heino, Petri Leskinen, Esko Ikkala, Mikko Koho, Minna Tamper, Jouni Tuominen and Eetu Mäkelä: Publishing Second World War History as Linked Data Events on the Semantic Web. Proceedings of Digital Humanities 2016, short papers, pp. 571-573, Kraków, Poland, July, 2016. bib pdf link
Data about wars is typically heterogeneous, distributed in the data silos of the fighting parties, multilingual, and often controversial depending on the political point of view. It is therefore hard for the historians to get a global picture of what has actually happened, to whom, where, when, and how. We argue that Semantic Web and Linked Data technologies are a very promising approach for modeling, harmonizing, and aggregating data about war history. Our goal is to make it possible, for both historians and laymen, to study history in a contextualized way where linked datasets enrich each other. The paper presents the in-use WarSampo 1 system, where massive collections of heterogeneous data about the (Finnish) history of the Second World War are harmonized using an event-based approach, and provided as a Linked Open Data service for applications to use. As a use case, a semantic portal WarSampo providing six different perspectives to the war based on events is presented.
Kasper Apajalahti, Eero Hyvönen, Juha Niiranen, Vilho Räisänen: Combining Ontologies and Markov Logic Networks for Statistical Relational Mobile Network Analysis. Proceedings of the 1st Workshop on Semantic Web Technologies for Mobile and Pervasive Environments (SEMPER), CEUR Workshop Proceedings, Heraklion, Crete, Greece, May, 2016. Vol 1588. bib pdf link
Eero Hyvönen, Esko Ikkala and Jouni Tuominen: Linked Data Brokering Service for Historical Places and Maps. Proceedings of the 1st Workshop on Humanities in the Semantic Web (WHiSe), pp. 39-52, CEUR Workshop Proceedings, Heraklion, Crete, Greece, May, 2016. Vol 1608. bib pdf link
This paper presents a new Linked Open Data brokering service model HIPLA for using and maintaining historical place gazetteers and maps based on distributed SPARQL endpoints. The model introduces several novelties: First, the service facilitates collaborative maintenance of geo-ontologies and maps in real time as a side effect of annotating contents in legacy cataloging systems. The idea is to support a collaborative ecosystem of curators that creates and maintains data about historical places and maps in a sustainable way. Second, in order to foster understanding of historical places, the places can be provided on both modern and historical maps, and with additional contextual Linked Data attached. Third, since data about historical places is typically maintained by different authorities and in different countries, the service can be used and extended in a federated fashion, by including new distributed SPARQL endpoints (or other web services with a suitable API) into the system. To test and demonstrate the model, we created the first prototype implementation Hipla.fi of the HIPLA model. Hipla.fi is based on four Finnish datasets in SPARQL endpoints totaling some 840,000 geocoded places on 450 historical maps from two atlas series aligned on modern maps, and on the Getty Thesaurus of Geographic Names (TGN) SPARQL endpoint in the US. As a first application, a part of the Hipla.fi data service has been applied in creating a 5 million triple semantic portal of historical Second World War data with tens of thousands of end users.
Mikko Koho, Eero Hyvönen, Erkki Heino, Jouni Tuominen, Petri Leskinen and Eetu Mäkelä: Linked Death - Representing, Publishing, and Using Second World War Death Records as Linked Open Data. Proceedings of the 1st Workshop on Humanities in the Semantic Web (WHiSe), CEUR Workshop Proceedings, Heraklion, Crete, Greece, May, 2016. Vol 1608. bib pdf link
War history of the Second World War (WW2), humankind s largest disaster, is of great interest to both laymen and researchers. Most of us have ancestors and relatives who participated in the war, and in the worst case got killed. Researchers are eager to find out what actually happened then, and even more importantly why, so that future wars could perhaps be prevented. The darkest data of war history are casualty records---from such data we could perhaps learn most about the war. This paper presents a model and system for representing death records as linked data, so that 1) citizens could find out more easily what happened to their relatives during WW2 and 2) digital humanities (DH) researchers could (re)use the data easily for research.
Mikko Koho, Erkki Heino and Eero Hyvönen: SPARQL Faceter - Client-side Faceted Search Based on SPARQL. Joint Proceedings of the 4th International Workshop on Linked Media and the 3rd Developers Hackshop, CEUR Workshop Proceedings, Heraklion, Crete, Greece, May, 2016. Vol 1615. bib pdf link
The faceted search paradigm is widely used in web applications, and there are various tools available for implementing it on the server side. In contrast, this paper presents an HTML based component tool on the client side that can be plugged on virtually any public SPARQL endpoint on the web, using only SPARQL API for data retrieval. To test and demonstrate the idea and the tool, application of the tool in a large in-use semantic portal is presented.
Kasper Apajalahti, Eero Hyvönen, Juha Niiranen, Vilho Räisänen: StaRe: Statistical Reasoning Tool for 5G Network Management. The Semantic Web: ESWC 2016 Satellite Events (Harald Sack, Giuseppe Rizzo, Nadine Steinmetz, Dunja Mladenić, Sören Auer and Christoph Lange (eds.)), Springer-Verlag, May, 2016. bib pdf
Eero Hyvönen, Erkki Heino, Petri Leskinen, Esko Ikkala, Mikko Koho, Minna Tamper, Jouni Tuominen and Eetu Mäkelä: WarSampo Data Service and Semantic Portal for Publishing Linked Open Data about the Second World War History. The Semantic Web – Latest Advances and New Domains (ESWC 2016) (Harald Sack, Eva Blomqvist, Mathieu d Aquin, Chiara Ghidini, Simone Paolo Ponzetto and Christoph Lange (eds.)), pp. 758-773, Springer-Verlag, May, 2016. bib pdf link
This paper presents the WarSampo system for publishing collections of heterogeneous, distributed data about the Second World War on the Semantic Web. WarSampo is based on harmonizing massive datasets using event-based modeling, which makes it possible to enrich datasets semantically with each others’ contents. WarSampo has two components: First, a Linked Open Data (LOD) service WarSampo Data for Digital Humanities (DH) research and for creating applications related to war history. Second, a semanticWarSampo Portal has been created to test and demonstrate the usability of the data service. The WarSampo Portal allows both historians and laymen to study war history and destinies of their family members in the war from different interlinked perspectives. Published in November 2015, theWarSampo Portal had some 20,000 distinct visitors during the first three days, showing that the public has a great interest in this kind of applications.
Osma Suominen, Henri Ylikotila, Sini Pessala, Mikko Lappalainen, Matias Frosterus, Jouni Tuominen, Thomas Baker, Caterina Caracciolo and Armin Retterath: Publishing SKOS vocabularies with Skosmos. 2016. Technical Report. bib pdf
2015
Matias Frosterus, Jouni Tuominen, Sini Pessala and Eero Hyvönen: Linked Open Ontology cloud: managing a system of interlinked cross-domain light-weight ontologies. International Journal of Metadata, Semantics and Ontologies, vol. 10, no. 3, pp. 189-201, 2015. bib pdf link
Miika Alonen, Tomi Kauppinen, Eero Hyvönen: Vocab.at – Automatic Linked Data Documentation And Vocabulary Usage Analysis. 2015. Technical Report. Aalto University, Semantic Computing Research Group (SeCo). bib pdf
Eero Hyvönen, Jouni Tuominen, Esko Ikkala and Eetu Mäkelä: Ontology Services Based on Crowdsourcing: Case National Gazetteer of Historical Places. Proceedings of the ISWC 2015 Posters & Demonstrations Track, CEUR-WS Proceedings, Bethlehem, PA, USA, October, 2015. Vol 1486. bib pdf link
This paper introduces the idea of applying crowdsourcing to evolving ontology services; the goal is to facilitate collaborative maintenance of ontologies in real time as a side effect of annotating contents in legacy cataloging systems. The idea is being implemented in the use case of creating and managing a national level gazetteer of historical places in Finland.
Eero Hyvönen, Jouni Tuominen, Eetu Mäkelä, Jérémie Dutruit, Kasper Apajalahti, Erkki Heino, Petri Leskinen and Esko Ikkala: Second World War on the Semantic Web: The WarSampo Project and Semantic Portal. Proceedings of the ISWC 2015 Posters & Demonstrations Track, CEUR-WS Proceedings, Bethlehem, PA, USA, October, 2015. Vol 1486. bib pdf link
This paper initiates and fosters work on publishing Linked Open Data about the Second World War. It is argued that the heterogeneous, distributed data about the international world war history makes a promising use case for semantic technologies. We hope that by making war data openly available we can learn from the past and promote peace.
Sini Pessala: Linkitettyjen kevytontologioiden muutosten kuvaaminen (Visualising Changes in a System of Linked Lightweight Ontologies). MSc Thesis (in Finnish), Aalto University, School of Science, Degree Programme of Computer Science and Engineering, April, 2015. bib pdf
Tuukka Ruotsalo and Eero Hyvönen: Exploiting Semantic Annotations for Domain-Specific Entity Search. Advances in Information Retrieval, ECIR2015, LNCS 9022, pp. 358-369, Springer-Verlag, March, 2015. bib link
Ville Piiparinen: Havaintodatan semanttinen mallintaminen ja validointi (Semantic modelling and validation of observation data). MSc Thesis (in Finnish), Aalto University, School of Electrical Engineering, Degree Programme of Automation and Systems Technology, February, 2015. bib pdf
Mikko Koho: Linked Data -palvelu luontohavaintoaineistoille. MSc Thesis (in Finnish), University of Helsinki, Department of Computer Science, February, 2015. bib pdf link
Biologisten havaintoaineistojen julkaiseminen linkitettynä datana mahdollistaa useiden aineistojen yhdistämisen toisiinsa. Yhdistämällä toisiinsa useita samaan asiaan liittyviä aineistoja, voidaan saavuttaa parempi ymmärrys kiinnostuksen kohteena olevasta ilmiöstä kuin tutkimalla aineistoja erikseen. Näin voidaan mahdollistaa tarkempien päätelmien tekeminen aineistojen pohjalta sekä etsiä odotettuja tai odottamattomia yhteyksiä aineistojen välillä. Linkitetyssä datassa käytetty RDF-tietomalli tuo aineistoihin koneluettavuuden ja helpon tavan viitata kaikkiin aineistojen osiin. Linkitettynä datana julkaistuja aineistoja voidaan helposti rikastaa yhä uusilla aineistoilla. Tässä tutkielmassa käsitellään Hangon lintuaseman havaintoaineiston sekä Ilmatieteenlaitoksen Hangon Russarön säähavaintoaineiston mallinnusta, käsittelyä ja hyödyntämistä linkitettynä datana. Aineistot on mallinnettu käyttäen RDF Data Cube -sanastoa, joka parantaa aineistojen yhteentoimivuutta. Lintuhavaintoaineistoon on annotoitu lajitietoa käyttäen ontologiaa Suomen linnuista, jota on rikastettu mm. lajien tuntomerkkiontologialla sekä uhanalaisuustiedoilla. Aineistot on julkaistu Linked Data Finland -alustalla, ja aineistojen välisten yhteyksien hahmottamiseksi on kehitetty visualisointipalvelun prototyyppi. Säätilan tiedetään olevan tärkeimpiä päivittäisen lintumuuton voimakkuuteen vaikuttavia tekijöitä. Visualisointipalvelulla pyritään näyttämään käyttäjälle, miten säätila vaikuttaa lintuhavaintomääriin ja erityisesti havaittuun lintumuuttoon. Aineistojen välisten suhteiden parempi tuntemus mahdollistaa tarkempien päätelmien tekemisen lintuhavaintoaineiston perusteella. Tutkielmassa esitetyt menetelmät ovat yleistettävissä lintu- ja säähavaintoaineistojen lisäksi muihin rakenteeltaan samankaltaisiin aineistoihin.
2014
Tomi Kauppinen, Giovana Mira de Espindola, Jim Jones, Alber Sánchez, Benedikt Gräler and Thomas Bartoschek: Linked Brazilian Amazon Rainforest Data. Semantic Web Journal, vol. 5, no. 2, pp. 151-155, 2014. bib link
Matias Frosterus, Jouni Tuominen and Eero Hyvönen: Facilitating Re-use of Legal Data in Applications--Finnish Law as a Linked Open Data Service. Proceedings of the 27th International Conference on Legal Knowledge and Information Systems (JURIX 2014), pp. 115-124, IOS Press, Krakow, Poland, December, 2014. bib pdf link
Esko Ikkala, Eetu Mäkelä and Eero Hyvönen: TourRDF: Representing, Enriching, and Publishing Curated Tours Based on Linked Data. 19th International Conference of Knowledge Engineering and Management (EKAW 2014), Demo and Poster Papers, November, 2014. bib pdf
Current mobile tourist guide systems are developed and used in separate data silos: each system and vendor tends to use its own proprietary, closed formats for representing tours and point of interest (POI) content. As a result, tour data cannot be enriched from other providers’ tour and POI repositories, or from other external data sources — even when such data were publicly available by, e.g., cities willing to promote tourism. This paper argues, that an open shared RDF-based tour vocabulary is needed to address these problems, and introduces such a model, TourRDF, extending the earlier TourML schema into the era of Linked Data. As a test and an evaluation of the approach, a case study based on data about the Unesco World Heritage site Suomenlinna fortress is presented.
Osma Suominen, Sini Pessala, Jouni Tuominen, Mikko Lappalainen, Susanna Nykyri, Henri Ylikotila, Matias Frosterus and Eero Hyvönen: Deploying National Ontology Services: From ONKI to Finto. Proceedings of the Industry Track at the International Semantic Web Conference 2014, CEUR Workshop Proceedings, Riva del Garda, Italy, October, 2014. Vol 1383. bib pdf link
The Finnish Ontology Library Service ONKI was published as a living laboratory prototype for public use in 2008. Its idea is to support content indexers and ontology developers via a browser interface and machine APIs. ONKI has been well-accepted, but being a prototype maintained by the ending research project FinnONTO (2003–2012), a more sustainable service was needed, supported by permanent governmental funding. To achieve this, ONKI was deployed and is being further developed by the National Library of Finland into a new national vocabulary service Finto. We discuss challenges in the deployment of ONKI into Finto and lessons learned during the transition process.
Eero Hyvönen, Miika Alonen, Esko Ikkala and Eetu Mäkelä: Life Stories as Event-based Linked Data: Case Semantic National Biography. Proceedings of ISWC 2014 Posters & Demonstrations Track, CEUR Workshop Proceedings, October, 2014. bib pdf link
This paper argues, by presenting a case study and a demonstration on the web, that biographies make a promising application case of Linked Data: the reading experience can be enhanced by enriching the biographies with additional life time events, by proving the user with a spatio-temporal context for reading, and by linking the text to additional contents in related datasets.
Nina Laurenne, Jouni Tuominen, Hannu Saarenmaa and Eero Hyvönen: Making species checklists understandable to machines - a shift from relational databases to ontologies. Journal of Biomedical Semantics, vol. 5, no. 40, September, 2014. bib pdf link
Osma Suominen and Christian Mader: Assessing and Improving the Quality of SKOS Vocabularies. Journal on Data Semantics, vol. 3, no. 1, pp. 47-73, June, 2014. bib pdf link
Tuukka Ruotsalo and Matias Frosterus: Diversifying Semantic Entity Search: Independent Component Analysis Approach. International Journal of Semantic Computing, vol. 7, no. 4, pp. 407-426, June, 2014. bib
Eetu Mäkelä: Aether - Generating and Viewing Extended VoID Statistical Descriptions of RDF Datasets. The Semantic Web: ESWC 2014 Satellite Events. ESWC 2014 (Presutti, V., Blomqvist, E., Troncy, R., Sack, H., Papadakis, I. and Tordai, A. (eds.)), pp. 429-433, Springer-Verlag, May, 2014. bib pdf link
This paper presents the Aether web application for generating, viewing and comparing extended VoID statistical descriptions of RDF datasets. The tool is useful for example in getting to know a newly encountered dataset, in comparing datasets between versions and in detecting outliers and errors. Examples are given on how the tool has been used to shed light on multiple important datasets.
Eetu Mäkelä: Combining a REST Lexical Analysis Web Service with SPARQL for Mashup Semantic Annotation from Text. The Semantic Web: ESWC 2014 Satellite Events. ESWC 2014 (Presutti, V., Blomqvist, E., Troncy, R., Sack, H., Papadakis, I. and Tordai, A. (eds.)), pp. 424-428, Springer-Verlag, May, 2014. bib pdf link
Current automatic annotation systems are often monolithic, holding internal copies of both machine-learned annotation models and the reference vocabularies they use. This is problematic particularly for frequently changing references such as person and place registries, as the information in the copy quickly grows stale. In this paper, arguments and experiments are presented on the notion that sufficient accuracy and recall can both be obtained simply by combining a sufficiently capable lexical analysis web service with querying a primary SPARQL store, even in the case of often problematic highly inflected languages.
Eero Hyvönen: FinnONTO-hanke loi ontologisen perustan kansalliselle webin tietoinfrastruktuurille (FinnONTO Project Created a Foundation for the Finnish Data Infrastructure on the Web). Tieteessä tapahtuu, no. 3, May, 2014. bib pdf link
Eero Hyvönen, Jouni Tuominen, Miika Alonen and Eetu Mäkelä: Linked Data Finland: A 7-star Model and Platform for Publishing and Re-using Linked Datasets. The Semantic Web: ESWC 2014 Satellite Events. ESWC 2014 (Presutti, V., Blomqvist, E., Troncy, R., Sack, H., Papadakis, I. and Tordai, A. (eds.)), pp. 226-230, Springer-Verlag, May, 2014. bib pdf link
The idea of Linked Data is to aggregate, harmonize, integrate, enrich, and publish data for re-use on the Web in a cost-efficient way using Semantic Web technologies. We concern two major hindrances for re-using Linked Data: It is often difficult for a re-user to 1) understand the characteristics of the dataset and 2) evaluate the quality the data for the intended purpose. This paper introduces the “Linked Data Finland” platform LDF.fi addressing these issues. We extend the famous 5-star model of Tim Berners-Lee, with the sixth star for providing the dataset with a schema that explains the dataset, and the seventh star for validating the data against the schema. LDF.fi also automates data publishing and provides data curation tools. The first prototype of the platform is available on the web as a service, hosting tens of datasets and supporting several applications.
Matias Frosterus: Ontologioiden siltaamisesta, peilaamisesta, ripustamisesta, mäppäämisestä ja linkittämisestä. Tietolinja, no. 1, The Finnish National Library, Helsinki, Finland, May, 2014. bib link
Mikko Koho, Eero Hyvönen and Aleksi Lehikoinen: Ornithology Based on Linking Bird Observations with Weather Data. The Semantic Web: ESWC 2014 Satellite Events, vol. 8798, pp. 75-85, Springer, May, 2014. bib pdf link
This paper presents first results of a use case of Linked Data for eScience, where 0.5 million rows of bird migration observations over 30 years time span are linked with 0.1 million rows of related weather observations and a bird species ontology. Using the enriched linked data biology researchers at the Finnish Museum of Natural History will be able to investigate temporal changes in bird biodiversity and how weather conditions affect bird migration. To support data exploration, the data is published in a SPARQL endpoint service using the RDF Data Cube model, on which semantic search and visualization tools are built.
Eetu Mäkelä and Eero Hyvönen: SPARQL SAHA, a Configurable Linked Data Editor and Browser as a Service. The Semantic Web: ESWC 2014 Satellite Events. ESWC 2014 (Presutti, V., Blomqvist, E., Troncy, R., Sack, H., Papadakis, I. and Tordai, A. (eds.)), pp. 434-438, Springer-Verlag, May, 2014. bib pdf link
SPARQL SAHA is a linked data editor and browser that can be used as a service, targeting any available SPARQL endpoint. Besides being available as a web service, the primary differentiating features of the tool are its configurability to match the underlying data, and the fact that the usability of its user interface has been verified by dozens of non-experts using the tool in multiple multi-year projects.
Katri Seppälä and Eero Hyvönen: Asiasanaston muuttaminen ontologiaksi. Yleinen suomalainen ontologia esimerkkinä FinnONTO-hankkeen mallista (Changing a Keyword Thesaurus into an Ontology. General Finnish Ontology as an Example of the FinnONTO Model). National Library, Plans, Reports, Guides, March, 2014. bib pdf link
Thomas Bartoschek, Gerald Pape, Christian Kray, Jim Jones and Tomi Kauppinen: Gestural Interaction with Spatiotemporal Linked Open Data. OSGeo Journal, vol. 13, no. 1, pp. 60-67, February, 2014. bib link
2013
Eetu Mäkelä, Kaisa Hypén and Eero Hyvönen: Fiction Literature as Linked Open Data - the BookSampo Dataset. Semantic Web – Interoperability, Usability, Applicability, vol. 4, no. 3, pp. 299-306, 2013. bib pdf link
The BookSampo dataset provides information as linked data on fiction literature published in Finland going back to the 15th century, along with rich descriptions of both their content and context. The dataset contains data on nearly 400,000 subjects, including literary works, authors, book covers, reviews, awards, images, and movies, over 3 million triples in total. The data has been applied as the basis of the BookSampo portal in public use in Finland, and is aligned with the cross-domain cultural heritage contents and ontologies of CultureSampo, another in-use semantic portal. The data has been used to answer complex questions, such as what topics should one write about, if one wants to get a literary award (based on statistics). The metadata was transformed into RDF from legacy library databases, then enriched manually by dozens of librarians in a Web 2.0 fashion in Finnish public libraries, and is constantly updated at a rate of some new 90,000 triples monthly.
Eero Hyvönen: Linked Data Finland. Terminfo, no. 4, Finnish Terminology Centre TSK, Helsinki, Finland, 2013. bib
Matias Frosterus: ONKI-projekti luo kansallista ontologiapalvelua. Terminfo, no. 4, Finnish Terminology Centre TSK, Helsinki, Finland, 2013. bib
Sven Buschbeck, Raphael Troncy, Anthony Jameson, Houda Khouf, Adrian Spirescu, Osma Suominen, Tanja Schneeberger and Eero Hyvönen: Parallel Faceted Browsing. Proceedings of CHI 2013, Extended Abstracts, Paris, 2013, Association for Computing Machinery (ACM), 2013. bib pdf
Thea Lindquist, Michael Dulock, Juha Törnroos, Eero Hyvönen and Eetu Mäkelä: Using Linked Open Data to Enhance Subject Access in Online Primary Sources. Cataloging & Classifying Quarterly, vol. 51, no. 8, Francis & Taylor, 2013. bib link
Using online primary sources is both rewarding and challenging for users. Improving subject access is essential as these sources become increasingly important in educational curricula. A user needs assessment with humanities users showed improving findability and context for historical subjects were major needs. Linked Data can help by linking related concepts in the sources using specialized vocabularies, enriching them with outside resources, and enabling semantic services that empower users. This article discusses a project to enhance subject access in an online World War I collection by deep linking historical data on the civilian experience in occupied Belgium and France.
Eero Hyvönen, Miika Alonen, Jouni Tuominen, and Eetu Mäkelä: Linked Data Finland: Towards a 7-star Service Platform for Linked Datasets. The First Annual KnowEscape Conference - KnowEscape 2013, Espoo, Finland, November, 2013. bib pdf
The idea of opening data on the Web as Linked Data (LD) is widely adopted in areas such as public government, science, libraries, and cultural heritage. The key idea is to harmonize, integrate, enrich, and re-use existing data repositories in a cost-efficient way via standard APIs in novel applications. This paper concerns two major hindrances for re-using LD: It is often difficult for a re-user to understand the 1) characteristics of the dataset and 2) evaluate the quality of the data for her intended purpose. This paper introduces the “Linked Data Finland” publishing platform LDF.fi addressing these issues. In order to enhance and promote reusability, we propose extending the famous 5-star model of Tim Berners-Lee into a 7-star model: The sixth star requires that the dataset is defined and explained in terms of explicit schemas. Explicit schemas make it possible to explain the re-user the intended characteristics of the data by, e.g., documentation about the schemas, and how the schemas (vocabularies) are actually used in the given dataset. The seventh star is given, if the data has also been validated w.r.t. the schema specifications. The results of the validation may be a human readable document and/or a machine readable reprentation regarding the quality issues found in the data. This paper reports about work in progress, but the first prototype of the platform is already operational on the web as a service http://ldf.fi.
Osma Suominen: Methods for Building Semantic Portals. Dissertation, Aalto University, School of Science, Espoo, September, 2013. bib link
Tuukka Ruotsalo and Matias Frosterus: Semantic Entity Search Diversification. Semantic Computing (ICSC), 2013 IEEE Seventh International Conference on, pp. 32-39, Irvine, CA, Sept, 2013. bib pdf
We present an approach to diversify entity search by utilizing semantics present and inferred from the initial entity search results. Our approach makes use of ontologies and independent component analysis of the entity descriptions to reveal direct and latent semantic connections between the entities present in the initial search results. The semantic connections are then used to sample a set of diverse entities. We empirically demonstrate the performance of our approach through retrieval experiments that use a real-world dataset composed from four entity databases. The results indicate that our approach significantly improves both diversity and effectiveness of entity search.
Miika Alonen, Tomi Kauppinen, Osma Suominen and Eero Hyvönen: Exploring the Linked University Data With Visualization Tools. The Semantic Web: ESWC 2013 Satellite Events, pp. 204-208, Springer-Verlag, Berlin Heidelberg, Montpellier, France, May 26-30, 2013. bib pdf
University data is typically stored in separate data silos even though the data is implicitly richly related together. Such data has a large and diverse user base, including faculty members, students, industrial partners, alumnis, collaborating universities, and media. In this paper, we demonstrate two tools for understanding and using the contents of linked university data. The first tool, Visualization Playground (VISU), supports querying and visualizing the data for example for illustrating emerging trends in universities (e.g., about publications) and for com- paring differences. The second tool, Vocabulary Visualizer (V^2), demon- strates the usage of vocabularies in the Linked University Data Cloud. It reveals what kinds of data different universities have published, and what terms are used to describe the contents. Such analysis is a basis for facilitating design of Linked Data applications across university data boundaries.
Matias Frosterus, Jouni Tuominen, Mika Wahlroos and Eero Hyvönen: The Finnish Law as a Linked Data Service. The Semantic Web: ESWC 2013 Satellite Events, pp. 289-290, Springer-Verlag, Berlin Heidelberg, Montpellier, France, May 26-30, 2013. bib pdf
Juridical information is important to organizations and individuals alike and is linked to from all walks of life. The Finnish government has published the Finlex Data Bank for searching and browsing legislation documents. However, the data there is not yet open, is based on a traditional XML schema, and does not conform to new semantic metadata standards. There are many difficulties in maintaining and using the site in, e.g., data harvesting, interoperability, querying, and linking that could be mitigated by the Semantic Web technologies. This paper presents an approach and a project—including first results—for publishing and using Finnish legislation as a 5-star Linked Open Data service.
Jouni Tuominen, Nina Laurenne, Mikko Koho and Eero Hyvönen: The Birds of the World Ontology AVIO. The Semantic Web: ESWC 2013 Satellite Events, pp. 300-301, Springer-Verlag, Berlin Heidelberg, Montpellier, France, May 26-30, 2013. bib pdf
We present an ontology for managing the scientific and common names of birds. The ontology is based on the TaxMeOn meta-ontology model for biological names. The ontology is in use as an ontology service and it has been applied in a bird watching system.
Matias Frosterus, Jouni Tuominen, Sini Pessala, Katri Seppälä and Eero Hyvönen: Linked Open Ontology Cloud KOKO--Managing a System of Cross-domain Lightweight Ontologies. The Semantic Web: ESWC 2013 Satellite Events, pp. 296-297, Springer-Verlag, Berlin Heidelberg, Montpellier, France, May 26-30, 2013. bib pdf
Eero Hyvönen, Miika Alonen, Mikko Koho and Jouni Tuominen: BirdWatch--Supporting Citizen Scientists for Better Linked Data Quality for Biodiversity Management. Proceedings of the first international Workshop on Semantics for Biodiversity (S4BioDiv), ESWC 2013, CEUR Workshop Proceedings, Vol 979, Montpellier, France, May, 2013. bib pdf link
Observational data about species of public interest, such as birds and butterflies, is often created and collected by volunteered citizen scientists, and used by professionals for managing biodiversity. The education and skills of the citizens participating in the work varies a lot, and the process of making observations is typically not systematic but rather ad hoc. As a result, the quality of the observational data in repositories, such as the Global Biodiversity Information Facility GBIF Data Portal, is often not good, hampering its utilization severely. This paper presents an approach for enhancing data quality in a citizen science setting, and presents a mobile tool BirdWatch for citizen observers, mitigating difficulties in producing high quality Linked Data for biodiversity management.
Jouni Tuominen, Nina Laurenne and Eero Hyvönen: Publishing and Using Plant Names as an Ontology Service. Proceedings of the first international Workshop on Semantics for Biodiversity (S4BioDiv), ESWC 2013, CEUR Workshop Proceedings, Vol 979, Montpellier, France, May, 2013. bib pdf link
Animals and plants are referred to using scientific or common names depending on the expertise of an audience or a source of data. The names change in time and therefore their usage as identifiers as such is problematic. We present a solution for managing and using plant names as an ontology. The ontology is based on the TaxMeOn meta-ontology for biological names. In order to refer to organisms unambiguously and publish information as Linked Data on the web, the names are given URIs. The ontology is developed collaboratively and it supports the approval process and temporal tracking of the common names. We introduce an ontology service of plant names for end-users and provide user interfaces and APIs for integrating the ontology into applications.
Tuukka Ruotsalo, Krister Haav, Antony Stoyanov, Sylvain Rochee, Elena Fanid, Romina Deliaic, Eetu Mäkelä, Tomi Kauppinen and Eero Hyvönen: SMARTMUSEUM: A Mobile Recommender System for the Web of Data. Journal of Web Semantics, vol. 20, pp. 50-67, May, 2013. bib link
Semantic and context knowledge have been envisioned as an appropriate solution for addressing the content heterogeneity and information overload in mobile Web information access, but few have explored their full potential inmobile scenarios, where information objects refer to their physical counterparts, and retrieval is context-aware and personalized for users. We present SMARTMUSEUM, a mobile ubiquitous recommender system for the Web of Data, and its application to information needs of tourists in context-aware, on-site access to cultural heritage. The SMARTMUSEUM system utilizes Semantic Web languages as the form of data representation. Ontologies are used to bridge the semantic gap between heterogeneous content descriptions, sensor inputs, and user profiles. The system makes use of an information retrieval framework where in context data and search result clustering are used in recommendation of suitable content for mobileusers. Results from laboratory experiments demonstrate that ontology-based reasoning, query expansion, search result clustering, and context knowledge lead to significant improvement in recommendation performance. The results from field trials show that the usability of the system meets users’ expectations in real-world use. The results indicate that semantic content representation and retrieval can significantly improve the performance of mobile recommender systems in knowledge-rich domains.
Mika Wahlroos: Indeksointimetatiedon eristäminen ja arviointi (Extraction and evaluation of index metadata). MSc Thesis (in Finnish), University of Helsinki, Department of Computer Science, February, 2013. bib pdf
Tiedonhallinnassa käytetään usein metatietona tiedon sisältöä kuvaavia avainsanoja parantamaan tiedon hallittavuutta tai löydettävyyttä. Sisällön kuvailua luonnollisen kielen termein tai käsittein kutsutaan indeksoinniksi. Yhdenmukaisuuden vuoksi voidaan käyttää tarkoitusta varten laadittua asiasanastoa, joka kattaa toimialan kannalta keskeisen termistön. Semanttisessa webissä ja yhdistetyssä tiedossa käytettävät ontologiat vievät ajatuksen pitemmälle määrittelemällä termit käsitteinä ja niiden välisinä merkityssuhteina. Metatiedon tuottamisen helpottamiseksi ja tehostamiseksi on kehitetty erilaisia menetelmiä, joilla sisältöä kuvailevia termejä voidaan tuottaa tekstiaineistosta automaattisesti. Tässä tutkielmassa keskitytään avaintermien automaattiseen eristämiseen tekstistä sekä metatiedon laatuun ja sen arvioinnin menetelmiin. Esimerkkitapauksena käsitellään ontologiaa hyödyntävän Maui-indeksointityökalun käyttöä asiakirjallisen tiedon automaattiseen asiasanoittamiseen. Automaattisesti eristetyn metatiedon laatua verrataan alkuperäiseen ihmisten määrittämään asiasanoitukseen käyttäen tarkkuus- ja saantimittauksia. Lisäksi evaluointia täydennetään aihealueen asiantuntijoiden esittämillä subjektiivisilla laatuarvioilla. Tulosten perusteella selvitetään tekstin esikäsittelyn ja sanaston hierarkian merkitystä automaattisen asiasanoituksen laadun kannalta sekä pohditaan keinoja annotointimenetelmän jatkokehittämiseksi.
2012
Mika Wahlroos, Matias Frosterus and Eero Hyvönen: Tapaustutkimus: semanttinen linkitys ja tiedon löydettävyys puolustusvoimien normitietokannassa (Semantic linking and document findability in Finnish Defence Forces norms - a case study). Tiedon kehittyneempi käsittely johtamisjärjestelmäarkkitehtuurissa - tutkimukset 2011 (Vesa Kuikka (ed.)) (in Finnish), series 1, no. 5, Puolustusvoimien Johtamisjärjestelmäkeskus, 2012. bib pdf
Jouni Tuominen, Nina Laurenne and Eero Hyvönen: An Ontology Model and Service for Managing Scientific and Common Names of Plants. Poster and Demo proceedings of the 2nd Joint International Semantic Technology Conference (JIST2012), Nara, Japan, December, 2012. bib pdf
Tomi Kauppinen, Line C. Pouchard and Carsten Kessler (eds.): . Proceedings of the Second International Workshop on Linked Science 2012 (LISC2012), CEUR Workshop Proceedings, vol. 951, CEUR Workshop Proceedings, Vol 951, http://ceur-ws.org, ISSN 1613-0073, November, 2012. bib link
Sven Buschbeck, Anthony Jameson, Raphael Troncy, Houda Khrouf, Osma Suominen and Adrian Spirescu: A Demonstrator for Parallel Faceted Browsing. Proceedings of the IESD Challenge track at the International Workshop on Intelligent Exploration of Semantic Data (IESD 12), Galway, Ireland, October, 2012. bib pdf
Osma Suominen and Eero Hyvönen: Improving the Quality of SKOS Vocabularies with Skosify. Proceedings of the 18th International Conference on Knowledge Engineering and Knowledge Management (EKAW 2012), Springer-Verlag, Galway, Ireland, October, 2012. bib pdf
Alkyoni Baglatzi and Tomi Kauppinen: Managing and Representing Scientific Findings about the Environment. Demos and Posters of the 18th International Conference on Knowledge Engineering and Knowledge Management (EKAW2012), October, 2012. bib
Eero Hyvönen: Publishing and Using Cultural Heritage Linked Data on the Semantic Web. Morgan & Claypool, Palo Alto, CA, USA, October, 2012. bib pdf link
Osma Suominen, Alex Johansson, Henri Ylikotila, Jouni Tuominen and Eero Hyvönen: Vocabulary Services Based on SPARQL Endpoints: ONKI Light on SPARQL. Poster proceedings of the 18th International Conference on Knowledge Engineering and Knowledge Management (EKAW 2012), Galway, Ireland, October, 2012. bib pdf
Kim Viljanen, Jouni Tuominen, Eetu Mäkelä and Eero Hyvönen: Normalized Access to Ontology Repositories. Proceedings of the Sixth International Conference on Semantic Computing (IEEE ICSC 2012), IEEE Press, Palermo, Italy, September, 2012. bib pdf
Ontology repositories, such as NCBO Bioportal, ONKI and Cupboard, help finding and using ontologies on the Semantic Web. However, currently each ontology repository constitutes a separate island with its own user interface, APIs, users, ontology languages and set of ontologies. Because there is not a universal way to access all ontology repositories, doing global search, browsing, and inference over all available ontology repositories turns out to be technically difficult and is generally not done. Ontologies are not reused as much as they could and hence the full potential of ontologies is not achieved. To address the problem, we propose the Normalized Ontology Repository (NOR) approach to make the ontology repositories universally accessible while maintaining their unique functionalities and strengths. The SKOS language is used as the lowest common denominator for presenting the ontologies. In addition, a simple API for searching and accessing the ontologies is defined. As a proof-of-concept evaluation, we present three case implementations to demonstrate the NOR approach: 1) the distributed architecture of the ONKI repository, 2) the metasearch for ONKI and NCBO Bioportal, and 3) publishing informal ontological concept collections as NOR end-points, demonstrated with the semantic portal CultureSampo and the metadata editor SAHA.
Eetu Mäkelä, Kaisa Hypén and Eero Hyvönen: Improving Fiction Literature Access by Linked Open Data -Based Collaborative Knowledge Storage - the BookSampo Project. World Library and Information Congress: 78th IFLA General Conference and Assembly, Helsinki, IFLA, http://conference.ifla.org/ifla78, August, 2012. bib pdf
BookSampo is a joint project between the Finnish public libraries and semantic web researchers, to improve fiction literature search and recommendation. In the project, dozens of librarians around Finland have used a collaborative web-based metadata editor to input diverse knowledge about fiction literature into a shared database. Particularly, the project has sought to improve access by indexing not only bibliographical information about the books, but focusing on the content and context of the works. In order to do this, the database employs advanced techniques such as functional, content-centered indexing, ontological vocabularies and the networked data model of linked open data. To demonstrate the functionality this makes possible, the fiction literature portal http://www.kirjasampo.fi/ was created. This portal uses the knowledge created in the project to offer advanced semantic search and recommendation based on the database created. In addition, web services exposing direct access to the data have been used for example in culture hack events to answer more complex questions, such as where in Finland are the most crimes committed in fiction literature.
Thea Lindquist, Eero Hyvönen, Juha Törnroos, Eetu Mäkelä: Leveraging linked data to enhance subject access - A case study of the University of Colorado Boulder s World War I collection online. World Library and Information Congress: 78th IFLA General Conference and Assembly, Helsinki, IFLA, http://conference.ifla.org/ifla78, August, 2012. bib link
Academic users often find work with online primary sources both rewarding and challenging. Improving subject access in these sources is essential as digital collections propagate and work with primary sources becomes increasingly important in humanities curricula. A user needs assessment was conducted with humanities users at the University of Colorado Boulder to facilitate engagement with these sources. Two of the major user needs identified were improving findability and context, particularly for historical subjects. Linked Data can help meet these needs by linking related concepts in the sources using a specialized vocabulary, enriching them with outside resources, and enabling semantically rich services that empower users. This paper discusses a project the authors undertook to enhance subject access in CU’s WWI Collection Online by deep linking historical data on the civilian experience in occupied Belgium. This work is intended to lead to a richer understanding of forces shaping the WWI period.
Eero Hyvönen, Thea Lindquist, Juha Törnroos and Eetu Mäkelä: History on the Semantic Web as Linked Data - An Event Gazetteer and Timeline for World War I. Proceedings of CIDOC 2012 - Enriching Cultural Heritage, Helsinki, Finland, CIDOC, http://www.cidoc2012.fi/en/cidoc2012/programme, June, 2012. bib pdf
Events are an essential component of cultural heritage (CH) Linked Data (LD): they link actors, places, times, objects, and other events into larger narrative structures, providing a rich basis for semantic searching, recommending, analysis, and visualization of CH data. This paper argues that shared vocabularies (gazetteers, ontologies) of events, such as the “Battle of Normandy” or “Crucifixion of Jesus”, are necessary to facilitate the aggregation and linking of heterogeneous content from various collections. For example, biographies, histories, photos, and paintings often reference or depict events. A set of general requirements for an event gazetteer is presented, based on the needs of publishing, aggregating, and reusing cultural heritage content as Linked Data. After this, a metadata model addressing the presented requirements for representing historical events is outlined. The model is being applied in a case study aimed at developing an event ontology for World War I (WWI). Our goals from an end-user perspective are twofold: 1) Facilitate event-based cataloging for curators in memory organizations; 2) Utilize semantic event descriptions and narrative event structures in end-user applications for searching and linking documents and other content about WWI, and for structuring and visualizing them.
Suvi Kettula and Eero Hyvönen: Process-centric Cataloguing of Intangible Cultural Heritage. Proceedings of CIDOC 2012 - Enriching Cultural Heritage, Helsinki, Finland, CIDOC, http://www.cidoc2012.fi/en/cidoc2012/programme, June, 2012. bib pdf
Museums and archives collect and store documentation of processes of intangible cultural heritage (ICH), such as craftsmanship skills, acts, and events recorded in videos, audio tapes, manuscripts, photos, and transcriptions. Such recordings are typically catalogued in an object-centric way as documents, using schemas such as Dublin Core. Also in event-centric models the focus has been on tangible cultural heritage. In this article we point out the importance of cataloguing not only the documentation object or related events, but the actual cultural process, such as a craftsmanship skill. Using special process-centric metadata for ICH, one can search for information about the elements and parts of intangible processes, not only documentation objects. Furthermore, process descriptions can be linked to related tangible and intangible objects in collections and Linked Data repositories on the web, facilitating rich and detailed semantic recommendations to end-users. To test and evaluate this idea, we created a metadata model for representing cultural processes, and applied it to the video documentation of traditional shoemaking with visualization and real time semantic recommendations on the CultureSampo portal.
Eero Hyvönen, Aleksi Lindblad and Eetu Mäkelä: TravelSampo System for Creating Mobile Audio Guide Tours Enriched with Linked Data. Proceedings of CIDOC 2012 - Enriching Cultural Heritage, Helsinki, Finland, CIDOC, http://www.cidoc2012.fi/en/cidoc2012/programme, June, 2012. bib pdf
TravelSampo [1] is a prototype system, by which museums are able to create interactively audio guide tours inside museums and outside in the open air. The system includes a web-based editor by which a curator can describe objects in an exhibition, or in the open air, using a set of shared ontologies published in the National Ontology Service ONKI (http://onki.fi/), and upload related audio descriptions, text, and images. Each exhibit object is given an identifier and a geo-location. When the end-user is near the object, either in a museum or in the open air, information related to the object can be given to her based on the object identifier or GPS location. A major novelty of TravelSampo lies in its ability to associate the object metadata automatically with millions of semantically related pieces of information available though the Linked Data cloud (http://linkedata.org/) and the CultureSampo system (http://www.kulttuurisampo.fi/). For example, a painting can be linked, based on the underlying ontologies and metadata, with the biography of the painter in Wikipedia or in the National Biography, with other paintings of the artist in the collections of other museums, with photos and books about the artist, and so on. This gives the end-user a richer experience than is possible with traditional audio guide systems. For the museums, TravelSampo offers a cost-efficient and dynamic way of creating information rich audio guide programs, and re-using and linking each others collections through linked data, leading to a win-win situation. The paper presents and discusses the underlying ideas of TravelSampo and our experiences in developing the systems especially from the content publishers’, i.e. the museums’ viewpoint. [1] E. Mäkelä, J. Väätäinen, R. Alitalo, O. Suominen, E. Hyvönen: Discovering Places of Interest through Direct and Indirect Associations in Heterogeneous Sources - The TravelSampo System. Terra Cognita 2011: Foundations, Technologies and Applications of the Geospatial Web, CEUR Workshop Proceedings, Vol-798, 2011. http://ceur-ws.org/Vol-798/proceedings.pdf
Jouni Tuominen, Kim Viljanen and Eero Hyvönen: Ontologiapalvelut semanttisessa webissä (Ontology services on the Semantic Web). (in Finnish), Tietojenkäsittelytiede, no. 34, pp. 17-36, Tietojenkäsittelytieteen Seura ry, April, 2012. bib pdf
Ontologiat ovat keskeinen osa semanttista webiä: ne toimivat yhteisinä jaettuina käsitteistöinä, joiden avulla tietokoneet voivat käsitellä tietoa älykkäämmin. Jotta eri toimijat voivat hyödyntää yhteisiä käsitteistöjä sovelluksissaan, ontologiat on julkaistava heidän käyttöönsä. Yksinkertaisimmillaan ontologiat voidaan julkaista tiedostomuodossa. Tällöin jokainen toimija joutuu toteuttamaan itse toiminnallisuuksia ontologioiden hyödyntämiseen. Koska osa toiminnallisuuksista on yleisiä, useissa järjestelmissä toistuvia, niiden toteuttaminen valmiina palveluina on mielekästä. Palveluita voidaan tarjota ihmiskäyttäjille käyttöliittymäkomponentteina sekä ohjelmalliseen käyttöön rajapintoina, joita käyttämällä toiminnallisuudet voidaan integroida asiakasjärjestelmiin. Tässä artikkelissa kuvataan ontologioiden käyttäjäryhmien tarpeita sekä ontologiapalveluiden toteutuksia. Yleisten ontologioiden käyttämiseen liittyvien toiminnallisuuksien tarjoamiseksi esitetään ontologiapalvelu ONKI, joka on osa Suomalaiset semanttisen webin ontologiat -hankesarjassa (FinnONTO, 2003–2012) kehitettyä ontologiainfrastruktuuria. Artikkeli perustuu Jouni Tuomisen pro gradu -työhön, jolle Tietojenkäsittelytieteen Seura ry myönsi lukuvuoden 2009–2010 pro gradu -palkinnon. Tutkimustyöhön ovat osallistuneet myös Kim Viljanen ja Eero Hyvönen.
Eetu Mäkelä, Eero Hyvönen and Tuukka Ruotsalo: How to deal with massively heterogeneous cultural heritage data – lessons learned in CultureSampo. Semantic Web – Interoperability, Usability, Applicability, vol. 3, no. 1, January, 2012. bib pdf link
This paper presents the CultureSampo system for publishing heterogeneous linked data as a service. Discussed are the problems of converting legacy data into linked data, as well as the challenge of making the massively heterogeneous yet interlinked cultural heritage content interoperable on a semantic level. Novel user interface concepts for then utilizing the content are also presented. In the approach described, the data is published not only for human use, but also as intelligent services for other computer systems that can then provide interfaces of their own for the linked data. As a concrete use case of using CultureSampo as a service, the BookSampo system for publishing Finnish fiction literature on the semantic web is presented.
Dimitrios A. Koutsomitropoulos, Eero Hyvönen, and Theodore S. Papatheodorou: Semantic Web and Reasoning for Cultural Heritage and Digital Libraries. Semantic Web – Interoperability, Usability, Applicability, vol. 3, no. 1, IOS Press, January, 2012. Editorial. bib pdf link
Dimitrios A. Koutsomitropoulos, Eero Hyvönen, and Theodore S. Papatheodorou (eds.): Semantic Web and Reasoning for Cultural Heritage and Digital Libraries (Special Issue). Semantic Web – Interoperability, Usability, Applicability, vol. 3, no. 1, IOS Press, January, 2012. bib link
Eero Hyvönen: Museoalan ontologiat Suomessa. Kansallisen FinnONTO-hankkeen tulosten hyödyntäminen (Cultural Heritage Ontologies in Finland. Utilizing the results of the FinnONTO project.). (in Finnish), Aalto University, Department of Media Technology, 4, 2012. bib pdf
2011
Eetu Mäkelä, Kaisa Hypén and Eero Hyvönen: BookSampo--Lessons Learned in Creating a Semantic Portal for Fiction Literature. The Semantic Web - ISWC 2011 - 10th International Semantic Web Conference, Bonn, Germany, pp. 173-188, Springer-Verlag, 2011. bib pdf link
BookSampo is a semantic portal in use, covering metadata about practically all Finnish fiction literature of Finnish public libraries on a work level. The system introduces a variety of semantic web novelties deployed into practise: The underlying data model is based on the emerging functional, content-centered metadata indexing paradigm using RDF. Linked Data (LD) principles are used for mapping the metadata with tens of interlinked ontologies in the national FinnONTO ontology infrastructure. The contents are also linked with the large LD metadata repository of related cultural heritage content of CultureSampo. BookSampo is actually based on using CultureSampo as a semantic web service, demonstrating the idea of re-using semantic content from multiple perspectives without the need for modifications. Most of the content has been transformed automatically from existing databases, with the help of ontologies derived from thesauri in use in Finland, but in addtion tens of volunteered librarians have participated in a Web 2.0 fashion in annotating and correcting the metadata, especially regarding older litarature. For this purpose, semantic web editing tools and public ONKI ontology services were created and used. The paper focuses on lessons learned in the process of creating the semantic web basis of BookSampo.
Eetu Mäkelä, Aleksi Lindblad, Jari Väätäinen, Rami Alatalo, Osma Suominen and Eero Hyvönen: Discovering Places of Interest through Direct and Indirect Associations in Heterogeneous Sources -- The TravelSampo System. Terra Cognita 2011: Foundations, Technologies and Applications of the Geospatial Web, CEUR Workshop Proceedings, Vol-798, 2011. bib pdf
Linked data related to places has a potential to offer a vastly superior collection of information to base search and recommendation functionality on in eTourism visit planning as well as location-aware mobile applications. Particularly, through linked data, besided places interesting in themselves, it is possible to discover places interesting only through association, such as being the venue for a concert by an artist with an interesting genre. However, in order to harness this collective data source, challenges relating to data heterogeneity, quality, scale, and indexing and querying complexity must be resolved. In this paper, the TravelSampo visit planning and mobile application is presented, which tackles these issues. Using the system, queries describing both simple and complex interests can be run over some 17 million places of interest from over 20 vastly heterogeneous sources.
Katariina Nyberg, Matias Frosterus and Eero Hyvönen: Linking Data for Industrial Knowledge Management – A Case Study. ESWC 2011, OSEMA Workshop paper, 2011. bib pdf
Manufacturing companies face the challenge of maintaining documentation and knowledge about their projects and products, scattered in heterogenous, distributed databases, represented in different formats and languages, and hosted in mutually incompatible systems. At the same time, the knowledge needs to be accessed on a global level from different perspectives and user groups, such as project planners, designers, and maintenance personnel. This paper presents a case study, based on real datasets of a major international diesel engine and power plant manufacturer, where these problems are addressed simultaneuosly by harmonizing the datasets from different sources using RDF, and by linking them together into a global repository using shared resources. Based on the global RDF store, services for both human and machine users, such as a faceted search engine and a SPARQL end-point, can be provided to support access from different perspectives to the company knowledge base.
Eero Hyvönen, Jouni Tuominen, Tomi Kauppinen, Jari Väätäinen: Representing and Utilizing Changing Historical Places as an Ontology Time Series. Geospatial Semantics and Semantic Web: Foundations, Algorithms, and Applications (Naveen Ashish and Amit Sheth (eds.)), pp. 1-25, Springer-Verlag, 2011. Book chapter. bib pdf link
Matias Frosterus, Eero Hyvönen and Joonas Laitio: Creating and Publishing Semantic Metadata about Linked and Open Datasets. AAAI Fall Symposium 2011, Open Government Knowledge: AI Opportunities and Challenges, Arlington, USA, November, 2011. bib pdf
We present a comprehensive system for producing interoperable metadata for Linked Open datasets and governmental datasets published in various formats.
Matias Frosterus, Eero Hyvönen, Joonas Laitio: Creating and Publishing Semantic Metadata about Linked and Open Datasets. Linking Government Data (David Wood (ed.)), Springer-Verlag, November, 2011. bib link
Eero Hyvönen: Linked Open Aalto, Project Proposal. Aalto University, Department of Media Technology, November, 2011. bib pdf
Linked Open Aalto is a research project aiming at developing a semantic web approach for creating and publishing interlinked educational, research, and managerial contents produced at different communities, schools, departments, research groups, and persons in Aalto. By using semantic Linked (Open) Data principles, technologies, and open datasets available, Aalto contents can be interlinked with related teaching and research materials in Finland and internationally. By aggregating and combining local contents from separate incompatible data silos and systems, the end-user can be provided with a global, cross-disciplinary perspective to knowledge produced in Aalto and other universities. For example, a web page describing a course can be interlinked automatically with related research results, publications, projects, Wikipedia pages, research groups, researchers, internationally available video lectures, open course materials, events in Aalto, conferences, blog discussions, and so on.
Alexander García Castro, Ken Baclawski, John Bateman, Christoph Lange and Kim Viljanen (eds.): . Proceedings of the ISWC 2011 Workshop Ontologies Come of Age in the Semantic Web (OCAS), CEUR Workshop Proceedings, Vol 809, http://ceur-ws.org, ISSN 1613-0073, October, 2011. bib link
Matias Frosterus, Eero Hyvönen and Mika Wahlroos: Extending Ontologies with Free Keywords in a Collaborative Annotation Environment. Proceedings of the ISWC 2011 Workshop Ontologies Come of Age in the Semantic Web (OCAS), CEUR Workshop Proceedings, Vol 809, http://ceur-ws.org, ISSN 1613-0073, Bonn, Germany, October, 2011. bib pdf
Semantic web technologies have introduced the idea of annotating content in terms of concepts taken from ontologies. Since concepts are defined in terms of properties and relations to other concepts, descriptions grow up into larger RDF graphs that can be used as a basis for data integration and intelligent information retrieval. Since ontologies do not typically contain all the possible concepts needed for annotation, it is usually necessary to offer the annotator the possibility to introduce new free keywords or tags in addition to the predefined ontology concepts. The problem then is that free keywords/tags do not have ontological connections to the rest of the RDF graph, unless such relations are defined by the annotator.We present a process for integrating free keywords into the ontological framework, and a practical tool implementation of it, discussing the challenges and possibilities introduced by the system. We also describe a case study performed for the Finnish Defence Forces, where the tool is used for creating a faceted semantic search portal featuring the free keywords and the ontological concepts at the same time.
A. Thalhammer, T. Ermilov, K. Nyberg, A Santoso and J. Domingue: MovieGoer - Semantic Social Recommendations and Personalized Location-based Offers. The 10th International Semantic Web Conference (ISWC 2011), poster papers, Bonn, Germany, Oct, 2011. bib pdf
Sini Pessala, Katri Seppälä, Osma Suominen, Matias Frosterus, Jouni Tuominen and Eero Hyvönen: MUTU: An Analysis Tool for Maintaining a System of Hierarchically Linked Ontologies. ISWC 2011 - Ontologies come of Age Workshop (OCAS), Bonn, Germany, October, 2011. bib pdf
We consider ontology evolution in a system of light-weight Linked Data ontologies, aligned with each other to form a larger ontology system. When one ontology changes, the human editor must keep track of the actual changes and of the modifications needed in the related ontologies in order to keep the system consistent. This paper presents an analysis tool MUTU, by which such changes and their potential effects on other ontologies can be found. Such an analysis is useful for the ontology editors for understanding the differences between ontology versions, and for updating linked ontologies when changes occurred in other components of an ontology system.
Nina Laurenne, Jouni Tuominen, Katharina Schleidt and Eero Hyvönen: Observing observations - an ontology-based approach for improving the reliability of biodiversity data. TDWG 2011 Annual Conference of the Taxonomic Databases Working Group, New Orleans, Lousiana, USA, October, 2011. Poster abstract. bib pdf
Nina Laurenne, Jouni Tuominen and Eero Hyvönen: Radiation of beetles into cyberspace - two case studies of modelling taxonomic information. TDWG 2011 Annual Conference of the Taxonomic Databases Working Group, New Orleans, Lousiana, USA, October, 2011. Poster abstract. bib pdf
Joonas Laitio: Semantic Web Data Quality Control. MSc Thesis, Aalto University, School of Electrical Engineering, Degree Programme of Automation and Systems Technology, October, 2011. bib pdf
Data quality is a growing concern on the Semantic Web. The amount of data available is growing faster than ever, and the emphasis thus far has been on creating and interlinking data without much regard to how good the data actually is. The trend is shifting from creating new data to refining what already exists. Data quality is a subjective concept and a formal representation for it is often troublesome. First, we must define what is meant by data quality - what are the different facets of the concept. Second, a way for representing this quality must be found. Third, actual processes to refine data and improve its quality and ways to take data quality into account on the Semantic Web must be developed. This work presents some solutions to the problem. Many ways to annotate quality metadata as RDF are first discovered, along with their pros and cons. A framework for managing RDF-based quality metadata is presented, with a set of tools for specifically managing the quality annotations. Additionally, an automatic annotation system and a schema validation system, within the restraints of the open world assumption, have been designed, implemented and integrated into the framework. The system has been tested using real life datasets with promising first results.
Nina Laurenne, Jouni Tuominen, Arto Mertaniemi, Hannu Saarenmaa and Eero Hyvönen: LSID versus HTTP URI: Two approaches and e-infrastructures for managing information about taxon names. EIM 2011 Conference, Santa Barbara, California, USA, September, 2011. Poster abstract. bib pdf
Reetta Sinkkilä, Osma Suominen and Eero Hyvönen: Automatic Semantic Subject Indexing of Web Documents in Highly Inflected Languages. Proceedings of the 8th Extended Semantic Web Conference (ESWC 2011), pp. 215-229, Springer-Verlag, Heraklion, Greece, June, 2011. bib pdf
Structured semantic metadata about unstructured web documents can be created using automatic subject indexing methods, avoiding laborious manual indexing. A succesful automatic subject indexing tool for the web should work with texts in multiple languages and be independent of the domain of discourse of the documents and controlled vocabularies. However, analyzing text written in a highly inflected language requires word form normalization that goes beyond rule-based stemming algorithms. We have tested the state-of-the art automatic indexing tool Maui on Finnish texts using three stemming and lemmatization algorithms and tested it with documents and vocabularies of different domains. Both of the lemmatization algorithms we tested performed significantly better than a rule-based stemmer, and the subject indexing quality was found to be comparable to that of human indexers.
Jouni Tuominen, Nina Laurenne and Eero Hyvönen: Biological Names and Taxonomies on the Semantic Web - Managing the Change in Scientific Conception. Proceedings of the 8th Extended Semantic Web Conference (ESWC 2011), Springer-Verlag, Heraklion, Greece, June, 2011. bib pdf
Biodiversity management requires the usage of heterogeneous biological information from multiple sources. Indexing, aggregating, and finding such information is based on names and taxonomic knowledge of organisms. However, taxonomies change in time due to new scientific findings, opinions of authorities, and changes in our conception about life forms. Furthermore, organism names and their meaning change in time, different authorities use different scientific names for the same taxon in different times, and various vernacular names are in use in different languages. This makes data integration and information retrieval difficult without detailed biological information. This paper introduces a meta-ontology for managing the names and taxonomies of organisms, and presents three applications for it: 1) publishing biological species lists as ontology services (ca. 20 taxonomies including more than 80,000 names), 2) collaborative management of the vernacular names of vascular plants (ca. 26,000 taxa), and 3) management of individual scientific name changes based on research results, covering a group of beetles. The applications are based on the databases of the Finnish Museum of Natural History and are used in a living lab environment on the web.
Kim Viljanen, Jouni Tuominen, Eetu Mäkelä and Eero Hyvönen: Combining Distributed Ontology Repositories into a Global Service. June, 2011. Draft paper. bib pdf
Ontologies and vocabularies are a key resource for creating interoperable metadata on the Semantic Web. To make finding and using ontologies easier, the idea of Ontology Repositories has been introduced with current implementations including e.g. the NCBO Bioportal, ONKI and Cupboard. There is a genuine need for different kinds of Ontology Repositories, each focusing on different kinds specific user-needs, different ontologies and different organizational requirements which cannot be addressed by a single general implementation. However, at the moment each Ontology Repository is a separate island with its own user interfaces and APIs. They also use varying ontology languages such as OWL, SKOS, and RDF Schema. Due to this, global search, browsing, and inference over the repositories is difficult and generally not done which means that, for example, finding and reusing existing ontologies becomes difficult. To address the problems, we have developed a loosely coupled Network of Ontology Repositories (NOR) architecture that makes the repositories globally interoperable while maintaining their unique functionalities and strengths. To participate in the network, each ontology repository is required to implement a shared API. As a proof-of-concept evaluation, we present three case implementations demonstrating different aspects of the NOR approach: 1) internal distributed architecture of ONKI, 2) global search of ONKI and NCBO Bioportal, 3) publishing non-ontological concept collections as NOR endpoints, demonstrated with the semantic portal CultureSampo and the metadata editor SAHA.
Matias Frosterus, Eero Hyvönen and Joonas Laitio: DataFinland - A Semantic Portal for Open and Linked Dataset. Proceedings of the 8th Extended Semantic Web Conference (ESWC 2011), pp. 243-254, Springer-Verlag, Heraklion, Greece, June, 2011. bib pdf link
The number of open datasets available on the web is increasing rapidly with the rise of the Linked Open Data (LOD) cloud and various governmental efforts for releasing public data in different formats, not only in RDF. The aim in releasing open datasets is for developers to use them in innovative applications, but the datasets need to be found first and metadata available is often minimal, heterogeneous, and distributed making the search for the right dataset often problematic. To address the problem, we present DataFinland, a semantic portal featuring a distributed content creation model and tools for annotating and publishing metadata about LOD and non-RDF datasets on the web. The metadata schema for DataFinland is based on a modified version of the voiD vocabulary for describing linked RDF datasets, and annotations are done using an online metadata editor SAHA connected to ONKI ontology services providing a controlled set of annotation concepts. The content is published instantly on an integrated faceted search and browsing engine HAKO for human users, and as a SPARQL endpoint and a source file for machines. As a proof of concept, the system has been applied to LOD and Finnish governmental datasets.
Kaisa Hypén and Eetu Mäkelä: An ideal model for an information system for fiction and its application: Kirjasampo and Semantic Web. Library Review, vol. 60, no. 4, April, 2011. bib link
Purpose – Library Director Jarmo Saarti introduced a wide or ideal model for fiction in literature in his dissertation, published in 1999. It introduces those aspects that should be included in an information system for fiction. Such aspects include literary prose and its intertextual references to other works, the writer, readers and critics receptions of the work as well as a researcher s view. It is also important to note how libraries approach a literary work by means of inventory, classification and content description. The most ambiguous of the aspects relates to that context in cultural history, which the work reflects and is a part of. The paper aims to discuss these issues. Design/methodology/approach – Since the model consists of several components which are not found in present library information systems and cannot be implemented by them, a new way had to be found to produce, save, process and present fiction‐related metadata. The Semantic Computing Research Group of Aalto University has developed several Semantic Web services for use in the field of culture, so cooperation with it and the use of Semantic Web tools were a natural starting point for the construction of the new service. Kirjasampo will be based on the Semantic Web RDF data model. The model enables a flexible linking of metadata derived from different sources, and it can be used to build a Semantic Web that can be approached contextually from different angles. Findings – The “semantically enriched” ideal model for fiction has hence been realised, at least to some extent: Kirjasampo supports literature‐related metadata that is more varied than earlier and aims to account for different contexts within literature and connections with regard to other cultural phenomena. It also includes contemporary reviews of works and, as such, readers receptions as well. Modern readers can share their views on works, once the user interface of the server is completed. It will include several features from the Kirjasto 2.0‐application, which enables the evaluation, description and recommendations of works. The service should be online by the end of Spring 2011. Research limitations/implications – The project involves novel collaboration between a public library and a computer science research unit, and utilises a novel approach to the description of fiction. Practical implications – The system encourages user participation in the description of fiction and is of practical benefit to librarians in understanding both how fiction is organised and how users interpret the same. Originality/value – Upon completion, the service will be the first Finnish information system for libraries built with the tools of the Semantic Web which offers a completely new user environment and application for data produced by libraries. It also strives to create a new model for saving and producing data, available to both library professionals and readers. The aim is to save, accumulate and distribute literary knowledge, experiences and silent information.
Jussi Kurki: Toimijaontologia ja niiden käyttö semanttisessa webissä (Actor Ontologies and Their Usage on the Semantic Web. MSc Thesis, University of Helsinki, Department of Computer Science, March, 2011. bib pdf
Katariina Nyberg: Document Classification Using Machine Learning and Ontologies. MSc Thesis, Aalto University, School of Science, Degree Programme of Information Networks, January, 2011. bib pdf
This master s thesis explores a way in which documents can be automatically classified based on their contents. Automatic classification of data is one of the main applications of machine learning. With the help of already classified data a model for the most likely class can be learned. Whether adding background knowledge from ontologies can be added to the model in order to improve the classification accuracy, is also explored in this master s thesis. A new machine learning model is introduced that incorporates ontology information. The proposed method for learning a classification model and enhancing it with ontology information is used in a case study for the Finnish National Archives and a set of digital documents that have been manually classified. An RDF schema for representing documents, sentences and words is created in order to prepare tha data for the machine learning analysis. The words are put into base form and matched semi-automatically with concepts of the General Finnish Ontology YSO. Then the ontology enhanced model is applied on the data and the most likely classes for documents are learned. The master s thesis shows that the classification accuracy of the model increases when ontology information is added to it.
Lyndon Nixon, Stamatia Dasiopoulou, Jean-Pierre Evain, Eero Hyvönen, Ioannis Kompatsiaris and Raphael Troncy: Multimedia, Broadcasting and eCulture. Handbook of Semantic Web Technologies (John Domingue, Dieter Fensel and James Hendler (eds.)), Springer-Verlag, January, 2011. bib link
2010
Tomi Kauppinen, Panu Paakkarinen, Eetu Mäkelä, Heini Kuittinen, Jari Väätäinen and Eero Hyvönen: Geospatio-temporal Semantic Web for Cultural Heritage. Digital Culture and E-Tourism: Technologies, Applications and Management Approaches, 2010. bib pdf link
People frequently need to find knowledge related to places when they plan a leisure trip, when they are executing that plan in a certain place, or when they want to virtually explore a place they have visited in the past. In this chapter we present and discuss a set of methods for searching and browsing spatiotemporally referenced knowledge related to cultural objects, e.g. artifacts, photographs and visiting sites. These methods have been implemented in the semantic cultural heritage portal CULTURESAMPO that offers map-based interfaces for a user to explore hundreds of thousands of content objects and points of interest in Finland. Our goal is to develop and demonstrate novel ways to help the user 1) to decide where to go for a trip, and 2) to learn more about the neighborhoods and points of interest during the visit.
Kaisa Hypén and Eetu Mäkelä: RDF ja FRBRoo: Kirjasammon skeemasta. Informaatiotutkimus, vol. 29, no. 3, 2010. bib pdf
Kim Viljanen, Jouni Tuominen and Eero Hyvönen: A Network of Ontology Repositories. December, 2010. Draft paper. bib pdf
Ontologies and vocabularies are a key resource for creating interoperable metadata on the Semantic Web. To make the finding and using ontologies easier, the idea of Ontology Repositories have been introduced with current implementations including e.g. the NCBO Bioportal, ONKI and Cupboard. However, at the moment each ontology repository is a separate island with its own user interfaces and APIs. They also use varying ontology languages such as OWL, SKOS, RDF Schema and others. Due to this, global search, browsing, and inference over the repositories is difficult and generally not done. At the same time, there is a genuine need for different kinds of Ontology Repositories, each focusing on different kinds specific user-needs, different ontologies and different organizational requirements which can not be addressed by a single global implementation. Since there are benefits of having interoperability among the repositories, we have developed a loosely coupled Network of Ontology Repository (NOR) architecture that makes the repositories globally interoperable while maintaining their unique functionalities and strengths. To participate in the network, each ontology repository is required to implement a shared API. As a proof-of-concept, we present a global metasearch prototype for searching simultaneously hundreds of ontologies in the ONKI and NCBO Bioportal repositories.
Rami Alatalo: Ontologiapalvelun käyttöliittymän jatkokehitys (Further Development of an Ontology Service User Interface). BSc Thesis (in Finnish), Aalto University, School of Science and Technology, Degree Program of Computer Science and Engineering, December, 2010. bib pdf
Eero Hyvönen: Preventing Interoperability Problems Instead of Solving Them. Semantic Web Journal, vol. 1, no. 1-2, pp. 33-37, December, 2010. bib link
Eetu Mäkelä: View-Based User Interfaces for the Semantic Web. Dissertation, Aalto University, School of Science and Technology, Espoo, November, 2010. D.Sc. dissertation. bib pdf
This thesis explores the possibilities of using the view-based search paradigm to create intelligent user interfaces on the Semantic Web. After surveying several semantic search techniques, the view-based search paradigm is explained, and argued to fit in a valuable niche in the field. To test the argument, numerous portals with different user interfaces and data were built using the paradigm. Based on the results of these experiments, this thesis argues that the paradigm provides a strong, extendable and flexible base on which to built semantic user interfaces. Designing the actual systems to be as adaptable as possible is also discussed.
Tomi Kauppinen, Glauco Mantegari, Panu Paakkarinen, Heini Kuittinen, Eero Hyvönen and Stefania Bandini: Determining Relevance of Imprecise Temporal Intervals for Cultural Heritage Information Retrieval. International Journal of Human-Computer Studies, vol. 86, no. 9, pp. 549-560, Elsevier, September, 2010. bib
Nina Laurenne, Jouni Tuominen, Mikko Koho and Eero Hyvönen: Modeling and Publishing Biological Names and Classifications on the Semantic Web. TDWG 2010 Annual Conference of the Taxonomic Databases Working Group, Woods Hole, Massachusetts, USA, September, 2010. Poster abstract. bib pdf
Jouni Tuominen, Matias Frosterus, Nina Laurenne and Eero Hyvönen: Publishing Biological Classifications as SKOS Vocabulary Services on the Semantic Web. TDWG 2010 Annual Conference of the Taxonomic Databases Working Group, Woods Hole, Massachusetts, USA, September, 2010. Demonstration abstract. bib pdf
Nina Laurenne, Jouni Tuominen, Mikko Koho and Eero Hyvönen: Taxon Meta-Ontology TaxMeOn - Towards an Ontology Model for Managing Changing Scientific Names in Time. TDWG 2010 Annual Conference of the Taxonomic Databases Working Group, Woods Hole, Massachusetts, USA, September, 2010. Contributed abstract. bib pdf
Katariina Nyberg, Tapani Raiko, Teemu Tiinanen, and Eero Hyvönen: Document Classification Utilizing Ontologies and Relations between Documents. Proceedings of Eighth Workshop on Mining and Learning with Graphs 2010, Washington D.C., USA, July, 2010. bib pdf
Jussi Kurki and Eero Hyvönen: Collaborative Metadata Editor Integrated with Ontology Services and Faceted Portals. Workshop on Ontology Repositories and Editors for the Semantic Web (ORES 2010), the Extended Semantic Web Conference ESWC 2010, Heraklion, Greece, CEUR Workshop Proceedings, http://ceur-ws.org/, June, 2010. bib pdf
Markus Holi: Crisp, Fuzzy, and Probabilistic Faceted Semantic Search. Dissertation, Aalto University, School of Science and Technology, Espoo, June, 2010. bib link
Kim Viljanen, Jouni Tuominen, Mikko Salonoja and Eero Hyvönen: Global Access to Distributed Ontology Repositories. Poster Papers, the Extended Semantic Web Conference ESWC 2010, Heraklion, Greece, June, 2010. bib pdf
Ontology repository systems are used for publishing and sharing ontologies. However, currently the repositories form separate islands of ontologies, which hinders the user from finding and utilizing the most suitable ontological concepts and ontologies on a global level. In contrast, this paper presents the idea of creating a network of Linked Open Ontology Services (LOOS) based on a set of ontology services that publish their content via a shared API. This facilitates global search and browsing over all ontologies in the network. LOOS has been implemented in the National Finnish Ontology Service ONKI serving currently 79 ontologies.
Kim Viljanen, Jouni Tuominen, Mikko Salonoja and Eero Hyvönen: Linked Open Ontology Services. Workshop on Ontology Repositories and Editors for the Semantic Web (ORES 2010), the Extended Semantic Web Conference ESWC 2010, CEUR Workshop Proceedings, Vol. 596, Heraklion, Greece, June, 2010. bib pdf link
Ontology repository systems are used for publishing and sharing ontologies and vocabularies for content indexing, information retrieval, content integration, and other purposes. However, interlinking these distributed repositories to provide global search and browsing over the repositories has not been made. In the spirit of Linked Open Data, we propose creating a network of Linked Open Ontology Services (LOOS) consisting of ontology repositories that publish their content using a shared API. To test the approach, we have defined an HTTP API and present a proof-of-concept implementation consisting of three client applications that are used for accessing a LOOS network of over 50 ontology servers, part of the Ontology Library Service ONKI.
Eero Hyvönen, Tuomas Palonen, Joeli Takala: Narrative Semantic Web -- Case National Finnish Epic Kalevala. Poster Papers, the 7th Extended Semantic Web Conference, Heraklion, Greece, June, 2010. bib pdf
Jouni Tuominen, Mikko Salonoja, Kim Viljanen and Eero Hyvönen: A User Interface for Ontology Repositories. Workshop on Ontology Repositories and Editors for the Semantic Web (ORES 2010), the Extended Semantic Web Conference ESWC 2010, CEUR Workshop Proceedings, Vol. 596, Heraklion, Greece, June, 2010. bib pdf link
Finding ontologies and concepts from a collection of ontologies is a recurring task in many use cases, such as content indexing, searching, and ontology developing. To facilitate this, efficient search and browsing methods are needed. This paper introduces ONKI2, an ontology browser providing a user interface for a repository of ontologies. The system provides a multi-facet search facility for finding an ontology. Finding concepts is supported by autocompletion-based text search that can be refined with additional restrictions. ONKI2 is in use in the Finnish Ontology Library Service ONKI for a collection of 79 ontologies and vocabularies.
Mathieu d Aquin, Alexander García Castro, Christoph Lange and Kim Viljanen (eds.): Ontology Repositories and Editors for the Semantic Web ORES-2010. Proceedings. CEUR-WS, Heraklion, Greece, May 31, 2010. bib link
Eero Hyvönen: Developing and Using a National Cross-domain Semantic Web Infrastructure. Semantifc Computing (Phillip Sheu, Heather Yu, C. V. Ramamoorthy, Arvind K. Joshi and Lotfi A. Zadeh (eds.)), IEEE Wiley - IEEE Press, May, 2010. bib link
Lora Aroyo, Grigoris Antoniu, Eero Hyvönen, Annette ten Teije, Heiner Stuckenschmidt, Liliana Cabral and Tania Tudorache (eds.): The Semantic Web: Research and Applications. 7th Extended Semantic Web Conference, ESWC 2010. Proceedings, Part I. Springer-Verlag, May/June, 2010. bib link
Lora Aroyo, Grigoris Antoniu, Eero Hyvönen, Annette ten Teije, Heiner Stuckenschmidt, Liliana Cabral and Tania Tudorache (eds.): The Semantic Web: Research and Applications. 7th Extended Semantic Web Conference, ESWC 2010. Proceedings, Part II. Springer-Verlag, May/June, 2010. bib link
Osma Suominen and Eero Hyvönen: Expressing and Aggregating Rich Event Descriptions. Proceedings of the 6th Workshop on Scripting and Development on the Semantic Web, Heraklion, Greece, May, 2010. bib pdf
Publishing information about upcoming events such as concerts and discussion group meetings in a structured format allows the event information to be aggregated, filtered and delivered to potential participants. Making automatic personalized recommendations about events requires structured metadata such as machine-understandable locations and semantic descriptions about the topic and audience of the event. We present a survey of the state of current semantic representation formats for events, including iCalendar and its RDFa and microformat representations, and show that their support for expressing rich structured metadata is limited. We have also tested how well different tools support and understand the formats. Based on the surveys we have implemented a rich event information schema for a health-oriented activity portal and developed an aggregation and validation tool for gathering and processing event information.
Jouni Tuominen: Helppokytkentäiset ontologiapalvelut semanttisessa webissä. MSc Thesis (in Finnish), University of Helsinki, Department of Computer Science, May, 2010. bib pdf link
Ontologiat luovat semanttisen webin perustan: ne toimivat yhteisinä jaettuina käsitteistöinä, joiden avulla tietokoneet voivat käsitellä tietoa älykkäämmin. Jotta eri toimijat voivat hyödyntää yhteisiä käsitteistöjä sovelluksissaan, ontologiat on julkaistava heidän käyttöönsä. Yksinkertaisimmillaan ontologiat voidaan julkaista datana, tiedostomuodossa. Tällöin jokainen toimija joutuu toteuttamaan itse toiminnallisuuksia ontologioiden hyödyntämiseen. Osa toiminnallisuuksista on yleisiä, useissa järjestelmissä toistuvia, kuten ontologian visualisointi, selaaminen ja käsitehaku. On kuitenkin kustannustehokkaampaa toteuttaa yleisiä ontologiatoiminnallisuuksia valmiina palveluina. Palveluita voidaan tarjota ihmiskäyttäjille käyttöliittymäkomponentteina sekä ohjelmalliseen käyttöön rajapintoina, joita käyttämällä toiminnallisuudet voidaan integroida asiakasjärjestelmiin. Lisäksi käytettäessä ontologioita palveluina toimijoiden käytössä on aina ontologioiden ajantasaiset versiot. Tässä tutkielmassa kuvataan ontologioiden käyttäjäryhmien -- ontologioiden kehittäjien, tiedon annotoijien, tiedon hakijoiden ja semanttisen webin sovellusten kehittäjien -- tarpeita sekä esitellään ontologioiden hyödyntämiseen kehitettyjä sovelluksia. Yleisten ontologioiden käyttämiseen liittyvien toiminnallisuuksien tarjoamiseksi esitetään ontologiapalvelu ONKI, joka julkistettiin virallisesti käyttöön syyskuussa 2008.
Eero Hyvönen: Linked Open Data - innovaatio jaetulle avoimelle tiedolle. Tietoasiantuntija-lehti, no. 2-3, May, 2010. bib pdf
Tuukka Ruotsalo: Methods and Applications for Ontology-Based Recommender Systems. Dissertation, Aalto University, School of Science and Technology, Espoo, May, 2010. bib link
Tomi Kauppinen: Methods for Creating and Using Geospatio-temporal Semantic Web. Dissertation, Aalto University, School of Science and Technology, Espoo, April, 2010. bib link
2009
Tuukka Ruotsalo and Eetu Mäkelä: A Comparison of Corpus-Based and Structural Methods on Approximation of Semantic Relatedness in Ontologies. International Journal On Semantic Web and Information Systems, vol. 5, no. 4, pp. 39-56, IGI Global, 2009. bib pdf link
In this paper, the authors compare the performance of corpus-based and structural approaches to determine semantic relatedness in ontologies. A large light-weight ontology and a news corpus are used as materials. The results show that structural measures proposed by Wu and Palmer, and Leacock and Chodorow have superior performance when cut-off values are used. The corpus-based method Latent Semantic Analysis is found more accurate on specific rank levels. In further investigation, the approximation of structural measures and Latent Semantic Analysis show a low level of overlap and the methods are found to approximate different types of relations. The results suggest that a combination of corpus-based methods and structural methods should be used and appropriate cut-off values should be selected according to the intended use case.
Tuukka Ruotsalo, Lora Aroyo and Guus Schreiber: Knowledge-Based Linguistic Annotation of Digital Cultural Heritage Collections. IEEE Intelligent Systems, vol. 24, no. 2, pp. 64-75, IEEE Computer Society, 2009. bib link
William Brace, Tuukka Ruotsalo, Jan Henrik Storgårds, Mikko Villi and Yu Xiao: Life Unwired - The Future of Telecommunication and Networks. Bit Bang, Rays to the Future (Yrjö Neuvo and Sami Ylönen (eds.)), pp. 42-62, Helsinki University Press, 2009. bib
Kai Kuikkaniemi, Ranran Lin, Tuukka Ruotsalo, Sebastian Siikavirta and Yu Xiao: The Future of Media - Free of Fantastic?. Bit Bang, Rays to the Future (Yrjö Neuvo and Sami Ylönen (eds.)), pp. 142-173, Helsinki University Press, 2009. bib
Tomi Kauppinen, Tuukka Ruotsalo, Frédéric Weis, Sylvain Roche, Marco Berni, Eetu Mäkelä, Nima Dokoohaki and Eero Hyvönen: SmartMuseum Knowledge Exchange Platform for Cross-European Cultural Content Integration and Mobile Publication. Proceedings of the CULTURAL HERITAGE on line Empowering users: an active role for user communities, December 15-16, 2009. bib pdf
European museums and other cultural institutions host rich collections that have ability to attract EU citizens and tourists. Cultural objects, e.g. paintings, in these collections are related in many ways and in many cases they refer to same underlying concepts, people and places. The Cultural Heritage Knowledge Exchange Platform, SMARTMUSEUM requires that these collections are interoperable over cultural and language barriers, and provides a mobile publication channel for collections.
Osma Suominen, Eero Hyvönen, Kim Viljanen and Eija Hukka: HealthFinland-a National Semantic Publishing Network and Portal for Health Information. Journal of Web Semantics, vol. 7, no. 4, pp. 287-297, Dec, 2009. bib pdf
Providing citizens with reliable, up-to-date and individually relevant health information on the web is done by governmental, non-governmental, business and other organizations. Currently the information is published with little co-ordination and co-operation between the publishers. For publishers, this means duplicated work and costs due to publishing same information twice on many websites. Also maintaining links between websites requires work. From the citizens point of view, finding content is difficult due to e.g. differences in layman’s vocabularies compared to medical terminology and difficulties in aggregating information from several sites. To solve these problems, we propose as a solution a national scale semantic publishing system HealthFinland which consists of a 1) a centralized content infrastructure of health ontologies and services with tools, 2) a distributed semantic content creation channel based on several health organizations, and 3) an intelligent semantic portal aggregating and presenting the contents from intuitive and health promoting end-user perspectives for human users as well as for other web sites and portals.
Tuomas Palonen, Jouni Hyvönen, Joeli Takala and Eero Hyvönen: Semanttinen Kalevala - Kulttuurisammon taontaa (Semantic Kalevala - Forging the CutureSampo). eLore, vol. 16, no. 2, December, 2009. bib pdf
Suvi Kettula: Semanttisen webin ontologisen tekstiilikäsitteistön kehittäminen ja liittäminen museon luettelointijärjestelmiin. Dissertation, December, 2009. Väitöskirja, Helsingin yliopisto, käsityötieteen laitos. bib link
Innar Liiv, Tanel Tammet, Tuukka Ruotsalo and Alar Kuusik: Personalized Context-aware recommendations in SMARTMUSEUM: Combining Semantics with Statistics. Proceedings of the The Third International Conference on Advances in Semantic Processing (SEMAPRO 2009), IEEE Computer Society, October, 2009. Sliema, Malta. bib pdf
Jussi Kurki and Eero Hyvönen: Authority Control of People and Organizations on the Semantic Web. Proceedings of the International Conferences on Digital Libraries and the Semantic Web 2009 (ICSD2009), Trento, Italy, September, 2009. bib pdf
Authors and documents with identical titles are common in the digital library environment. In order to manage identities correctly, authority control is used by library and information scientists for disam- biguating and cross-referencing entity names. We argue that the benefits of traditional authority control can be enhanced by using techniques and technologies of the Semantic Web, leading to simpler management of multiple languages, better linkability of resources, simpler reuse of au- thority registries in applications, and less work in indexing. To demon- strate our propositions, we have created a prototype of an ontology server and service called ONKI People that is used in two ways: First, it is a centralized authority service providing human end-users with efficient and easy to use authority finding and disambiguation services based on faceted semantic search and visualizations. The services are available on- line also as AJAX and Web Services API for machines to use. Second, the underlying RDF triple store can be used as a content resource in ap- plications such as semantic cultural heritage portals. The paper discusses and demonstrates both use cases in a real life setting.
Matias Frosterus and Eero Hyvönen: Bridging the Search Gap between the Web of Pages and Web of Data by Combining Ontological Document Expansion with Text Search. Proceedings of the International Conferences on Digital Libraries and the Semantic Web 2009 (ICSD2009), Trento, Italy, September, 2009. bib pdf
The Semantic Web extends traditional web documents, i.e. the Web of Pages, with conceptual structures based on ontologies and metadata, i.e. the Web of Data. This paper presents a hybrid document search approach combining the benefits of the traditional text search of literal documents and the semantic search based on their underlying conceptual structures. The approach is based on document expansion, where documents are automatically annotated with not only the concepts explicitly present in a given document, but also with the ontologically related concepts using smaller weights. Our test results using the CLEF Test Suite suggest that document expansion alone achieves better recall than text search at the expense of precision. As a solution, a method of combining document expansion with text search is presented in which better recall was obtained without sacrificing precision. This approach seems promising when integrating unstructured, textual content with the Semantic Web of Data.
Mikko Salonoja: Palveluiden semanttinen kuvailu ja haku. MSc Thesis, Helsinki University of Technology, Department of Automation and Systems Technology, September, 2009. bib pdf
Ihminen tarvitsee kuluttajana ja yrittäjänä usein toisten ihmisten ja organisaatioden tukea palveluiden muodossa. Aiemmin tuntemattomien palveluiden löytäminen voi kuitenkin olla vaikeaa Internetin tiedon määrän lisääntyessä, koska tiedonhakijalle mielekäs sisältö hautautuu helposti epäolennaisen tiedon joukkoon. Tämän ongelman helpottamiseksi semanttisen webin tekniikat tarjoavat uusia mahdollisuuksia. Tässä diplomityössä tutkittiin millaisia palveluiden haun kannalta hyödyllisiä ratkaisuja on aiemmin toteutettu. Tämän jälkeen nämä ratkaisut peilattiin kahden Internetissä sijaitsevan palvelun, PKT-säätiön ylläpitämä Yrityksen palveluhakemisto ja Suomen Asiakastiedon ylläpitämä Aarrepalvelu, parannusehdotuksiksi. Tutkittuja palveluita ehdotettiin tässä diplomityössä parannettavan useilla erilaisilla tavoilla. Näistä osalla ei ollut suoraan mitään tekemistä semanttisen webin tekniikoiden kanssa ja osa taas liittyi semanttiseen webiin hyvin kiinteästi. Erityisesti maantieteellisen tiedon käsittelyssä havaittiin olevan runsaasti kehitettävää molempien tutkittujen palveluiden kohdalla. Myös eroja parannusehdotusten välillä oli havaittavissa. Johtuen Aarre-palvelun suuremmasta tietokannasta ja suuremmasta käyttäjäjoukosta siinä sanojen välisen verkon hyödyntäminen näytti olevan merkityksellisempää kuin Yrityksen palveluhakemiston kohdalla.
Eeva Ahonen and Eero Hyvönen: Publishing Historical Texts on the Semantic Web - A Case Study. Proceedings of the Third IEEE International Conference on Semantic Computing (ICSC2009), Berkeley, CA, USA, September, 2009. bib pdf
Historical texts are an important component of cultural heritage, and are being digitized and published on the web in various portals for the researhers and the public. However, searching and linking them with related contents is challenging due the non-structured text form, digitization errors, and the differences and variations between old and modern language, including historical names (e.g. places), used for querying. This paper addresses these issues by presenting an approach and a system for publishing old texts on the semantic web. As a case study, an existing historical newspaper archive on the web is considered. In our model, semantic metadata is added to the text using automated concept extraction methods. Search is implemented with semantic techniques, by creating a multi-faceted search interface for the text materials. Problems due to OCR errors and spelling variants are addressed with a fuzzy string matching algorithm trying to guess corresponding words in a lexicon, and giving suggestions for corrected words forms. References between texts in the library as well as links between the library and external knowledge sources are formed by using shared ontologies for semantic annotations.
Tuukka Ruotsalo, Eetu Mäkelä, Tomi Kauppinen, Eero Hyvönen, Krister Haav, Ville Rantala, Matias Frosterus, Nima Dokoohaki and Mihhail Matskin: Smartmuseum: Personalized Context-aware Access to Digital Cultural Heritage. Proceedings of the International Conferences on Digital Libraries and the Semantic Web 2009 (ICSD2009), September, 2009. Trento, Italy. bib pdf
This paper presents a semantic recommender method and a system for a personalized access to digital cultural heritage through context-aware user pro- filing. Given annotation knowledge-bases, explicit background knowledge in the form of ontologies, a user model capturing the user’s behavior and context, the system produces recommendations. Ontology-based user profiling can be used to reduce cold-start, sparsity and over-specialization problems. In addition, we present a recommendation retrieval method that is based on the vector space model and uses indices that enable fast and scalable implementation of the system.
Osma Suominen, Kim Viljanen and Eero Hyvönen: TerveSuomi-portaalin metatietomäärittely (HealthFinland portal metadata specification). Jun 18, 2009. bib pdf
Lora Aroyo, Paolo Traverso, Fabio Ciravegna, Philipp Cimiano, Tom Heath, Eero Hyvönen, Riichiro Mizoguchi, Eyal Oren, Marta Sabou and Elena Simperl (eds.): The Semantic Web: Research and Applications, 6th European Semantic Web Conference, ESWC 2009, Heraklion, Crete, Greece, May 31-June 4, 2009, Proceedings. ESWC, Lecture Notes in Computer Science, vol. 5554, Springer, June, 2009. bib
Eero Hyvönen, Eetu Mäkelä, Tomi Kauppinen, Olli Alm, Jussi Kurki, Tuukka Ruotsalo, Katri Seppälä, Joeli Takala, Kimmo Puputti, Heini Kuittinen, Kim Viljanen, Jouni Tuominen, Tuomas Palonen, Matias Frosterus, Reetta Sinkkilä, Panu Paakkarinen, Joonas Laitio, Katariina Nyberg: CultureSampo - A National Publication System of Cultural Heritage on the Semantic Web 2.0. Proceedings of the 6th European Semantic Web Conference (ESWC2009), Heraklion, Greece, May 31 - June 4, 2009. Springer-Verlag. bib pdf
CULTURESAMPO is an application demonstration of a national level publication system of cultural heritage contents on the Web, based on ideas and technologies of the Semantic (Web and) Web 2.0. On the semantic side, the system presents new solutions to interoperability problems of dealing with multiple ontologies of different domains, and to problems of integrating multiple metadata schemas and cross-domain content into a homogeneous semantic portal. A novelty of the system is to use semantic models based on events and narrative process descriptions for modeling and visualizing cultural phenomena, and for semantic recommendations. On the Web 2.0 side, CULTURESAMPO proposes and demonstrates a content creation process for collaborative, distributed ontology and content development including different memory organizations and citizens. The system provides the cultural heritage contents to end-users in a new way through multiple (nine) thematic perspectives, based on semantic visualizations. Furthermore, CULTURESAMPO services are available for external web-applications to use through semantic AJAX widgets.
Tomi Kauppinen, Kimmo Puputti, Panu Paakkarinen, Heini Kuittinen, Jari Väätäinen and Eero Hyvönen: Learning and Visualizing Cultural Heritage Connections between Places on the Semantic Web. Proceedings of the Workshop on Inductive Reasoning and Machine Learning on the Semantic Web (IRMLeS2009), The 6th Annual European Semantic Web Conference (ESWC2009), May 31 - June 4, 2009. bib pdf
Semantic web techniques can be used to relate two things together. However, usually this relation is not accompanied with a measure that would tell how interesting the relation is. Data mining tradition provides interestingness measures; it is natural to try and fit semantic web and data mining traditions together. In this paper we use support and confidence values provided by association rule mining as interestingness measures for relations. The presented method is tailored to location ontologies in order to find out what interesting mutual relations two places have based on annotations in the cultural heritage domain. The method also uses ontology-based reasoning to group places together. We present tests of running the method against a set of over 60,000 annotations in order to find out cultural heritage connections between places.
Jouni Tuominen, Matias Frosterus, Kim Viljanen and Eero Hyvönen: ONKI SKOS Server for Publishing and Utilizing SKOS Vocabularies and Ontologies as Services. Proceedings of the 6th European Semantic Web Conference (ESWC 2009), pp. 768-780, Springer-Verlag, Heraklion, Greece, May 31 - June 4, 2009. bib pdf
Vocabularies are the building blocks of the Semantic Web providing shared terminological resources for content indexing, information retrieval, data exchange, and content integration. Most semantic web applications in practical use are based on lightweight ontologies and, more recently, on the Simple Knowledge Organization System (SKOS) data model being standardized by W3C. Easy and cost-efficient publication, integration, and utilization methods of vocabulary services are therefore highly important for the proliferation of the Semantic Web. This paper presents the ONKI SKOS Server for these tasks. Using ONKI SKOS, a SKOS vocabulary or a lightweight ontology can be published on the web as ready-to-use services in a matter of minutes. The services include not only a browser for human usage, but also Web Service and AJAX interfaces for concept finding, selecting and transporting resources from the ONKI SKOS Server to connected systems. Code generation services for AJAX and Web Service APIs are provided automatically, too. ONKI SKOS services are also used for semantic query expansion in information retrieval tasks. The idea of publishing ontologies as services is analogous to Google Maps. In our case, however, vocabulary services are provided and mashed-up in applications. ONKI SKOS was published in the beginning of 2008 and is to our knowledge the first generic SKOS server of its kind. The system has been used to publish and utilize some 60 vocabularies and ontologies in the National Finnish Ontology Service ONKI www.yso.fi.
Kim Viljanen, Jouni Tuominen and Eero Hyvönen: Ontology Libraries for Production Use: The Finnish Ontology Library Service ONKI. Proceedings of the 6th European Semantic Web Conference (ESWC 2009), pp. 781-795, Springer-Verlag, Heraklion, Greece, May 31 - June 4, 2009. bib pdf
This paper discusses problems of creating and using ontology library services in production use. One approach to a solution is presented with an online implementation--the Finnish Ontology Library Service ONKI--that is in pilot use on a national level in Finland. ONKI contributes to previous research on ontology libraries in many ways: First, mashup and web service support with various tools is provided for cost-efficient utilization of ontologies in indexing and search applications. Second, services covering the different phases of the ontology life cycle are provided. Third, the services are provided and used in real world applications on a national scale. Fourth, the ontology framework is being developed by a collaborative effort by organizations representing different application domains, such as health, culture, and business.
Jouni Tuominen, Tomi Kauppinen, Kim Viljanen and Eero Hyvönen: Ontology-Based Query Expansion Widget for Information Retrieval. Proceedings of the 5th Workshop on Scripting and Development for the Semantic Web (SFSW 2009), 6th European Semantic Web Conference (ESWC 2009), CEUR Workshop Proceedings, Vol. 449, Heraklion, Greece, May 31 - June 4, 2009. bib pdf link
In this paper we present an ontology-based query expansion widget which utilizes the ontologies published in the ONKI Ontology Service. The widget can be integrated into a web page, e.g. a search system of a museum catalogue, enhancing the page by providing a query expansion functionality. We have tested the system with general, domain-specific and spatio-temporal ontologies.
Eero Hyvönen, Eetu Mäkelä, Tomi Kauppinen, Olli Alm, Jussi Kurki, Tuukka Ruotsalo, Katri Seppälä, Joeli Takala, Kimmo Puputti, Heini Kuittinen, Kim Viljanen, Jouni Tuominen, Tuomas Palonen, Matias Frosterus, Reetta Sinkkilä, Panu Paakkarinen, Joonas Laitio, Katariina Nyberg: CultureSampo - Finnish Culture on the Semantic Web 2.0. Thematic Perspectives for the End-user. Proceedings, Museums and the Web 2009, Indianapolis, USA, April 15-18, 2009. bib pdf
We present an overview of CultureSampo, an ambitious system for creating a collective semantic memory of the cultural heritage of a nation on the Semantic Web 2.0, combining ideas underlying the Semantic Web and the Web 2.0. The system addresses the semantic web challenge of aggregating highly heterogeneous, cross-domain cultural heritage collections and other contents into a semantically rich intelligent system for human and machine users. At the same time, CultureSampo is an approach to solve the social and practical Web 2.0 challenge of organizing the underlying collaborative ontology development and content creation work of memory organizations and citizens. This paper focuses on CultureSampo’s search, recommendation, and visualization services for the end-users. The key idea here is to access cultural heritage on the Semantic Web through nine “thematic perspectives”, such as places on the maps, the social network of cultural persons, timelines, and narrative texts, e.g. biographies and literary works.
Eero Hyvönen: Semantic Portals for Cultural Heritage. Handbook on Ontologies (2nd Edition) (Steffen Staab and Rudi Studer (eds.)), Springer-Verlag, April, 2009. bib pdf
Eero Hyvönen, Eetu Mäkelä, Tomi Kauppinen, Olli Alm, Jussi Kurki, Tuukka Ruotsalo, Katri Seppälä, Joeli Takala, Kimmo Puputti, Heini Kuittinen, Kim Viljanen, Jouni Tuominen, Tuomas Palonen, Matias Frosterus, Reetta Sinkkilä, Panu Paakkarinen, Joonas Laitio, Katariina Nyberg: CultureSampo - Finnish Cultural Heritage Collections on the Semantic Web 2.0. Proceedings of the 1st International Symposium on Digital Humanities for Japanese Arts and Cultures (DH-JAC-2009), Ritsumeikan University, Kyoto, Japan, March, 2009. bib pdf
This paper presents an overview of the SemanticWeb 2.0 application CultureSampo, an ambitious system for creating a collective semantic memory of the cultural heritage of a nation on the Semantic Web 2.0, combining ideas underlying the Semantic Web and the Web 2.0. The system addresses the semantic web challenge of aggregating highly heterogeneous, cross-domain cultural heritage content into a semantically rich intelligent system for human and machine users. At the same time, CultureSampo is an approach to solve the social and practical Web 2.0 challenge of organizing the underlying collaborative ontology development and content creation work of memory organizations and citizens.
Lynda Hardman, Jacco van Ossenbruggen, Lora Aroy and Eero Hyvönen: Using AI to Access and Experience Cultural Heritage. IEEE Intelligent Systems, vol. 24, no. 2, IEEE Computer Society, March/April, 2009. bib pdf
Tomi Kauppinen, Heini Kuittinen, Jouni Tuominen, Katri Seppälä and Eero Hyvönen: Extending an Ontology by Analyzing Annotation Co-occurrences in a Semantic Cultural Heritage Portal. Proceedings of the ASWC 2008 Workshop on Collective Intelligence (ASWC-CI 2008) organized as a part of the 3rd Asian Semantic Web Conference (ASWC 2008), Bangkok, Thailand, February 2-5, 2009. bib pdf
Ontologies aim to capture knowledge about things and their relationships. Publishing ontologies on the Semantic Web enables people and organizations to use shared ontologies in annotating e.g. photographs, videos, music, and other types of cultural objects. Search engines also use relationships provided by ontologies in semantic search, e.g. for query expansion or for view-based search. However, building ontologies is a time-consuming process, and it should be helped by automatic finding of interesting, possible relationships. Finding the correct concept for annotation purposes is helped by subsumption and partonomy hierarchies and associative relationships. In this paper we show how an analysis of co-occurrences of concepts in annotations can be used to provide interesting relationships for enriching ontological structures. We use association rule mining techniques and test the idea using a set of annotations of cultural objects in CULTURESAMPO portal and the Finnish General Upper Ontology YSO. The results are visualized in the ONKI SKOS browser to give an additional layer on top of the original relationships of the YSO ontology. An analysis shows that best ranked relationships should also be included in the ontology as subclassof or associative relationships.
2008
Tuukka Ruotsalo, Katri Seppälä, Kim Viljanen, Eetu Mäkelä, Jussi Kurki, Olli Alm, Tomi Kauppinen, Jouni Tuominen, Matias Frosterus, Reetta Sinkkilä and Eero Hyvönen: Ontology-based Approach for Interoperability of Digital Collections. Signum, no. 5, 2008. bib pdf
This paper presents solutions and lessons learned in FinnONTO project carried out in Finland in 2003–2007. The paper focuses on three aspects of interoperability of digital collections. First, transforming thesauri to ontologies. Second, publishing ontologies for the use of indexers and content providers. Third, ontology based methods for improving end user access to digital collections. The first aspect is analysed through case studies done with Finnish thesauri. The second is discussed by presenting the ONKI ontology server. The last aspect is demonstrated in the scope of the semantic portal CultureSampo for publishing cultural heritrage on the Semantic Web.
Katariina Nyberg: Ontologian arviointi OntoClean-menetelmällä Kandidaatintyö, Informaatio- ja luonnontieteiden tiedekunta, Teknillinen Korkeakoulu. BSc Thesis (in Finnish), December 1, 2008. bib pdf
Eero Hyvönen, Suvi Kettula: Kulttuurisampo (CultureSampo. Museo-lehti, no. 4, Museoliitto, Helsinki, Finland, Nov, 2008. bib pdf
Kulttuurisampo kokoaa suomalaisen kulttuurin palapeliä aivan uudella tavalla, kirjoittavat Eero Hyvönen ja Suvi Kettula
Eero Hyvönen: Semanttinen web ja paikkatietoihin perustuvat palvelut (Semantic web and services based on geographical data. Historiaa kunnioittaen, tulevaisuuteen suunnaten. Maanmittaustieteen päivät 2008, Maanmittaustieteiden Seura, julkaisu n:o 45, ss. 8-16, Espoo, Finland, Nov, 2008. bib pdf
Artikkelissa luodaan katsaus kansallisessa Suomalaiset semanttisen webin ontologiat hankkeessa FinnONTO 2003-2007 ja FinnONTO 2.0 2008-2010 kehitettyihin paikkaontologioihin SUO (Suomalainen paikkaontologia) ja SAPO (Suomen ajallinen paikkaontologia), näiden julkaisemiseen AJAX-palveluina Kansallisessa ontologiapalvelussa ONKI, sekä paikkaontologioiden avulla Kulttuurisampo-portaaliin kehitettyihin palveluhin.
Eero Hyvönen, Kim Viljanen, Jouni Tuominen, Katri Seppälä, Tomi Kauppinen, Matias Frosterus, Reetta Sinkkilä, Jussi Kurki, Olli Alm, Eetu Mäkelä and Joonas Laitio: National Ontology Infrastructure Service ONKI. Oct 1, 2008. bib pdf
This paper presents the national level cross-domain ontology and ontology service infrastructure ONKI used in Finland. The novelty of ONKI is based on two ideas. First, the core ontologies are developed collaboratively by experts transforming thesauri into mutually aligned lightweight ontologies, based on a large top ontology that is extended by various domain specific ontologies. Second, the National Ontology Service ONKI has been implemented for publishing ontologies cost-efficiently as ready to use services. ONKI provides legacy and other applications with ready to use functionalities for using ontologies on the HTML level by Ajax and semantic widgets. ONKI has been used in various applications for creating mash-up applications in a way analogous to using Google Maps, but in our case external applications are mashed-up with ontology support for indexing and information retrieval.
Eero Hyvönen, Eetu Mäkelä, Tomi Kauppinen, Olli Alm, Jussi Kurki, Tuukka Ruotsalo, Katri Seppälä Kim Viljanen, Jouni Tuominen, Tuomas Palonen, Matias Frosterus, Reetta Sinkkilä, Panu Paakkarinen, Joonas Laitio, Katariina Nyberg: CultureSampo - A Collective Memory of Finnish Cultural Heritage on the Semantic Web 2.0. Semantic Computing Research Group, Helsinki University of Technology and University of Helsinki, Sept 29, 2008. bib pdf
This paper presents the Semantic Web 2.0 application CULTURESAMPO, an ambitious system of creating a collective semantic memory of the cultural heritage of a nation on the Semantic Web 2.0, combining ideas underlying the Semantic Web and the Web 2.0. The system addresses the semantic challenge of aggregating highly heterogeneous, cross-domain cultural heritage into a semantically rich intelligent system for human and machine users. At the same time, CULTURESAMPO is an approach to solve the social and practical Web 2.0 challenge of organizing the underlying collaborative ontology development and content creation work of memory organizations and citizens.
Osma Suominen, Eero Hyvönen, Kim Viljanen, Eija Hukka: HealthFinland - A National Publication System for Semantic Health Information. Semantic Computing Research Group, Helsinki University of Technology and University of Helsinki, Sept 29, 2008. bib pdf
Eero Hyvönen: Kulttuurisampo - suomalainen kulttuuri semanttisessa webissä. Muistiorganisaatioiden ja kansalaisten yhteisöllinen kansallinen julkaisujärjestelmä (CultureSampo - Finnish culture on the Semantic Web. A national collaborative Semantic Web 2.0 portal for memory organizations and citizens.). Paper presented at the publication event of the CultureSampo portal, National Museum, Helsinki, Finland, Sept 25, 2008. bib pdf
Eero Hyvönen: FinnONTO-malli kansallisen semanttisen webin sisältöinfrastruktuurin perustaksi - visio ja sen toteus (FinnONTO model as the basis for a national semantic web infrastructure - vision and its implementation). Paper presented at the publication event of the Finnish Ontology Library Service ONKI, Espoo, Finland, Sept 12, 2008. bib pdf
Jussi Kurki: Finding People and Organizations on the Semantic Web. AI and Machine Consciousness - Proceedings of the 13th Finnish Artificial Intelligence Conference STeP 2008, Espoo, Finland, August 20-22, 2008. bib pdf
Finding people is essential in finding information. Librarians and information scientists have studied authority control - psychologists and sociologists social networks. In aforementioned, authors link to documents (and co-authors) creating access points to information. In latter, social paths serve as channels for rumours as well as expertise. Key problems include identification and disambiguation of individuals followed by difficulties of tracking the social connections. With semantic web, these aspects can be approached simultaneously. In this paper, we define a simple ontology for describing people and organizations. The model is based on FOAF and other existing vocabularies. We also demonstrate search and visualization tools for finding people.
Jouni Tuominen, Matias Frosterus, Kim Viljanen and Eero Hyvönen: ONKI-SKOS - Publishing and Utilizing Thesauri in the Semantic Web. AI and Machine Consciousness - Proceedings of the 13th Finnish Artificial Intelligence Conference STeP 2008, Espoo, Finland, August 20-22, 2008. bib pdf
Thesauri and other controlled vocabularies act as building blocks of the Semantic Web by providing shared terminology for facilitating information retrieval, data exchange and integration. Representation and publishing methods are needed for utilizing thesauri efficiently, e.g., in content indexing and searching. W3C has provided the Simple Knowledge Organization System (SKOS) data model for expressing concept schemes, such as thesauri. A standard representation format for thesauri eliminates the need for implementing thesaurus specific rules or applications for processing them. However, there do not exist general tools which provide out of the box support for publishing and utilizing SKOS vocabularies in applications, without needing to implement application specific user interfaces for end users. For solving this problem the ONKI-SKOS server is presented.
Reetta Sinkkilä, Eetu Mäkelä, Tomi Kauppinen and Eero Hyvönen: Combining Context Navigation with Semantic Autocompletion to Solve Problems in Concept Selection. First International Workshop on Semantic Metadata Management and Applications, SeMMA 2008, Located at the Fifth European Semantic Web Conference (ESWC 2008), Tenerife, Spain, June 2nd, 2008. Proceedings (Khalid Belhajjame, Mathieu d Aquin, Peter Haase and Paolo Missier (eds.)), CEUR Workshop Proceedings, vol. 346, pp. 61-68, CEUR-WS.org, Tenerife, Spain, June 1-5, 2008. bib pdf
Many tasks on the semantic web require the user to choose concepts from a limited vocabulary e.g. for describing an indexed resource or for use in semantic search. Semantic autocompletion interfaces offer an efficient way for concept selection. However, these interfaces usually do not expose the semantic context of the matched concepts, thereby making it hard to know if a matched concept is the right one, as well as hiding possibly more appropriate choices. Ontology browsers, on the other hand, show context but do not allow quick discovery or embedding into other applications. To lessen these problems, we present an interface combining semantic autocompletion with in-place ontological context navigation. Because required context differs between ontologies, the implementation was designed to make it easy to add different contexts and visualizations. To test the applicability of our idea and implementation the, system was tested on three ontologies with different requirements and structure.
Tomi Kauppinen, Jari Väätäinen and Eero Hyvönen: Creating and Using Geospatial Ontology Time Series in a Semantic Cultural Heritage Portal. S. Bechhofer et al.(Eds.): Proceedings of the 5th European Semantic Web Conference 2008 ESWC 2008, LNCS 5021, Tenerife, Spain, pp. 110-123, Springer-Verlag, June 1-5, 2008. bib pdf
Content annotations in semantic cultural heritage portals commonly make spatiotemporal references to historical regions and places using names whose meanings are different in different times. For example, historical administrational regions such as countries, municipalities, and cities have been renamed, merged together, split into parts, and annexed or moved to and from other regions. Even if the names of the regions remain the same (e.g., “Germany”), the underlying regions and their relationships to other regions may change (e.g., the regional coverage of “Germany” at different times). As a result, representing and finding the right ontological meanings for historical geographical names on the semantic web creates severe problems both when annotating contents and during information retrieval. This paper presents a model for representing the meaning of changing geospatial resources. Our aim is to enable precise annotation with temporal geospatial resources and to enable semantic search and browsing using related names from other historical time periods. A simple model and metadata schema is presented for representing and maintaining geospatial changes from which an explicit time series of temporal part-of ontologies can be created automatically. The model has been applied successfully to representing the complete change history of municipalities in Finland during 1865–2007, and the resulting ontology time series is used in the semantic cultural heritage portal CULTURESAMPO to support faceted semantic search of contents and to visualizing historical regions on overlaying maps originating from different historical eras.
Eero Hyvönen, Eetu Mäkelä, Tuukka Ruotsalo, Tomi Kauppinen, Olli Alm, Jussi Kurki, Joeli Takala, Kimmo Puputti and Heini Kuittinen: CultureSampo-Finnish Culture on the Semantic Web. Posters of the 5th European Semantic Web Conference 2008 (ESWC 2008), Tenerife, Spain, June 1-5, 2008. bib pdf
This paper presents the semantic portal CULTURESAMPO---Finnish Culture on the Semantic Web . The portal provides memory organizations and other cultural content publishers with a national, shared semantic publication channel for heteroge- nous cultural contents. The content comes from over ten organizations and is annotated using various ontologies of the FinnONTO infrastructure. For the end-user, intel- ligent semantic search, recommendation, and visualization services for accessing and learning about cultural heritage are provided.
Tomi Kauppinen, Riikka Henriksson, Reetta Sinkkilä, Robin Lindroos, Jari Väätäinen and Eero Hyvönen: Ontology-based Disambiguation of Spatiotemporal Locations. Proceedings of the 1st international workshop on Identity and Reference on the Semantic Web (IRSW2008), 5th European Semantic Web Conference 2008 (ESWC 2008), CEUR Workshop Proceedings, ISSN 1613-0073, June 1-5, 2008. bib pdf
Geographic place names are semantically often highly ambiguous. For example, there are 491 places in Finland sharing the same name ”Isosaari” (great island) that are instances of several geographical classes, such as Island, Forest, Peninsula, Inhabited area, etc. Referencing unambiguously to a particular ”Isosaari”, either when annotating content or during information retrieval, can be quite problematic and requires usage of advanced search methods and maps for semantic disambiguation. Historical places introduce even more challenges, since historical metadata commonly make spatiotemporal references to historical regions and places using names whose meanings are non-existing or different in different times. This paper presents how these problems have been addressed in a large Finnish place ontology SUO and a historical geo-ontology SAPO. A location ontology server ONKI-Geo has been created for publishing the ontologies and utilizing them as mashup services. To demonstrate the usability of our ontologies, two case applications in the cultural heritage domain are presented.
Kim Viljanen, Jouni Tuominen and Eero Hyvönen: Publishing and Using Ontologies as Mash-Up Services. Proceedings of the 4th Workshop on Scripting for the Semantic Web (SFSW2008), 5th European Semantic Web Conference 2008 (ESWC 2008), CEUR Workshop Proceedings, Vol. 368, Tenerife, Spain, June 1-5, 2008. bib pdf link
The Semantic Web is based on using ontologies for enabling semantically disambiguated data exchange between distributed systems on the web. This requires efficient means for publishing ontologies on the web to ensure the availability, sharing and acceptance of the ontologies. Support services are needed for utilizing ontologies easily and cost-effectively in applications and legacy systems lacking ontology support. To address these vital needs, this paper presents the ONKI ontology service which provides ready-to-use mash-up functionalities, such as semantic disambiguation, concept finding and concept fetching as ready-to-use web widgets for adding ontology support to e.g. HTML forms using JavaScript. Two implementations of the ONKI Server are presented: ONKI-SKOS for ontologies presented in the Simple Knowledge Organization System (SKOS) language and ONKI-Geo for geographical ontologies with a map interface. The presented ONKI systems are operational on the web, used in the National Finnish Ontology Service. They have been successfully used in several pilot applications.
Eero Hyvönen, Kim Viljanen, Jouni Tuominen and Katri Seppälä: Building a National Semantic Web Ontology and Ontology Service Infrastructure - The FinnONTO Approach. Proceedings of the European Semantic Web Conference ESWC 2008, pp. 95-109, Springer, Tenerife, Spain, June, 2008. bib pdf
This article presents the vision and results of creating a national level cross-domain ontology service infrastructure in Finland in the FinnONTO project. The novelty of the infrastructure is based on two ideas. First, a system of open source core ontologies is being developed by transforming thesauri into mutually aligned lightweight ontologies, including a top ontology of 20,000 concepts that is extended by various domain specific ontologies. Second, the ONKI Ontology Server framework for publishing ontologies as ready to use services has been designed and implemented. ONKI provides legacy and other applications with ready to use functionalities for using ontologies on the user interface level as semantic widgets. The idea is to use ONKI for creating mash-up applications in a way analogous to using Google or Yahoo Maps, but in our case external applications are mashed-up with ontology support. The ontology framework presented is operational on the web and is being used in creating the application demonstrations.
Eero Hyvönen, Kim Viljanen, Osma Suominen, Eija Hukka: HealthFinland - Publishing Health Promotion Information on the Semantic Web. Proceedings of DrMED 2008: International Workshop on Describing Medical Web Resources. The 21st International Congress on the European Federation for Medical Informatics (MIE 2008), Göteborg, Sweden, May, 2008. bib pdf
Reetta Sinkkilä: Käsitteen kontekstiperustainen valinta semanttisessa webissä. MSc Thesis, University of Helsinki, Department of Computer Science, May, 2008. bib pdf
Semanttisen webin ideana on kuvailla tietoa siten, että koneet pystyvät ymmärtämään sitä, ja käyttämään älykkäitä tekniikoita tiedon hyödyntämiseksi. Tiedon kuvailemise käytetään ontologioita jotka muodostavat laajoja käsiteverkkoja. Kuvailua varten ontologiasta on läydettävä tarkimmin kohdetta kuvaavat käsitteet. Tässä työssä tutkittiin minkälaisia menetelmiä käsitteenvalintaan on kehitetty ja millä tavoin käsitteenvalintaa voidaan tukea visuaalisin keinoin. Lisäksi perehdyttiin joihinkin semanttisen webin sovelluksiin niiden käsitteenvalinnan osalta. Käsitevalitsimissa havaittiin puutteita liittyen käsitteiden merkityksen disambiguointiin ja siihen, kuinka helppoa niiden avulla on valita paras käsite kuvaamaan tietoa. Ongelmia oli myös tavoissa esittää useiden ontologioiden yhdistelmiä. Näiden havaintojen pohjalta suunniteltiin ja toteutettiin yleinen ja monentyyppisen aineiston käsittelyyn soveltuva käsitevalitsin IRMA
Robin Lindroos: Paikkatiedon ontologiapalvelu. MSc Thesis, Helsinki University of Technology (TKK), May, 2008. bib pdf
Tämä diplomityö käsittelee menetelmiä, joilla paikkatietoaineistoja muunnetaan ontologiseen muotoon sekä esittelee palvelun, ONKI-Paikan, jolla ontologisessa muodossa olevaa paikkatietoa voidaan tuottaa, ylläpitää ja hakea. Palvelu perustuu paikkatiedon mallintamiseen Suomalaisen paikkaontologian SUO:n mukaisesti. Työ koostuu neljästä vaiheesta. Ensimmäisessä vaiheessa selvitetään menetelmä, jolla SUO-ontologia populoidaan paikkainstansseilla. Erityistä huomiota kiinnitetään paikkojen uniikkien tunnisteiden, URI:en luomiseen. Toisessa vaiheessa selvitetään, miten ontologian populointivaiheessa tuotetut paikkojen RDF-kuvaukset on tallennettava. Kolmannessa vaiheessa ratkotaan ontologisessa muodossa olevan paikkatietoaineiston suuren määrän tuomia ongelmia muun muassa kehittämällä paikkatiedon RDF-varastolle indeksointitietokanta nopeita hakuja varten. Neljännessä vaiheessa kehitetään rajapinta hakujen suorittamista varten sekä hakurajapintaa hyödyntävä graafinen, selaimessa toimiva käyttöliittymä. Työ on tehty osana FinnONTO-projektia, jossa kehitettiin suomalaisiin olosuhteisiin räätälöityjä semanttisen webin ontologioita sekä näitä hyödyntäviä palveluita.
Riikka Henriksson, Tomi Kauppinen and Eero Hyvönen: Core Geographical Concepts: Case Finnish Geo-Ontology. Location and the Web (LocWeb) 2008 workshop, 17th International World Wide Web Conference WWW 2008, ACM International Conference Proceeding Series; Vol. 300, Pages 57-60, Beijing, China, April 21-25, 2008. bib pdf
In this paper we examine 1) the scope of geo-ontologies used especially for the purposes of information retrieval on the Web, 2) the core geographical concepts and their mutual relations, and 3) the properties the concepts have. Furthermore, we present the Finnish geo-ontology (Suomalainen paikkaontologia, SUO) and discuss the theories and principles that have governed the development process, as well as the limitations and requirements the use of geographical dictionaries as an instance data source have imposed to the content and the structure of SUO.
Kim Viljanen, Jouni Tuominen, Teppo Känsälä and Eero Hyvönen: Distributed Semantic Content Creation and Publication for Cultural Heritage Legacy Systems. Proceedings of the 2008 IEEE International Conference on Distributed Human-Machine Systems, IEEE Press, Athens, Greece, March 9-12, 2008. bib pdf
Cultural heritage is by nature strongly interlinked, e.g. thematically and historically, but at the same time distributed in heterogeneous collections of different memory organizations at different locations. In order to provide the end-users with aggregated homogeneous views to distributed heterogeneous contents, semantic portals have been created successfully based on metadata and shared (or aligned) ontologies. This paper discusses two problems encountered in such a distributed semantic content creation environment. First, during the content creation work, how could a publisher start using shared ontologies in legacy cataloguing and annotation systems that do not support ontologies. Second, during content publication, how could a publisher re-use the aggregated content in its own legacy publication system, e.g., on the ordinary web pages of a museum or in a collection browser. As a solution, we present the ONKI Ontology Server for adding shared ontological annotation functionalities to legacy cataloguing systems in a practical, cost-efficient and lightweight way. For distributed publishing of the aggregated semantic portal services, we introduce the lightweight mash-up web widget components called floatlets . A major idea behind both the ONKI functionalities and floatlets is that they can be easily integrated with legacy systems on the user interface level, in the same spirit as e.g. Google Maps.
Matias Frosterus: Tekstiaineiston ontologiaperustainen indeksointi ja haku. MSc Thesis, Helsinki University of Technology, Department of Automation and Systems Technology, March, 2008. bib pdf
Informaation lisääntyessä yhteiskunnassa vaaditaan sen tehokasta käsittelyä yhä enemmän ammattilaisten lisäksi myös tavallisilta käyttäjiltä. Tällöin luonnollinen pyrkimys on yksinkertaistaa ja automatisoida tiedonhakuprosessia mahdollisimman paljon, johon semanttisen webin tekniikat tarjoavat uusia mahdollisuuksia. Tässä diplomityössä tutkittiin mahdollisuuksia dokumentin laajentamisen ja ontologisten käsitteiden hyödyntämisen kautta parantaa tiedonhakuprosessia tekstipohjaiseen aineistoon, kuten sanomalehtiarkistoon. Tätä tarkoitusta varten luotiin automaattinen annotointi ja hakusovellus Airo, joka suorittaa jonkin annetun ontologian pohjalta dokumentin laajennuksen. Tämä tapahtuu ontologisella käsiteklusteroinnilla, jossa jonkin käsitteen esiintyminen tekstissä nostaa myös ontologian hierarkiassa läheisten käsitteiden painoa kyseistä dokumenttia indeksoitaessa ja haettaessa. Järjestelmän testit osoittivat, että käsitehaku yhdistettynä sanahakuun laskee haun tarkkuutta, mutta nostaa saantia. Sen sijaan hybridimenetelmä dokumentin- ja kyselyn laajennuksesta, jossa perinteisen sanahaun tuottamien dokumenttien käsitteillä suoritetaan laajentava haku, nosti saantia tarkkuuden kärsimättä. Luotu järjestelmä on ontologiariippumaton ja jokaisen ontologian tuottamat käsitteistykset talletetaan omaan indeksiinsä, jolloin niitä voidaan hakea erikseen.
Osma Suominen: Käyttäjäkeskeinen moninäkymähaku semanttisessa portaalissa. MSc Thesis, University of Helsinki, Department of Computer Science, February, 2008. bib pdf
Tiedonhakuun webissä on kehitetty sanahaun lisäksi rikkaampia tiedonhaku- ja selausmenetelmiä, jotka mahdollistavat tutkivan tiedonhaun. Niistä on hyötyä silloin, kun käyttäjä ei etukäteen tiedä täsmälleen, mitä hän on etsimässä. Yksi tällainen hakukäyttöliittymätyyppi on moninäkymähaku, jossa haun kohteena oleva tietosisältö luokitellaan moniulotteiseksi avaruudeksi fasettiluokituksen periaattein. Käyttöliittymä mahdollistaa aineiston haun ja selaamisen minkä tahansa ulottuvuuden tai niiden yhdistelmän suhteen. Moninäkymähaun kehitys lähti liikkeelle käyttöliittymätutkimuksen piiristä. Moninäkymähakuun perustuvia käyttöliittymiä käytettiin myöhemmin semanttisen webin sovelluksissa ja portaaleissa, joissa kuitenkaan ei samassa määrin huomioitu käyttäjiä suunnitteluprosessin aikana. Tutkielmassa käyttäjäkeskeisiä suunnittelu- ja tiedonjäsennysmenetelmiä sovellettiin terveysaiheista materiaalia sisältävän tervesuomi.fi-portaalin suunnitteluun sekä toteutettiin moninäkymähakua käyttävä portaalin prototyyppi semanttisen webin teknologioiden avulla. Portaalin informaatioarkkitehtuuri suunniteltiin korttienjärjestämismenetelmän avulla ja sen käyttöliittymä rakennettiin käyttäjäkeskeisellä suunnittelu- ja arviointiprosessilla. Tulosten arviointi osoittaa, että käyttäjäkeskeisistä menetelmistä oli merkittävää hyötyä portaalin suunnittelussa. Valmiin prototyypin käytettävyyden arviointi osoitti, että portaalin käytettävyys on käyttäjäkeskeisten suunnittelu- ja arviointimenetelmien ansiosta saatu hyvälle tasolle. Arviointi myös paljasti portaalin aineistoissa, käyttöliittymässä ja tiedon jäsennystavassa ongelmia, joihin terveysportaalin jatkokehityksessä voidaan puuttua.
Antti Vehviläinen, Eero Hyvönen and Olli Alm: A Semi-automatic Semantic Annotation and Authoring Tool for a Library Help Desk Service. Emerging Technologies for Semantic Work Environments: Techniques, Methods, and Applications, IGI Group, Hershey, USA, 2008. bib pdf
2007
Riikka Henriksson and Tomi Kauppinen: Ontologioilla paikkatietojen sisältö hallintaan. Positio 1/2007. Maanmittauslaitos, Paikkatietojen yhteiskäytön edistäminen, 2007. bib pdf
Thomas Häggström: Toimintakeskeisen semanttisen moninäkymähaun toteutus ja evaluointi kulttuurialan portaalisovelluksessa. MSc Thesis, Helsinki University of Technology (TKK), December, 2007. bib pdf
Diplomityön tavoitteena on tutkia semanttisen webin tarjoamia mahdollisuuksia tiedonhaussa. Tähän tavoitteeseen pyrin toteuttamalla tiedonhakujärjestelmän, jossa lukuisista eri museoista peräisin olevat heterogeeniset aineistot ovat haettavissa yhdellä käyttöliittymällä. Työni kirjallisuusosassa käsittelen tiedonhaun teoriaa, metadatan käyttöä ja semanttisen webin toimintaperiaatteita. Kirjallisuusosan yhteydessä paneudun tutkimusryhmässä aikaisemmin tehtyyn tutkimukseen erityisesti sisällönkuvailun ja moninäkymähaun osalta. Työssäni kehitin heterogeenisen aineiston yhdistämisen mahdollistavaa toimintakeskeistä sisällönkuvailun menetelmää ja tietomallia. Mallin varaan toteutin moninäkymähaun periaatteella toimivan tiedonhakujärjestelmän, jolla toimintakeskeisesti kuvailtua aineistoa voidaan hakea. Saadakseni tarkkaa tietoa tiedonhakujärjestelmän toimivuudesta ja soveltuvuudesta suunniteltuun käyttötarkoitukseen, evaluoin valmista tiedonhakujärjestelmää käyttäjäkeskeisen evaluointimallin mukaisesti. Evaluointia varten suunnittelin kokeen, johon kuuluivat hakutehtävät ja tiedon keruun menetelmät, kuten kyselylomakkeet, transaktioloki sekä videokuvan ja äänen kaappaus. Toimintakeskeinen tietomalli osoittautui tiedonhakujärjestelmän sovelluskehityksen aikana toimivaksi ja heterogeenisiä aineistoja yhdistäväksi tietomalliksi. Työssä kehittämäni toimintakeskeinen semanttinen moninäkymähaku sekä aineiston kartta- ja aikajanaprojisointi toimivat täysin toimintakuvausten ja temaattisten roolien varassa. Käyttäjätestit puolestaan todistivat tiedonhakujärjestelmän toimivaksi. Hakujärjestelmä tuki jokaista tehtävätyyppiä ja käyttäjät pitivät arvioissaan järjestelmää uudentyyppisenä, tehokkaana, toimivana ja hyödyllisenä. Vaikka käyttöliittymä tarjosi perinteisen vapaatekstihaun, käytettiin toimintakeskeistä semanttista moninäkymähakua jokaisen tehtävätyypin suorituksen yhteydessä. Käyttäjät olivat melko tyytyväisiä hakujärjestelmän palauttamien dokumenttien relevanssiin. Kartta- ja aikajanaprojektioita pidettiin innovatiivisina ja toimivina lisäominaisuuksina. Käyttäjätestien aikana kritisoitiin moninäkymähaun käytön vaikeutta ja näkymien sisältöä pidettiin vaikeasti ymmärrettävänä. On selvää, että moninäkymähakua on edelleen kehitettävä. Evaluoinnin tulosten perusteella näyttää siltä, että moninäkymähaku ja vapaatekstihaku soveltuvat erityyppisten tehtävien suorittamiseen ja ovat näin ollen toisiaan täydentäviä hakutapoja. Moninäkymähakua hyödynnettiin tehtävissä, joissa lähtötiedot eivät olleet tarkkoja. Tehtävissä, joissa tehtävänannossa annettiin tarkkaa metatietoa, oli tekstihaku hyödyllisempi.
Eetu Mäkelä, Osma Suominen and Eero Hyvönen: Automatic Exhibition Generation Based on Semantic Cultural Content. Proceedings of the Cultural Heritage on the Semantic Web Workshop at the 6th International Semantic Web Conference (ISWC 2007), Busan, Korea, November 12, 2007. bib pdf
In this paper, we argue for a need to shift focus in semantic search from the items themselves to using them as lenses to wider topics. A system for doing this in the cultural heritage domain is presented, duplicating on the web the way exhibitions in the real world are organized. An interface for specifying such exhibitions is presented, combining a general narrative pattern with semantic autocompletion and the novel concept of domain-centric view-based search. This also solves a number of problems view-based search has previously encountered in the cultural heritage domain. Presented also are multiple visualizations for the exhibition, supporting the user in making sense of the data and in doing exploratory search.
Eero Hyvönen, Olli Alm and Heini Kuittinen: Using an Ontology of Historical Events in Semantic Portals for Cultural Heritage. Proceedings of the Cultural Heritage on the Semantic Web Workshop at the 6th International Semantic Web Conference (ISWC 2007), Busan, Korea, November 12, 2007. bib pdf
We argue that an ontology of historical events is needed in semantic portals for cultural heritage due to three reasons. First, ontological identifiers (URIs) of events, such as the World War II or coronation of Napoleon, are needed in order to make collection metadata mutually interoperable in terms of related events---in the vein as identifiers are needed for identifying artifact types, persons, and geolocations when annotating collection items. Second, events are of central importance in creating semantic links between cultural contents in applications such as recommendation systems. Third, historical events are important as content items of their own, forming the backbone of chronological histories.
Tuukka Ruotsalo and Eero Hyvönen: An Event-based Approach for Semantic Metadata Interoperability. Proceedings of the 6th International Semantic Web Conference (ISWC 2007), Busan, Korea, Springer-Verlag, November 11-15, 2007. bib pdf
This paper presents a method for making metadata conforming to heterogeneous schemas semantically interoperable. The idea is to make the knowledge embedded in the schema structures interoperable and explicit by transforming the schemas into a shared, event-based representation of knowledge about the real world. This enables and simplifies accurate reasoning services such as cross-domain semantic search, browsing, and recommending. A case study of transforming three different schemas and datasets is presented. An implemented knowledge-based recommender system utilizing the results in the semantic portal \CS\ was found useful in a preliminary user study.
Eetu Mäkelä, Tuukka Ruotsalo and Eero Hyvönen: Automatic Exhibition Generation Based on Semantic Cultural Content. Poster proceedings of the 6th International Semantic Web Conference, Busan, Korea, November 11-15, 2007. bib pdf
This paper shortly presents an automatic exhibition generation interface that turns the focus of semantic search from search items to the concepts they are annotated with.
Eetu Mäkelä, Reetta Sinkkilä and Eero Hyvönen: Combining Cross-ontology Navigation with Semantic Autocompletion. Poster proceedings of the 6th International Semantic Web Conference, Busan, Korea, November 11-15, 2007. bib pdf
Semantic autocompletion interfaces offer an efficient way for concept selection useful in both search and annotation applications. However, these interfaces usually do not expose the semantic context of the matched concepts, thereby making it hard to know if a matched concept is the right one, as well as hiding possibly more appropriate choices. To lessen these problems, we present an in-place ontological context navigation interface to be used with semantic autocompletion.
Eetu Mäkelä, Tuukka Ruotsalo and Eero Hyvönen: Domain-Centric View-Based Search. Poster proceedings of the 6th International Semantic Web Conference, Busan, Korea, November 11-15, 2007. bib pdf
In current Semantic Web view-based search systems views are formed by selecting properties and enumerating all their values as selections. This approach breaks down with multiple content types, such as in the cultural heritage domain, because the number of differing properties, and therefore views becomes unmanageable. We propose a novel solution termed Domain-Centric View-Based Search, in which views are created based on common property ranges and domain ontologies.
Eetu Mäkelä, Kim Viljanen, Olli Alm, Jouni Tuominen, Onni Valkeapää, Tomi Kauppinen, Jussi Kurki, Reetta Sinkkilä, Teppo Känsälä, Robin Lindroos, Osma Suominen, Tuukka Ruotsalo and Eero Hyvönen: Enabling the Semantic Web with Ready-to-Use Web Widgets. Proceedings of the First Industrial Results of Semantic Technologies Workshop, ISWC2007, pp. 56-69, CEUR Workshop Proceedings, Vol. 293, November 11, 2007. bib pdf link
A lot of functionality is needed when an application, such as a museum cataloguing system, is extended with semantic capabilities, for example ontological indexing functionality or multi-facet search. To avoid duplicate work and to enable easy and cost-efficient integration of information systems with the Semantic Web, we propose a web widget approach. Here, data sources are combined with functionality into readyto-use software components that allow adding semantic functionality to systems with just a few lines of code. As a proof of the concept, we present a collection of general semantic web widgets and case applications that use them, such as the ontology server ONKI, the annotation editor SAHA and the culture portal CultureSampo.
Kim Viljanen, Jouni Tuominen, Eero Hyvönen, Eetu Mäkelä and Osma Suominen: Extending Content Management Systems with Ontological Annotation Capabilities. Poster proceedings of the 6th International Semantic Web Conference, Busan, Korea, November 11-15, 2007. bib pdf
Producing semantic metadata requires efficient methods, e.g., concept finding, for accessing and using ontologies. To add such functionalities to metadata applications such as cataloging systems in museums, we propose a \emphmash-up approach where ready-to-use user interface components for using specific ontologies are made available to be integrated into applications. As a proof-of-concept, we present the \emphOntology Service ONKI wich implements semantic autocompletion concept search and concept browsing for ontologies as shared mash-up components.
Eero Hyvönen, Robin Lindroos, Tomi Kauppinen and Riikka Henriksson: An ontology service for geographical content. Poster Proceedings of the International Semantic Web Conference (ISWC 2007), Busan, Korea, Nov, 2007. bib pdf
Geographic place names are widely used but are semantically often highly ambiguous. For example, there are 491 places in Finland sharing the same name Isosaari (great island) that are instances of several geographical classes, such as Island, Forest, Peninsula, Inhabited area, etc. Referencing unambiguously to a particular Isosaari , either when annotating content or during information retrieval, can be quite problematic and requires usage of advanced search methods and maps for semantic disambiguation. This paper presents an ontology server, ONKI-Paikka, for solving the place finding and place name disambiguation problem. In ONKI-Paikka, places can be found by a faceted search engine, combined with semantic autocompletion and a map service for constraining search and for visualizing results. The service can be connected to legacy applications cost-effectively by using Ajax-technology in the same spirit as Google Maps that is used in ONKI-Paikka as a subservice.
Eero Hyvönen, Tuukka Ruotsalo, Thomas Häggström, Mirva Salminen, Miikka Junnila, Mikko Virkkilä, Mikko Haaramo, Eetu Mäkelä, Tomi Kauppinen and and Kim Viljanen: CultureSampo-Finnish Culture on the Semantic Web: The Vision and First Results (based on the STeP 2006 paper below). In: K. Robering (ed.): Information Technology for the Virtual Museum. LIT Verlag, Berlin., Nov, 2007. bib pdf
This paper concerns the idea of publishing heterogenous cultural content on the Semantic Web. By heterogenous content we mean metadata describing potentially any kind of cultural objects, including artifacts, photos, paintings, videos, folklore, cultural sites, cultural process descriptions, biographies, history etc. The metadata schemas used are different and the metadata may be represented at different levels of semantic granularity. This work is an extension to previous research on semantic cultural portals, such as MuseumFinland, that are usually based on a shared homogeneous schema, such as Dublin Core, and focus on content of similar kinds, such as artifacts. Our experiences suggest that a semantically richer event-based knowledge representation scheme than traditional metadata schemas is needed in order to support reasoning when performing semantic search and browsing. The new key idea is to transform different forms of metadata into event-based knowledge about the entities and events that take place in the world or in fiction. This approach facilitates semantic interoperability and reasoning about the world and stories at the same time, which enables implementation of intelligent services for the end-user. These ideas are addressed by presenting the vision and solution approaches taken in two prototype implementations of a new kind of cross-domain semantic cultural portal “CULTURESAMPO—Finnish Culture on the Semantic Web”.
Eero Hyvönen, Kim Viljanen and Osma Suominen: HealthFinland - Finnish Health Information on the Semantic Web. Proceedings of the 6th International Semantic Web Conference (ISWC 2007), Busan , Korea, Springer-Verlag, Nov, 2007. bib pdf
This paper shows how semantic web techniques can be applied to solving problems of distributed content creation, discovery, linking, aggregation, and reuse in health information portals, both from end-users and content publishers viewpoints. As a case study, the national semantic health portal \HF\ is presented. It provides citizens with intelligent searching and browsing services to reliable and up-to-date health information created by various health organizations in Finland. The system is based on a shared semantic metadata schema, ontologies, and ontology services. The content includes metadata about thousands of web documents such as web pages, articles, reports, campaign information, news, services, and other information related to health.
Kim Viljanen, Jouni Tuominen and Eero Hyvönen: ONKI Ontology Server--Extending Legacy Systems with Ontology Mash-up Services. November, 2007. Draft paper. bib pdf
The Semantic Web is based on using shared ontologies for enabling semantically disambiguated data exchange between distributed systems on the web. This requires, from the ontology publisher s viewpoint, efficient means for publishing ontologies on the web to ensure the availability and acceptance of the ontologies. From the ontology user s viewpoint, support services are needed for utilizing ontologies easily and cost-effectively in the users own systems that are typically legacy systems without ontology support. This paper presents the ONKI ontology server for addressing these vital needs. For the publisher, ONKI provides a server and a Simple Knowledge Organization (SKOS) compatible light-weight ontology browser with ready-made web interfaces for making ontologies available both for human and machine users. For external legacy and other applications, ONKI provides centralized ontology services for semantic disambiguation, concept finding, and concept fetching. A major contribution of ONKI is to provide these services as ready-to-use functionalities for creating mash-up applications very cost-efficiently. Two prototypes of the system---ONKI-SKOS for all kinds of ontologies and ONKI-Geo for geographical ontologies with a map mash-up interface---are operational on the web and are currently being successfully used in several pilot applications.
Jussi Kurki and Eero Hyvönen: Relational Semantic Search: Searching Social Paths on the Semantic Web. Poster Proceedings of the International Semantic Web Conference (ISWC 2007), Busan, Korea, Nov, 2007. bib pdf
This paper presents a system for searching semantic relations between web resources, in our case significant persons of art history. The system is based on the Union List of Artists Names (ULAN) metadata of some 120,000 persons and organizations.
Eero Hyvönen, Joeli Takala, Olli Alm, Tuukka Ruotsalo and Eetu Mäkelä: Semantic Kalevala - Accessing Cultural Contents Through Semantically Annotated Stories. Proceedings of the Cultural Heritage on the Semantic Web Workshop at the 6th International Semantic Web Conference (ISWC 2007), Busan, Korea, Nov, 2007. bib pdf
An event-based approach is presented for annotating events and narrative structures underlying texts and stories semantically. The idea is applied to using the Finnish national epic Kalevala for accessing related cultural contents, such as artifacts, paintings etc. in a semantic portal.
Eero Hyvönen: Älykäs semanttinen web tietämyksenhallinnan rajoja siirtämässä - Esimerkkinä suomalainen kulttuuri semanttisessa webissä (Intelligent Semantic Web - Case Finnish Culture on the Semantic). Rajalla - tiede rajojaan etsimässä (K. Raivio, J. Rydman and A. Sinnemäki (eds.)), Gaudeamus, Helsinki, Nov, 2007. bib pdf
Miikka Junnila, Eero Hyvönen and Mirva Salminen: Describing and Linking Cultural Semantic Content by Using Situations and Actions (based on the STeP 2006 paper below). In: K. Robering (ed.): Information Technology for the Virtual Museum. LIT Verlag, Berlin., Oct, 2007. bib pdf
Eero Hyvönen, Kim Viljanen, Osma Suominen and Eija Hukka: HealthFinland - Publishing Health Promotion Information on the Semantic Web. International Journal of Health Care Engineering, vol. 15, no. 5, pp. 325, oct, 2007. Abstract of a longer paper. bib
Tuukka Ruotsalo and Eero Hyvönen: A Method for Determining Ontology-Based Semantic Relevance. Proceedings of the International Conference on Database and Expert Systems Applications DEXA 2007, Regensburg, Germany, Springer, September 3-7, 2007. bib pdf
Eero Hyvönen, Kim Viljanen, Eetu Mäkelä, Tomi Kauppinen, Tuukka Ruotsalo, Onni Valkeapää, Katri Seppälä, Osma Suominen, Olli Alm, Robin Lindroos, Teppo Känsälä, Riikka Henriksson, Matias Frosterus, Jouni Tuominen, Reetta Sinkkilä and Jussi Kurki: Elements of a National Semantic Web Infrastructure - Case Study Finland on the Semantic Web (Invited paper). Proceedings of the First International Semantic Computing Conference (IEEE ICSC 2007), Irvine, California, September, 2007. IEEE Press. bib pdf
This article presents the vision and results of creating the basis for a national semantic web content infrastructure in Finland in 2003-2007. The main elements of the infrastructure are shared and open metadata schemas, core ontologies, and public ontology services. Several practical applications testing and demonstrating the usefulness of the infrastructure are overviewed in the fields of eCulture, eHealth, eGovernment, eLearning, and eCommerce.
Eero Hyvönen: Semantic Portals for Cultural Heritage. Manuscript draft for a book chapter, Sept, 2007. bib pdf
Olli Alm: Tekstidokumenttien automaattinen ontologiaperustainen annotointi. MSc Thesis, University of Helsinki, Department of Computer Science, September, 2007. bib pdf
Semanttisen Webin perustavana ajatuksena on tuoda Internetiin – tai suppeammassa mielessä hyperlinkitettyyn aineistoon – järjestystä määrittelemällä eksplisiittisiä, koneluettavia käsitteistöjä ja kuvaamalla Internetin sisältämää aineistoa tällä käsitteistöllä. Nämä kaksi työvaihetta kuuluvat keskeisesti Semanttisen Webin ydinalueisiin. Tässä tutkielmassa määritellään Semanttisen Webin liittyvän aineiston kuvailun eli ontologiaperustaisen annotoinnin piirteitä ja toisaalta myös rajoja. Ontologiaperustainen annotointi on aineiston kuvailua, jonka määrittävänä piirteenä on tietomalli. Annotoinnin automatisointi on keskeinen haaste ontologiaperustaisten järjestelmien tuottamisessa, sillä manuaalisesti tehtävä annotointi on yleensä hidasta ja aikaa vievää. Automaattista annotointia edustavien järjestelmien joukko on kirjava, eikä täsmällistä määrittelyä automaattisen annotoinnin ongelmakentästä esiinny kirjallisuudessa. Työssä määritellään automaattisille annotointijärjestelmille malli, jonka avulla voidaan vertailla järjestelmiä toisiinsa ja mallintaa uusia. Mallia sovelletaan työssä ontologiaperustaisten järjestelmien vertailuun ja automaattisen annotointijärjestelmän Pokan, toteuttamisessa.
Onni Valkeapää, Olli Alm and Eero Hyvönen: Efficient Content Creation on the Semantic Web Using Metadata Schemas with Domain Ontology Services (System Description). Proceedings of the European Semantic Web Conference ESWC 2007, Innsbruck, Austria, Springer, June 4-5, 2007. bib pdf
Kim Viljanen, Eero Hyvönen, Eetu Mäkelä, Osma Suominen and Jouni Tuominen: Mash-up Ontology Services for the Semantic Web. Demo track at the European Semantic Web Conference ESWC 2007, Innsbruck, Austria, June 4-5, 2007. bib pdf
We present ONKI ontology server, a mash-up approach for integrating ontology library services with semantic web applications. The idea of ONKI is to provide applications with ready-to-use ontology service functionalities, such as semantic autocompletion, browsing, and annotation support, at the user interface level using AJAX mash-up technologies. The system is being integrated with various semantic web applications.
Olli Alm, Eero Hyvönen and Antti Vehviläinen: Opas: An ontology-based library help desk service. Demo track at the European Semantic Web Conference ESWC 2007, Innsbruck, Austria, June 4-5, 2007. bib pdf
Osma Suominen, Kim Viljanen and Eero Hyvönen: Semantic Faceted Search in a Citizens Health Portal. Demo track at the European Semantic Web Conference ESWC 2007, Innsbruck, Austria, June 4-5, 2007. bib pdf
Tomi Kauppinen, Christine Deichstetter and Eero Hyvönen: Temp-O-Map: Ontology-based Search and Visualization of Spatio-Temporal Maps. Demo track at the European Semantic Web Conference ESWC 2007, Innsbruck, Austria, Springer, June 4-5, 2007. bib pdf
Osma Suominen, Kim Viljanen and Eero Hyvönen: User-centric Faceted Search for Semantic Portals. Proceedings of the European Semantic Web Conference ESWC 2007, Innsbruck, Austria, Springer, June 4-5, 2007. bib pdf
Jari Väätäinen: Ajallisesti muuttuvan paikkatiedon hallinta. Mediatekniikka, EVTEK, May, 2007. bib pdf
Paikannimiä käytetään monissa arkistoissa, museoissa ja tietokannoissa ainoana paikka-tietona. Paikannimistössä ja rajoissa tapahtuu kuitenkin ajan kuluessa muutoksia. Valtioiden rajat muuttuvat, läänien ja maakuntien alueita määritetään uudestaan ja kunnat jakaantuvat tai yhdistyvät toisiinsa tai vaihtavat nimeään. Tietojen hakeminen tämäntyyppisistä kohteista on tuottanut puutteellisia tai virheellisiä tuloksia, ellei hakijalla ole ollut tietoa tapahtuneista aluemuutoksista. Insinöörityön tavoitteena on löytää keinoja hallita ajallisesti muuttuvaa paikkatietoa niin, että tietoihin tehtävien hakujen tarkkuus paranee.Insinöörityöraportissa kuvataan erityyppiset muutokset, joita Suomen kunnissa on niiden olemassaolon aikana tapahtunut. Erilaiset muutokset luokitellaan seitsemään tyyppiin, jotka ovat perustaminen, yhdistyminen, jakaantuminen, nimenmuutos, aluesaanti toisesta maasta ja alueluovutus toiselle maalle ja aluesiirto kahden kunnan välillä. Samaa jakoa voidaan käyttää myös muissa hallinnollisissa alueissa tapahtuneisiin muutoksiin. Kuntien välisten muutosten tietoja käytettiin Geologian tutkimuskeskuksen (GTK) valokuvatietokannassa. Tietokanta toteutettiin Imatch-ohjelman avulla. Kuvat voidaan tietokannassa merkitä siihen kuntaan, jossa ne on kuvattu, mutta kuvia voidaan hakea myös ajallisesti myöhemmän kuntajaon perusteella. Eri aikoina olleita kuntia kuvaavien kuntakategorioiden väliset yhteydet toteutettiin yhdistävien ja poissulkevien loogisten operaattoreiden avulla.Kuntamuutostietoja käytettiin myös valmistettaessa Suomen ajallista paikkaontologiaa osana FinnONTO-projektia. Työssä kuvataan ontologioiden valmistuksen perusteet ja SAPO-ontologian valmistuksessa käytetyt menetelmät, mm. ontologiaversioiden väliset muutossillat ja sijainnin todennäköisyyden laskevan päättelykoneen periaate. Suomen ajallinen ontologia tulee vapaaseen käyttöön ONKI-ontologiakirjaston kautta, ja ensimmäisenä sitä käytetään FinnONTO-projektin semanttisen webin tekniikoita esittelevässä KulttuuriSampo-portaalissa.Alustavien kokeiden perusteella paikkatiedon ajallisten muutosten huomioonottaminen parantaa selvästi hakujen osuvuutta sekä GTK:n valokuvatietokannassa että paljon historiallista aineistoa sisältävässä KulttuuriSampo-portaalissa. Näin ollen näiden uusien ajallisen paikkatiedon hallintaan soveltuvien menetelmien käyttöönottoa voidaan suositella kaikkeen paikannimistön perusteella tapahtuvaan tiedon annotointiin eli nimeämiseen ja hakuun.
Eero Hyvönen, Katri Seppälä, Kim Viljanen and Matias Frosterus: Yleinen suomalainen ontologia YSO -- kohti suomalaista semanttista webiä (General Finnish Ontology YSO--Towards the Finnish Semantic Web). Tietolinja, May, 2007. bib pdf
Robin Lindroos, Tomi Kauppinen, Riikka Henriksson and Eero Hyvönen: ONKI-Paikka: An Ontology Service for Geographical Data. Helsinki, Apr, 2007. bib pdf
Petri Lindgren: Yhteiskäyttöisten käsitteiden kuvauslogiikkaperusteinen määrittely webissä. MSc Thesis, University of Helsinki, Department of Computer Science, April, 2007. bib pdf
Osma Suominen, Kim Viljanen, Eero Hyvönen, Markus Holi and Petri Lindgren: TerveSuomi.fi:n metatietomäärittely (TerveSuomi.fi metadata specification). Jan 26, 2007. bib pdf
Tomi Kauppinen and Eero Hyvönen: Modeling and Reasoning about Changes in Ontology Time Series. Ontologies: A Handbook of Principles, Concepts and Applications in Information Systems (Rajiv Kishore, Ram Ramesh and Raj Sharman (eds.)), Integrated Series in Information Systems, pp. 319-338, Springer-Verlag, New York (NY), January 15, 2007. bib pdf
Ville Komulainen: Public Services for Ontology Library Systems. MSc Thesis, University of Helsinki, Department of Computer Science, January, 2007. bib pdf
Onni Valkeapää, Olli Alm and Eero Hyvönen: A Framework for Ontology-based Adaptable Content Creation on the Semantic Web. Journal of Universal Computer Science, 2007. bib pdf
Creation of rich, ontology-based metadata is one of the major challenges in developing the Semantic Web. Emerging applications utilizing semantic web techniques, such as semantic portals, cannot be realized if there are no proper tools to provide metadata for them. This paper discusses how to make provision of metadata easier and cost-effective by an annotation framework comprising of annotation editor combined with shared ontology services. We have developed an annotation system supporting distributed collaboration in creating annotations, and hiding the complexity of the annotation schema and the domain ontologies from the annotators. Our system adapts flexibly to different metadata schemas, which makes it suitable for different applications. Support for using ontologies is based on ontology services, such as concept searching and browsing, concept URI fetching, semantic autocompletion and linguistic concept extraction. The system is being tested in various practical semantic portal projects.
2006
Markus Holi and Eero Hyvönen: Modeling Uncertainty in Semantic Web Taxonomies. Soft Computing in Ontologies and Semantic Web (Zhongmin Ma (ed.)), Springer-Verlag, 2006. bib pdf
Eero Hyvönen: Semantic Web Applications in the Public Sector in Finland - Building the Basis for a National Semantic Web Infrastructure. 2006. Paper presented at the Norwegian Semantic Days, April 26-27, Stavanger, Norway. bib pdf
Eetu Mäkelä: Harnessing Folksonomies for Search. Proceedings of the Seminar on Web 2.0, Laboratory of Media Technology, Helsinki University of Technology (TKK), December, 2006. bib pdf
This paper analyses folksonomies, an emergent web 2.0 technology. Folksonomies are found to be primarily a social dynamic phenomenon, and several key tensions are hypothesised that keep the folksonomy community vibrant. Strengths and weaknesses of folksonomies are analyzed w.r.t applicability to browsing and search, and suggestions are given on how to alleviate search problems by bringing in additional semantics into folksonomies, while trying to avoid upsetting the delicate social balances discovered.
Mikko Haaramo: Iconclass-luokittelujärjestelmän ontologisointi ja soveltaminen. MSc Thesis (in Finnish), Helsinki University of Technology (TKK), November, 2006. bib pdf
Eetu Mäkelä, Eero Hyvönen and Samppa Saarela: Ontogator -- A Semantic View-based Search Engine Service for Web Applications. Proceedings of the 5th International Semantic Web Conference (ISWC 2006), Nov, 2006. bib pdf
View-based search provides a promising paradigm for formulating complex semantic queries and representing results on the Semantic Web. A challenge for the application of the paradigm is the complexity of providing view-based search services through application programming interfaces (API) and web services. This paper presents a solution on how semantic view-based search can be provided efficiently through an API or as web service to external applications. The approach has been implemented as the open source tool Ontogator, that has been applied successfully in several practical semantic portals on the web.
Antti Vehviläinen, Eero Hyvönen and Olli Alm: A Semi-Automatic Semantic Annotation and Authoring Tool for a Library Help Desk Service. Proceedings of the first Semantic Authoring and Annotation Workshop, November, 2006. bib pdf
Eero Hyvönen, Tuukka Ruotsalo, Thomas Häggström, Mirva Salminen, Miikka Junnila, Mikko Virkkilä, Mikko Haaramo, Eetu Mäkelä, Tomi Kauppinen and and Kim Viljanen: CultureSampo-Finnish Culture on the Semantic Web: The Vision and First Results. Developments in Artificial Intelligence and the Semantic Web - Proceedings of the 12th Finnish AI Conference STeP 2006, October 26-27, 2006. bib pdf
This paper concerns the idea of publishing heterogenous cultural content on the Semantic Web. By heterogenous content we mean metadata describing potentially any kind of cultural objects, including artifacts, photos, paintings, videos, folklore, cultural sites, cultural process descriptions, biographies, history etc. The metadata schemas used are different and the metadata may be represented at different levels of semantic granularity. This work is an extension to previous research on semantic cultural portals, such as MuseumFinland, that are usually based on a shared homogeneous schema, such as Dublin Core, and focus on content of similar kinds, such as artifacts. Our experiences suggest that a semantically richer event-based knowledge representation scheme than traditional metadata schemas is needed in order to support reasoning when performing semantic search and browsing. The new key idea is to transform different forms of metadata into event-based knowledge about the entities and events that take place in the world or in fiction. This approach facilitates semantic interoperability and reasoning about the world and stories at the same time, which enables implementation of intelligent services for the end-user. These ideas are addressed by presenting the vision and solution approaches taken in two prototype implementations of a new kind of cross-domain semantic cultural portal “CULTURESAMPO—Finnish Culture on the Semantic Web”
Miikka Junnila, Eero Hyvönen and Mirva Salminen: Describing and Linking Cultural Semantic Content by Using Situations and Actions. Developments in Artificial Intelligence and the Semantic Web - Proceedings of the 12th Finnish AI Conference STeP 2006, October 26-27, 2006. bib pdf
Eero Hyvönen, Tomi Kauppinen, Jukka Kortela, Mikko Laukkanen, Tapani Raiko and Kim Viljanen (eds.): Developments in Artificial Intelligence and the Semantic Web - Proceedings of the 12th Finnish AI Conference STeP 2006. Finnish AI Society, Finland, October 26-27, 2006. bib pdf
Eero Hyvönen: FinnONTO-Building the Basis for a National Semantic Web Infrastructure in Finland. Developments in Artificial Intelligence and the Semantic Web - Proceedings of the 12th Finnish AI Conference STeP 2006, October 26-27, 2006. bib pdf
Tomi Kauppinen, Riikka Henriksson, Jari Väätäinen, Christine Deichstetter and Eero Hyvönen: Ontology-based Modeling and Visualization of Cultural Spatio-temporal Knowledge. Developments in Artificial Intelligence and the Semantic Web - Proceedings of the 12th Finnish AI Conference STeP 2006, October 26-27, 2006. bib pdf
Antti Vehviläinen: Ontologiapohjainen kysymys-vastauspalvelu (Ontology-based question-answer service). MSc Thesis, Helsinki University of Technology (TKK), October, 2006. bib pdf
Onni Valkeapää and Eero Hyvönen: A Browser-based Tool for Collaborative Distributed Annotation for the Semantic Web. September 26, 2006. 5th International Semantic Web Conference, Semantic Authoring and Annotation Workshop, November, 2006. bib pdf
Kim Viljanen, Teppo Känsälä, Eero Hyvönen and Eetu Mäkelä: ONTODELLA - A Projection and Linking Service for Semantic Web Applications. Proceedings of the 17th International Conference on Database and Expert Systems Applications (DEXA 2006), Krakow, Poland, pp. 370-376, IEEE, September 4-8, 2006. bib pdf ps
Content in semantic web portals is often projected along application specific navigational taxonomies and linked semantically. This paper presents a logic-based method and a server ONTODELLA for these tasks. We argue that logic rules between the content layer and the application layer add flexibility and better architectural separation of content and functionality. The system has been implemented and applied succesfully in several semantic portals.
Markus Holi and Eero Hyvönen: Fuzzy View-Based Semantic Search. Proceedings of the 1st Asian Semantic Web Conference (ASWC2006), Beijing, China, Springer-Verlag, September 3-7, 2006. bib pdf
Onni Valkeapää: Verkkoresurssien ontologiaperustainen annotointi. MSc Thesis, Helsinki University of Technology, September, 2006. bib pdf
Eero Hyvönen and Eetu Mäkelä: Semantic Autocompletion. Proceedings of the first Asia Semantic Web Conference (ASWC 2006), Beijing, Springer-Verlag, New York, August 4-9, 2006. bib pdf
This paper generalizes the idea of traditional syntactic text autocompletion onto the semantic level. The idea is to autocomplete typed text into ontological categories instead of words in a vocabulary. The idea has been implemented and its application for semantic indexing and content-based information retrieval in multi-facet search is proposed. Four operational semantic portals on the web using the implementation are presented as application cases.
Markus Holi, Eero Hyvönen and Petri Lindgren: Integrating tf-idf Weighting with Fuzzy View-Based Search. Proceedings of the ECAI Workshop on Text-Based Information Retrieval (TIR-06), Riva del Garda, Italy, Aug, 2006. bib pdf
Teppo Känsälä and Eero Hyvönen: A Semantic View-based Portal Utilizing Learning Object Metadata. August, 2006. 1st Asian Semantic Web Conference (ASWC2006), Semantic Web Applications and Tools Workshop. bib pdf ps
Onni Valkeapää and Eero Hyvönen: Semantic Annotation with Browser-based Annotation Tool SAHA. July 17, 2006. Demo paper, 1st Asian Semantic Web Conference (ASWC2006). bib pdf
Markus Holi, Petri Lindgren, Osma Suominen, Kim Viljanen and Eero Hyvönen: TerveSuomi.fi - A Semantic Health Portal for Citizens. July 17, 2006. Poster paper, 1st Asian Semantic Web Conference (ASWC2006). bib pdf
Antti Vehviläinen, Olli Alm and Eero Hyvönen: Combining Case-Based Reasoning and Semantic Indexing in a Question-Answer Service. June 20, 2006. Poster paper, 1st Asian Semantic Web Conference (ASWC2006). bib pdf
Onni Valkeapää and Eero Hyvönen: A Browser-based Semantic Annotation Tool for Distributed Content Creation. June 16, 2006. Poster paper, 1st Asian Semantic Web Conference (ASWC2006). bib pdf
Eetu Mäkelä: View-Based Search Interfaces for the Semantic Web. MSc Thesis, University of Helsinki, June, 2006. bib pdf
This thesis explores the possibilities of using the view-based search paradigm to create intelligent search interfaces on the Semantic Web. After surveying several current semantic search techniques, the view-based search paradigm is explained, and argued to fit in a valuable niche in the field. To test the argument, OntoViews, a semantic view-based search portal creation tool was designed and implemented, and eight portals with five vastly different user interfaces were built using it. Based on the results of these experiments, this thesis argues that the paradigm, particularly as implemented in the OntoViews tool provides a strong, extensible and flexible base on which to built semantic search applications. The particular problems faced in applying view-based search for semantic interfaces are noted, along with explanations on how they were solved in the OntoViews architecture. Finally, directions and ideas for future research are presented for both the paradigm and the implementation architecture, respectively.
Mirva Salminen: Kuvien ja videoiden semanttinen sisällönkuvailu. MSc Thesis (in Finnish), University of Helsinki, May, 2006. bib pdf
Kim Viljanen: Monilähteinen suosittelu semanttisessa webissä. MSc Thesis (in Finnish), University of Helsinki, March 6, 2006. bib pdf
Miikka Junnila: Tietosisältöjen semanttinen yhdistäminen toimintakuvausten avulla. MSc Thesis (in Finnish), University of Helsinki, March 6, 2006. bib pdf
2005
Eero Hyvönen, Eetu Mäkelä, Mirva Salminen, Arttu Valo, Kim Viljanen, Samppa Saarela, Miikka Junnila and Suvi Kettula: MuseumFinland - Finnish Museums on the Semantic Web. Journal of Web Semantics, vol. 3, no. 2, pp. 25, 2005. bib pdf
This article presents the semantic portal MUSEUMFINLAND for publishing heterogeneous museum collections on the Semantic Web. It is shown how museums with their semantically rich and interrelated collection content can create a large, consolidated semantic collection portal together on the web. By sharing a set of ontologies, it is possible to make collections semantically interoperable, and provide the museum visitors with intelligent content-based search and browsing services to the global collection base. The architecture underlying MUSEUMFINLAND separates generic search and browsing services from the underlying application dependent schemas and metadata by a layer of logical rules. As a result, the portal creation framework and software developed has been applied successfully to other domains as well. MUSEUMFINLAND got the Semantic Web Challence Award (second prize) in 2004.
Teemu Sidoroff: Semanttiset portaalit. MSc Thesis (in Finnish), University of Helsinki, 2005. bib pdf ps
Eetu Mäkelä: Survey of Semantic Search Research. Proceedings of the Seminar on Knowledge Management on the Semantic Web, Department of Computer Science, University of Helsinki, 2005. bib pdf
This paper surveys the research field of semantic search, i.e. search utilizing semantic techniques or search of formally annotated semantic content. The survey identifies and discusses various prevalent research directions in se- mantic search, as well as extracts common methodology used in them.
Tomi Kauppinen and Eero Hyvönen: Modeling and Reasoning about Changes in Ontology Time Series. Ontologies in the Context of Information Systems (Rajiv Kishore, Ram Ramesh and Raj Sharman (eds.)), Springer-Verlag, Berlin, Dec, 2005. In press. bib pdf
Tomi Kauppinen, Tuukka Ruotsalo and Mirva Salminen: Tiedon Mallintaminen Semanttisessa Webissä. Systeemityö (in Finnish), vol. 4, Systeemityöyhdistys SYTYKE ry, Helsinki, Dec, 2005. bib pdf
Eero Hyvönen: Kohti suomalaista semanttista webiä - Suomalaisen semanttisen webin ontologiat (FinnONTO)-hankkeen esittely. (in Finnish), Presented at the FinnONTO-symposium, Espoo, Finland, November 16, 2005. bib pdf
Eero Hyvönen, Arttu Valo, Katri Seppälä, Tomi Kauppinen, Ville Komulainen, Tuukka Ruotsalo, Mirva Salminen and Anu Ylisalmi: Creating a National Content and Service Infrastructure for the Semantic Web. Poster paper, 4th International Semantic Web Conference, Nov, 2005. bib pdf
Eero Hyvönen, Arttu Valo, Ville Komulainen, Katri Seppälä, Tomi Kauppinen, Tuukka Ruotsalo, Mirva Salminen and Anu Ylisalmi: Finnish National Ontologies for the Semantic Web - Towards a Content and Service Infrastructure. Proceedings of International Conference on Dublin Core and Metadata Applications (DC 2005), Nov, 2005. bib pdf
Markus Holi and Eero Hyvönen: Modeling Degrees of Overlap in Semantic Web Ontologies. Proceedings of the ISWC Workshop Uncertainty Reasoning for the Semantic Web (Paulo C. G. da Costa, Kathryn B. Laskey, Kenneth J. Laskey and Michael Pool (eds.)), CEUR Workshop Proceedings, Galway, Ireland, Nov, 2005. bib pdf
Teemu Sidoroff and Eero Hyvönen: Semantic E-goverment Portals - A Case Study. Proceedings of the ISWC-2005 Workshop Semantic Web Case Studies and Best Practices for eBusiness SWCASE05, Nov, 2005. bib pdf
Eetu Mäkelä, Kim Viljanen, Petri Lindgren, Mikko Laukkanen and Eero Hyvönen: Semantic Yellow Page Service Discovery: The Veturi Portal. Poster paper, 4th International Semantic Web Conference, Nov, 2005. bib pdf
A prototype semantic yellow page service portal is described. Our idea is to represent service offerings as events and processes in terms of ontologies. Based on versatile semantic descriptions, users can be provided with a flexible view-based search engine enhanced with semantic text autocompletion.
Ville Komulainen, Arttu Valo and Eero Hyvönen: A Tool for Collaborative Ontology Development for the Semantic Web. Proceedings of International Conference on Dublin Core and Metadata Applications (DC 2005), Nov, 2005. bib pdf
Eetu Mäkelä, Eero Hyvönen and Teemu Sidoroff: View-Based User Interfaces for Information Retrieval on the Semantic Web. Proceedings of the ISWC-2005 Workshop End User Semantic Web Interaction, Nov, 2005. bib pdf
This paper argues for using the multi-facet search paradigm as a basis in information retrieval on the Semantic Web. To support the argument, two user interfaces for extant semantic web portals based on the concept of viewhierarchies are presented. The interfaces described reveal and contrast how the view-based paradigm can be applied to support both browsing and searching strategies in information retrieval in applications using different domain and annotation ontologies. New semantics-based user interface elements complementing the basic paradigm are also discussed.
Eero Hyvönen: Miksi asiasanastot eivät riitä vaan tarvitaan ontologioita?. (in Finnish), Oct, 2005. bib pdf
Tomi Kauppinen and Eero Hyvönen: Modeling Coverage Between Geospatial Resources. Posters and Demos at the 2nd European Semantic Web Conference ESWC2005, pp. 49-50, Heraklion, Crete, May 29 - June 1, 2005. (Best Poster Award ESWC 2005). bib pdf
Ville Komulainen, Arttu Valo and Eero Hyvönen: A Collaborative Ontology Development and Service Framework ONKI. Proceeding of ESWC 2005, poster papers, 2005. bib pdf
2004
Eero Hyvönen: = Semanttinen web - mitä se on käytännössä?. (in Finnish), ATK - Tietotekniikkaa yliopistoille, no. 2, pp. 38-42, 2004. bib pdf
Tomi Kauppinen: An Ontology Versioning Framework. MSc Thesis, Department of Computer Science, University of Helsinki, University of Helsinki, Helsinki, Finland, 2004. bib pdf
Tuomas Korpilahti: Architecture for Distributed Development of an Ontology Library. MSc Thesis, Helsinki University of Technology, 2004. bib pdf
Markus Holi: A Method for Modeling Uncertainty in Semantic Web Ontologies. MSc Thesis, University of Helsinki, 2004. bib pdf
Eero Hyvönen: MuseoSuomi - Suomen museot semanttisessa webissä. Järjestelmä museovieraan ja museon näkökulmasta. (in Finnish), 2004. Helsingin yliopisto ja HIIT. bib pdf
Samppa Saarela: Näkymäpohjainen RDF-haku. MSc Thesis (in Finnish), University of Helsinki, 2004. bib pdf
Mikko Apiola: Ontologiaperustainen RDF-annotaatio. MSc Thesis (in Finnish), University of Helsinki, 2004. bib pdf
Eero Hyvönen: Towards National Finnish Semantic Web Ontologies. Helsinki University Library Bulletin, 2004. bib pdf
A. Varis: WordNet sanatietokannan hyödyntäminen luonnollisen kielen sovelluksissa. MSc Thesis (in Finnish), University of Helsinki, 2004. bib pdf
Eero Hyvönen, Mirva Salminen and Miikka Junnila: Annotation of Heterogeneous Database Content for the Semantic Web. Proceedings of the 4th International Workshop on Knowledge Markup and Semantic Annotation (SemAnnot 2004), Nov, 2004. bib pdf
Mikko Laukkanen, Kim Viljanen, Mikko Apiola, Petri Lindgren, Eetu Mäkelä, Samppa Saarela and Eero Hyvönen: Towards Semantic Web-Based Yellow Page Directory Services. Presented at the Third International Semantic Web Conference (ISWC2004), Hiroshima, Japan, Nov, 2004. Poster paper. bib pdf
This paper describes the ongoing work of IWebS (Intelligent Web Services) project, which studies the possibilities of the Semantic Web technology in creating a yellow page directory service for end-users. We propose an ontology-based mechanism for both advertising and finding the services. The essential parts of the system are ontologies for describing and storing service advertisements, a semantic service finder for the enduser, and a semantic service annotation editor for service providers.
Eero Hyvönen: Miksi on vaikeaa tuottaa web-palveluita?. (in Finnish), Helsingin sanomat, Letters to the Editor section, October 26, 2004. bib pdf
Tomi Kauppinen and Eero Hyvönen: Bridging the Semantic Gap between Ontology Versions. Proceedings of the 11th Finnish AI Conference, Web Intelligence Symposium, Conference Series - No 20, vol. 2, pp. 63-72, Finnish Artificial Intelligence Society, Vantaa, Finland, September 1-3, 2004. bib pdf
Eero Hyvönen, Tomi Kauppinen, Mirva Salminen, Kim Viljanen and Pekka Ala-Siuru (eds.): Web Intelligence-Proceedings of the 11th Finnish AI Conference. September 1-3, 2004. bib
Eero Hyvönen, Markus Holi and Kim Viljanen: Designing and Creating a Web Site Based on RDF Content. Proceedings of WWW2004 Workshop, Application Design, Development, and Implementation Issues, May, 2004. bib pdf
Eero Hyvönen, Arttu Valo, Kim Viljanen and Markus Holi: A Logic-Based Semantic Web HTML Generator - A Poor Man s Publishing Approach. Proceedings of WWW2004, New York, Alternate Track Papers and Posters, May, 2004. bib pdf
Markus Holi and Eero Hyvönen: A Method for Modeling Uncertainty in Semantic Web Taxonomies. Proceedings of WWW2004, New York, Alternate Track Papers and Posters, May, 2004. bib pdf
Eetu Mäkelä, Eero Hyvönen, Samppa Saarela and Kim Viljanen: OntoViews - A Tool for Creating Semantic Web Portals. Proceedings of the 3rd International Semantic Web Conference (ISWC 2004), May, 2004. bib pdf
This paper presents a semantic web portal tool ONTOVIEWS for publishing RDF content on the web. ONTOVIEWS provides the portal designer with a content-based search engine server, Ontogator, and a link recommendation system server, Ontodella. The user interface is created by combining these servers with the Apache Cocoon framework. From the end-user s viewpoint, the key idea of ONTOVIEWS is to combine the multi-facet search paradigm, developed within the information retrieval research community, with semantic web RDFS ontologies, and extend the search service with a semantic browsing facility based on ontological reasoning. ONTOVIEWS is presented from the view points of the end user, architecture, and implementation. The implementation described is modular, easily modified and extended, and provides a good practical basis for creating semantic portals on the web. As a proof of concept, application of ONTOVIEWS to a deployed semantic web portal is discussed.
Eero Hyvönen, Miikka Junnila, Suvi Kettula, Eetu Mäkelä, Samppa Saarela, Mirva Salminen, Ahti Syreeni, Arttu Valo and Kim Viljanen: Publishing Museum Collections on the Semantic Web - the MuseumFinland Portal. Proceedings of WWW2004, New York, Alternate Track Papers and Posters, May, 2004. bib pdf
Museum collections contain large amounts of data and semantically rich, mutually interrelated metadata in heterogeneous databases. The publication of museum collections on the web is therefore a very promising application domain for semantic web techniques. We present a semantic web portal called MUSEUMFINLAND - Finnish Museums on the Semantic Web , that contains some 4,000 cultural artifacts from the collections of three museums using three different database schemas and database systems. The system is based on seven RDF(S) ontologies consisting of some 10,000 classes and individuals.
Mikko Laukkanen, Kim Viljanen, Mikko Apiola, Petri Lindgren and Eero Hyvönen: Towards Ontology-Based Yellow Page Services. Proceedings of WWW2004 Workshop, Application Design, Development, and Implementation Issues, May, 2004. bib pdf
Tuomas Korpilahti and Eero Hyvönen: An Architecture for Collaborative Ontology Library Development. Proceedings of 16th European Conference on Artificial Intelligence (ECAI2004), Workshop on Application of Semantic Web Technologies to Web Communities, 2004. bib pdf
Eero Hyvönen, Samppa Saarela and Kim Viljanen: Application of Ontology Techniques to View-Based Semantic Search and Browsing. The Semantic Web: Research and Applications. Proceedings of the First European Semantic Web Symposium (ESWS 2004), pp. 92-106, Springer-Verlag, 2004. bib pdf
Eero Hyvönen, Mirva Salminen, Suvi Kettula and Miikka Junnila: A Content Creation Process for the Semantic Web. Proceedings of OntoLex 2004, 2004. bib pdf
Eero Hyvönen, Samppa Saarela, Kim Viljanen, Eetu Mäkelä, Arttu Valo, Mirva Salminen, Suvi Kettula and Miikka Junnila: A Cultural Community Portal for Publishing Museum Collections on the Semantic Web. Proceedings of 16th European Conference on Artificial Intelligence (ECAI2004), Workshop on Application of Semantic Web Technologies to Web Communities, 2004. bib pdf
This paper presents a deployed semantic web application in the cultural domain: the semantic portal MUSEUMFINLAND. It is a demonstration of a community portal and a publication channel by which heterogeneous collection database contents of different museums can be published on the Semantic Web. By semantic web techniques, it is possible to make collections semantically interoperable and provide the museum visitors with intelligent content-based search and browsing services to the global collection base.
Eero Hyvönen, Miikka Junnila, Suvi Kettula, Eetu Mäkelä, Samppa Saarela, Mirva Salminen, Ahti Syreeni, Arttu Valo and Kim Viljanen: Finnish Museums on the Semantic Web. User s Perspective on MuseumFinland. Proceedings of Museums and the Web 2004 (MW2004), 2004. bib
This paper presents a semantic portal, MuseumFinland, for publishing heterogeneous museum collections on the Semantic Web. The application is presented from the viewpoints of the end-user and the museums providing the contents. By semantic Web techniques, it is possible to make collections semantically interoperable and provide museum visitors with intelligent content-based search and browsing services to the global collection base. By using the MuseumFinland approach, the museums with their semantically rich and interrelated collection content can create consolidated semantic collection portals together on the Web.
Markus Holi and Eero Hyvönen: Probabilistic Information Retrieval Based on Conceptual Overlap in Semantic Web Ontologies. Proceedings of the 11th Finnish AI Confence, Web Intelligence, 2004. bib pdf
Eero Hyvönen, Samppa Saarela, Kim Viljanen, Eetu Mäkelä, Arttu Valo, Mirva Salminen, Suvi Kettula and Miikka Junnila: A Semantic Portal for Publishing Museum Collections on the Web. Proceedings of ECAI/PAIS 2004, 2004. bib pdf
This paper presents the semantic portal MUSEUMFINLAND for publishing museum collections on the Semantic Web. It is shown how museums with their semantically rich and interrelated collection content can create a large, consolidated semantic collection portal together on the web. By semantic web techniques, it is possible to make collections semantically interoparable and provide the museum visitors with inntelligent content-based search and browsing services to the global collection base.
2003
Eero Hyvönen, Samppa Saarela and Kim Viljanen: Ontogator: Combining View- and Ontology-Based Search with Semantic Browsing. Proceedings of XML Finland 2003, Kuopio, Finland, October 30-31, 2003. Paper presented at the international SEPIA Conference, Helsinki, Sept. 18-20, 2003. bib pdf
Eero Hyvönen, Miikka Junnila, Suvi Kettula, Samppa Saarela, Mirva Salminen, Ahti Syreeni, Arttu Valo and Kim Viljanen: Publishing Collections in the Finnish Museums on the Semantic Web Portal - First Results. Proceedings of XML Finland 2003, Kuopio, Finland, October 30-31, 2003. Paper presented at the symposium Arts and Humanities in the Digital Domain: Towards Web based Culture and Science, Salzburg, Austria, Oct. 6-7, 2003. bib pdf
Eero Hyvönen, Arttu Valo, Kim Viljanen and Markus Holi: Publishing Semantic Web Content as Semantically Linked HTML Pages. Proceedings of XML Finland 2003, Kuopio, Finland, October 30-31, 2003. bib pdf
Eero Hyvönen, Suvi Kettula, Vilho Raatikka, Samppa Saarela and Kim Viljanen: Finnish Museums on the Semantic Web. Proceedings of WWW2003, Budapest, Hungary, May, 2003. Poster papers. bib html
Eero Hyvönen, Samppa Saarela, Avril Styrman and Kim Viljanen: Ontology-Based Image Retrieval. Proceedings of WWW2003, Budapest, Hungary, May, 2003. Poster papers. bib html
Eero Hyvönen, Kim Viljanen and Antti Hätinen: Yellow Pages on the Semantic Web. Proceedings of WWW2003, Budapest, Hungary, May, 2003. Poster papers. bib html
2002
Sten Malmlund and Eero Hyvönen: Device, Document, and user profiling on the semantic web. Semantic Web Kick-Off in Finland - Vision, Technologies, Research, and Applications, pp. 153-170, 2002. bib
J. Haajanen, Eero Hyvönen and P. Takala: eBusiness standards for web services. Semantic Web Kick-Off in Finland - Vision, Technologies, Research, and Applications, pp. 171-196, 2002. bib
Eero Hyvönen, Petteri Harjula and Kim Viljanen: Representing metadata about web resources. Semantic Web Kick-Off in Finland - Vision, Technologies, Research, and Applications, pp. 47-76, 2002. bib
Eero Hyvönen: Semantic Web - visio, teknologia, sovellukset. (in Finnish), Systeemityö, no. 1, pp. 30-35, 2002. bib
Eero Hyvönen: Semantic Web - kohti seuraavan polven Internet-palveluja. (in Finnish), Tietoyhteys, no. 3, 2002. bib
Eero Hyvönen: Semantic Web - The New Internet of Meanings. Semantic Web Kick-Off in Finland - Vision, Technologies, Research, and Applications, pp. 3-26, 2002. bib
P. Silvonen and Eero Hyvönen: Semantic web tools. Semantic Web Kick-Off in Finland - Vision, Technologies, Research, and Applications, pp. 137-15, 2002. bib
Vilho Raatikka, K. Salminen and Eero Hyvönen: XML, RDF(S), and Topic Map databases. Semantic Web Kick-Off in Finland - Vision, Technologies, Research, and Applications, pp. 77-110, 2002. bib
Eero Hyvönen, Avril Styrman and Samppa Saarela: Ontology-Based Image Retrieval. Towards the semantic web and web services, Proceedings of XML Finland 2002 Conference, pp. 15-27, Helsinki, Finland, October 21-22, 2002. bib pdf
Vilho Raatikka and Eero Hyvönen: Ontology-Based Semantic Metadata Validation. Towards the semantic web and web services, Proceedings of XML Finland 2002 Conference, pp. 28-40, Helsinki, Finland, October 21-22, 2002. bib pdf
Eero Hyvönen, Suvi Kettula, Vilho Raatikka, Samppa Saarela and Kim Viljanen: Semantic Interoperability on the Web: Case Finnish Museums Online. Towards the semantic web and web services, Proceedings of XML Finland 2002 Conference, pp. 41-53, Helsinki, Finland, October 21-22, 2002. bib pdf
Eero Hyvönen and Mika Klemettinen (eds.): Towards the Semantic Web and Web Services. Proceedings of the XML Finland 2002 Conference. HIIT Publications 2002-03, Helsinki, Finland, October 21-22, 2002. bib pdf
Eero Hyvönen, Kim Viljanen and Antti Hätinen: Yellow Pages on the Semantic Web. Towards the semantic web and web services, Proceedings of XML Finland 2002 Conference, pp. 3-14, Helsinki, Finland, October 21-22, 2002. bib pdf
Eero Hyvönen (ed.): Semantic Web Kick-Off in Finland - Vision, Technologies, Research, and Applications. HIIT Publications 2002-01, May 19, 2002. bib pdf
2001
Eero Hyvönen: Semantic Web - kohti uutta merkitysten Internetiä. (in Finnish), Presentation at the Semantic Web Kick-Off Seminar, University of Helsinki and Helsinki Institute for Information Technology, Helsinki, October 2, 2001. bib pdf
(in total: 561 publications)