Ontologies by SeCo
Ontologies are the basis for creating semantic web applications. As part of SeCo's research projects 2002-, the ontologies below
have been created either based on existing vocabularies, from various databases, or from
scratch.
For ontology services, see
ONKI pages.
Subject matter ontologies: the KOKO ontology cloud
The ontologies below have been created based on existing Finnish keyword thesauri in use,
extending their semantics by manual crafting into RDF Schema.
The numbers in parantheses indicate the approximate number of concepts in the ontologies in 2014.
The idea as well as the thesaurus to ontology transformation process is descibed in (Hyvönen et al., 2008; Seppälä and Hyvönen, 2014).
The ontologies above were also interlinked with YSO and to some extent with each other into the cross-domain KOKO ontology cloud using the YSO ontology as the top ontology (Frosterus et al., 2015, 2013).
Based on Finnish keyword thesauri, these ontologies were originally represented using RDF Scema with subClassOf hierarchies as backbones.
However, as the SKOS recommenation of W3C matured, also SKOS versions were developed using the Skosify tool.
In 2014, the ontologies above were deployed and published in SKOS format by the National Library of Finland as the new Finto.fi service (Suominen et al., 2014).
Finto.fi was based on the
ONKI Light tool that
was later on developed into the Skosmos tool.
Both the ontogies and ONKI Light were results of the national Finnish ontology project FinnONTO 2003-2012, from which the name Finto also originates.
In addition to the keyword thesauri-based ontologies abole, researchers at SeCo have worked on various other ontologies, including those below:
Actor ontologies
Place ontologies
Occupation ontologies
Time ontologies
Time ontologies have been created and used in applications.
Event ontologies
Biological ontologies
SeCo is developing and publishing together with the Finnish Museum of Natural History and
other partners several ontologies/vocabularies of organisms, such as Birds of the World ontology AVIO and
Mammals of the World ontology MAMO.
More information about SeCo biological ontologies.
Other ontologies
In addition, SeCo has created and/or published a variety of additional ontologies
in the ONKI Ontology Service by using technical transformations
or by just re-publishing existing ontologies.
For SeCo's Linked Data publication service see the "7-star"
Linked Data Finland portal.
The datasets there include a variety of SKOS and other ontologies.
The keynote presentation video of the DCMI 2021 confence below gives an overview of our work on developing a national Semantic Web infrastructure in Finland and its applications.
Linked Open Data Intrastructure for Digital Humanities (LODI4DH)
A major reserarch focus of SeCo has been Digital Humanities based on Linked Data, and the work on ontologies is being done under the
initiative Linked Open Data Intrastructure for Digital Humanities
LOD4DH.
FIN-CLARIAH ja EU-DARIAH
In 2020, the national research infrastructure initiative FIN-CLARIAH, coordinated by the Univeristy of Helsinki (HELDIG) and involving several univerities and other organisations in Finland, was accepted on the
reserarch infrastucture roadmap of the Academy of Finland in Finland. Our work on developing and maintaining national ontology and linked open data infrastructures in Finland is part of this initiative.
This work is also the focus of our collaborations with the pan-Eurpean DARIAH infrastructure.
Publications Related to FinnONTO
2023
2022
2021
Eero Hyvönen:
How to Create a National Cross-domain Ontology and Linked Data Infrastructure and Use It on the Semantic Web. Oct, 2021. Keynote presentation for the DCMI 2021 conference.
bib pdf The vision behind the Semantic Wed is to build a global Web of Data (Giant Global Graph, GGG) for machines to use: based on this an interoperable and intelligent transnational WWW for humans can be created cost-efficiently. This keynote presentation for the DCMI 2001 conference addresses this grand challenge on a national level, as in practice much of the data available are often related to each other within national cultures, borders, organizations, and are represented using national languages, metadata models, vocabularies, and local conventions. This presentation overviews and discusses the vision and lessons learned in Finland on developing and deploying a cross-domain national ontology service infrastructure and Linked Open Data (LOD) publishing framework, extending the classic 5-star model to a 7-star model for better data re-usability (6. star) and quality (7. star). To test and demonstrate the infrastructure, a series of semantic portals and LOD services have been created using the Sampo model that has evolved gradually in 2002--2021 through lessons learned when developing and publishing the Sampo series of systems, including MuseumFinland (2004), HealthFinland (2009), CultureSampo (2009), BookSampo (2011), WarSampo (2015), BiographySampo (2018), NameSampo (2019), WarWictimSampo (2019), Mapping Manuscript Migrations (2020), AcademySampo (2021), as well as FindSampo, Law\-Sampo, and ParliamentSampo underway. These systems cover a wide range of application domains and have attracted up to millions of users on the Semantic Web depending on the application, suggesting feasibility of the proposed model. This work shows a shift of focus in research on semantic portals from data aggregation and exploration systems (1. generation systems) to systems supporting research with data analytic tools (2. generation systems), and finally to automatic knowledge discovery and Artificial Intelligence (3. generation systems).
2020
2017
2016
Eero Hyvönen:
Cultural Heritage Linked Data on the Semantic Web: Three Case Studies Using the Sampo Model.
VIII Encounter of Documentation Centres of Contemporary Art: Open linked data and integral management of information in cultural centres, 2016. Artium, Vitoria-Gasteiz, Spain, October 19-20, 2016.
bib pdf A major challenge in publishing linked Cultural Heritage (CH) collections on the web is interoperability. This is due to the heterogeneity of CH contents and the distributed content creation model where publishers focus on their own data with little consideration on the others’ data. As a solution approach, the “Sampo” model is presented based on using domain independent modeling standards, on a model for aligning metadata models, and on sharing domain ontologies for populating the matadata models. The harmonized data is published for machines as a linked data service, to be used by applications for human users. To illustrate and evaluate the model, three online systems on the Web, Culture- Sampo, BookSampo, and WarSampo are presented.
2015
2014
Osma Suominen, Sini Pessala, Jouni Tuominen, Mikko Lappalainen, Susanna Nykyri, Henri Ylikotila, Matias Frosterus and Eero Hyvönen:
Deploying National Ontology Services: From ONKI to Finto.
Proceedings of the Industry Track at the International Semantic Web Conference 2014, CEUR Workshop Proceedings, Riva del Garda, Italy, October, 2014. Vol 1383.
bib pdf link The Finnish Ontology Library Service ONKI was published as a living laboratory prototype for public use in 2008. Its idea is to support content indexers and ontology developers via a browser interface and machine APIs. ONKI has been well-accepted, but being a prototype maintained by the ending research project FinnONTO (2003–2012), a more sustainable service was needed, supported by permanent governmental funding. To achieve this, ONKI was deployed and is being further developed by the National Library of Finland into a new national vocabulary service Finto. We discuss challenges in the deployment of ONKI into Finto and lessons learned during the transition process.
2013
Eetu Mäkelä, Kaisa Hypén and Eero Hyvönen:
Fiction Literature as Linked Open Data - the BookSampo Dataset. Semantic Web – Interoperability, Usability, Applicability, vol. 4, no. 3, pp. 299-306, 2013.
bib pdf link The BookSampo dataset provides information as linked data on fiction literature published in Finland going back to the 15th century, along with rich descriptions of both their content and context. The dataset contains data on nearly 400,000 subjects, including literary works, authors, book covers, reviews, awards, images, and movies, over 3 million triples in total. The data has been applied as the basis of the BookSampo portal in public use in Finland, and is aligned with the cross-domain cultural heritage contents and ontologies of CultureSampo, another in-use semantic portal. The data has been used to answer complex questions, such as what topics should one write about, if one wants to get a literary award (based on statistics). The metadata was transformed into RDF from legacy library databases, then enriched manually by dozens of librarians in a Web 2.0 fashion in Finnish public libraries, and is constantly updated at a rate of some new 90,000 triples monthly.
Sven Buschbeck, Raphael Troncy, Anthony Jameson, Houda Khouf, Adrian Spirescu, Osma Suominen, Tanja Schneeberger and Eero Hyvönen:
Parallel Faceted Browsing.
Proceedings of CHI 2013, Extended Abstracts, Paris, 2013, Association for Computing Machinery (ACM), 2013.
bib pdf
Tuukka Ruotsalo and Matias Frosterus:
Semantic Entity Search Diversification.
Semantic Computing (ICSC), 2013 IEEE Seventh International Conference on, pp. 32-39, Irvine, CA, Sept, 2013.
bib pdf We present an approach to diversify entity search by utilizing semantics present and inferred from the initial entity search results. Our approach makes use of ontologies and independent component analysis of the entity descriptions to reveal direct and latent semantic connections between the entities present in the initial search results. The semantic connections are then used to sample a set of diverse entities. We empirically demonstrate the performance of our approach through retrieval experiments that use a real-world dataset composed from four entity databases. The results indicate that our approach significantly improves both diversity and effectiveness of entity search.
Matias Frosterus, Jouni Tuominen, Sini Pessala, Katri Seppälä and Eero Hyvönen:
Linked Open Ontology Cloud KOKO--Managing a System of Cross-domain Lightweight Ontologies.
The Semantic Web: ESWC 2013 Satellite Events, pp. 296-297, Springer-Verlag, Berlin Heidelberg, Montpellier, France, May 26-30, 2013.
bib pdf
Jouni Tuominen, Nina Laurenne, Mikko Koho and Eero Hyvönen:
The Birds of the World Ontology AVIO.
The Semantic Web: ESWC 2013 Satellite Events, pp. 300-301, Springer-Verlag, Berlin Heidelberg, Montpellier, France, May 26-30, 2013.
bib pdf We present an ontology for managing the scientific and common names of birds. The ontology is based on the TaxMeOn meta-ontology model for biological names. The ontology is in use as an ontology service and it has been applied in a bird watching system.
Jouni Tuominen, Nina Laurenne and Eero Hyvönen:
Publishing and Using Plant Names as an Ontology Service.
Proceedings of the first international Workshop on Semantics for Biodiversity (S4BioDiv), ESWC 2013, CEUR Workshop Proceedings, Vol 979, Montpellier, France, May, 2013.
bib pdf link Animals and plants are referred to using scientific or common names depending on the expertise of an audience or a source of data. The names change in time and therefore their usage as identifiers as such is problematic. We present a solution for managing and using plant names as an ontology. The ontology is based on the TaxMeOn meta-ontology for biological names. In order to refer to organisms unambiguously and publish information as Linked Data on the web, the names are given URIs. The ontology is developed collaboratively and it supports the approval process and temporal tracking of the common names. We introduce an ontology service of plant names for end-users and provide user interfaces and APIs for integrating the ontology into applications.
Tuukka Ruotsalo, Krister Haav, Antony Stoyanov, Sylvain Rochee, Elena Fanid, Romina Deliaic, Eetu Mäkelä, Tomi Kauppinen and Eero Hyvönen:
SMARTMUSEUM: A Mobile Recommender System for the Web of Data. Journal of Web Semantics, vol. 20, pp. 50-67, May, 2013.
bib link Semantic and context knowledge have been envisioned as an appropriate solution for addressing the content heterogeneity and information overload in mobile Web information access, but few have explored their full potential inmobile scenarios, where information objects refer to their physical counterparts, and retrieval is context-aware and personalized for users. We present SMARTMUSEUM, a mobile ubiquitous recommender system for the Web of Data, and its application to information needs of tourists in context-aware, on-site access to cultural heritage. The SMARTMUSEUM system utilizes Semantic Web languages as the form of data representation. Ontologies are used to bridge the semantic gap between heterogeneous content descriptions, sensor inputs, and user profiles. The system makes use of an information retrieval framework where in context data and search result clustering are used in recommendation of suitable content for mobileusers. Results from laboratory experiments demonstrate that ontology-based reasoning, query expansion, search result clustering, and context knowledge lead to significant improvement in recommendation performance. The results from field trials show that the usability of the system meets users’ expectations in real-world use. The results indicate that semantic content representation and retrieval can significantly improve the performance of mobile recommender systems in knowledge-rich domains.
Eero Hyvönen, Miika Alonen, Mikko Koho and Jouni Tuominen:
BirdWatch--Supporting Citizen Scientists for Better Linked Data Quality for Biodiversity Management.
Proceedings of the first international Workshop on Semantics for Biodiversity (S4BioDiv), ESWC 2013, CEUR Workshop Proceedings, Vol 979, Montpellier, France, May, 2013.
bib pdf link Observational data about species of public interest, such as birds and butterflies, is often created and collected by volunteered citizen scientists, and used by professionals for managing biodiversity. The education and skills of the citizens participating in the work varies a lot, and the process of making observations is typically not systematic but rather ad hoc. As a result, the quality of the observational data in repositories, such as the Global Biodiversity Information Facility GBIF Data Portal, is often not good, hampering its utilization severely. This paper presents an approach for enhancing data quality in a citizen science setting, and presents a mobile tool BirdWatch for citizen observers, mitigating difficulties in producing high quality Linked Data for biodiversity management.
Mika Wahlroos:
Indeksointimetatiedon eristäminen ja arviointi (Extraction and evaluation of index metadata). MSc Thesis (in Finnish), University of Helsinki, Department of Computer Science, February, 2013.
bib pdf Tiedonhallinnassa käytetään usein metatietona tiedon sisältöä kuvaavia avainsanoja parantamaan tiedon hallittavuutta tai löydettävyyttä. Sisällön kuvailua luonnollisen kielen termein tai käsittein kutsutaan indeksoinniksi. Yhdenmukaisuuden vuoksi voidaan käyttää tarkoitusta varten laadittua asiasanastoa, joka kattaa toimialan kannalta keskeisen termistön. Semanttisessa webissä ja yhdistetyssä tiedossa käytettävät ontologiat vievät ajatuksen pitemmälle määrittelemällä termit käsitteinä ja niiden välisinä merkityssuhteina. Metatiedon tuottamisen helpottamiseksi ja tehostamiseksi on kehitetty erilaisia menetelmiä, joilla sisältöä kuvailevia termejä voidaan tuottaa tekstiaineistosta automaattisesti. Tässä tutkielmassa keskitytään avaintermien automaattiseen eristämiseen tekstistä sekä metatiedon laatuun ja sen arvioinnin menetelmiin. Esimerkkitapauksena käsitellään ontologiaa hyödyntävän Maui-indeksointityökalun käyttöä asiakirjallisen tiedon automaattiseen asiasanoittamiseen. Automaattisesti eristetyn metatiedon laatua verrataan alkuperäiseen ihmisten määrittämään asiasanoitukseen käyttäen tarkkuus- ja saantimittauksia. Lisäksi evaluointia täydennetään aihealueen asiantuntijoiden esittämillä subjektiivisilla laatuarvioilla. Tulosten perusteella selvitetään tekstin esikäsittelyn ja sanaston hierarkian merkitystä automaattisen asiasanoituksen laadun kannalta sekä pohditaan keinoja annotointimenetelmän jatkokehittämiseksi.
2012
Sven Buschbeck, Anthony Jameson, Raphael Troncy, Houda Khrouf, Osma Suominen and Adrian Spirescu:
A Demonstrator for Parallel Faceted Browsing.
Proceedings of the IESD Challenge track at the International Workshop on Intelligent Exploration of Semantic Data (IESD 12), Galway, Ireland, October, 2012.
bib pdf
Kim Viljanen, Jouni Tuominen, Eetu Mäkelä and Eero Hyvönen:
Normalized Access to Ontology Repositories.
Proceedings of the Sixth International Conference on Semantic Computing (IEEE ICSC 2012), IEEE Press, Palermo, Italy, September, 2012.
bib pdf Ontology repositories, such as NCBO Bioportal, ONKI and Cupboard, help finding and using ontologies on the Semantic Web. However, currently each ontology repository constitutes a separate island with its own user interface, APIs, users, ontology languages and set of ontologies. Because there is not a universal way to access all ontology repositories, doing global search, browsing, and inference over all available ontology repositories turns out to be technically difficult and is generally not done. Ontologies are not reused as much as they could and hence the full potential of ontologies is not achieved. To address the problem, we propose the Normalized Ontology Repository (NOR) approach to make the ontology repositories universally accessible while maintaining their unique functionalities and strengths. The SKOS language is used as the lowest common denominator for presenting the ontologies. In addition, a simple API for searching and accessing the ontologies is defined. As a proof-of-concept evaluation, we present three case implementations to demonstrate the NOR approach: 1) the distributed architecture of the ONKI repository, 2) the metasearch for ONKI and NCBO Bioportal, and 3) publishing informal ontological concept collections as NOR end-points, demonstrated with the semantic portal CultureSampo and the metadata editor SAHA.
Eetu Mäkelä, Kaisa Hypén and Eero Hyvönen:
Improving Fiction Literature Access by Linked Open Data -Based Collaborative Knowledge Storage - the BookSampo Project.
World Library and Information Congress: 78th IFLA General Conference and Assembly, Helsinki, IFLA, http://conference.ifla.org/ifla78, August, 2012.
bib pdf BookSampo is a joint project between the Finnish public libraries and semantic web researchers, to improve fiction literature search and recommendation. In the project, dozens of librarians around Finland have used a collaborative web-based metadata editor to input diverse knowledge about fiction literature into a shared database. Particularly, the project has sought to improve access by indexing not only bibliographical information about the books, but focusing on the content and context of the works. In order to do this, the database employs advanced techniques such as functional, content-centered indexing, ontological vocabularies and the networked data model of linked open data. To demonstrate the functionality this makes possible, the fiction literature portal http://www.kirjasampo.fi/ was created. This portal uses the knowledge created in the project to offer advanced semantic search and recommendation based on the database created. In addition, web services exposing direct access to the data have been used for example in culture hack events to answer more complex questions, such as where in Finland are the most crimes committed in fiction literature.
Thea Lindquist, Eero Hyvönen, Juha Törnroos, Eetu Mäkelä:
Leveraging linked data to enhance subject access - A case study of the University of Colorado Boulder s World War I collection online.
World Library and Information Congress: 78th IFLA General Conference and Assembly, Helsinki, IFLA, http://conference.ifla.org/ifla78, August, 2012.
bib link Academic users often find work with online primary sources both rewarding and challenging. Improving subject access in these sources is essential as digital collections propagate and work with primary sources becomes increasingly important in humanities curricula. A user needs assessment was conducted with humanities users at the University of Colorado Boulder to facilitate engagement with these sources. Two of the major user needs identified were improving findability and context, particularly for historical subjects. Linked Data can help meet these needs by linking related concepts in the sources using a specialized vocabulary, enriching them with outside resources, and enabling semantically rich services that empower users. This paper discusses a project the authors undertook to enhance subject access in CU’s WWI Collection Online by deep linking historical data on the civilian experience in occupied Belgium. This work is intended to lead to a richer understanding of forces shaping the WWI period.
Eero Hyvönen, Aleksi Lindblad and Eetu Mäkelä:
TravelSampo System for Creating Mobile Audio Guide Tours Enriched with Linked Data.
Proceedings of CIDOC 2012 - Enriching Cultural Heritage, Helsinki, Finland, CIDOC, http://www.cidoc2012.fi/en/cidoc2012/programme, June, 2012.
bib pdf TravelSampo [1] is a prototype system, by which museums are able to create interactively audio guide tours inside museums and outside in the open air. The system includes a web-based editor by which a curator can describe objects in an exhibition, or in the open air, using a set of shared ontologies published in the National Ontology Service ONKI (http://onki.fi/), and upload related audio descriptions, text, and images. Each exhibit object is given an identifier and a geo-location. When the end-user is near the object, either in a museum or in the open air, information related to the object can be given to her based on the object identifier or GPS location. A major novelty of TravelSampo lies in its ability to associate the object metadata automatically with millions of semantically related pieces of information available though the Linked Data cloud (http://linkedata.org/) and the CultureSampo system (http://www.kulttuurisampo.fi/). For example, a painting can be linked, based on the underlying ontologies and metadata, with the biography of the painter in Wikipedia or in the National Biography, with other paintings of the artist in the collections of other museums, with photos and books about the artist, and so on. This gives the end-user a richer experience than is possible with traditional audio guide systems. For the museums, TravelSampo offers a cost-efficient and dynamic way of creating information rich audio guide programs, and re-using and linking each others collections through linked data, leading to a win-win situation. The paper presents and discusses the underlying ideas of TravelSampo and our experiences in developing the systems especially from the content publishers’, i.e. the museums’ viewpoint. [1] E. Mäkelä, J. Väätäinen, R. Alitalo, O. Suominen, E. Hyvönen: Discovering Places of Interest through Direct and Indirect Associations in Heterogeneous Sources - The TravelSampo System. Terra Cognita 2011: Foundations, Technologies and Applications of the Geospatial Web, CEUR Workshop Proceedings, Vol-798, 2011. http://ceur-ws.org/Vol-798/proceedings.pdf
Eero Hyvönen, Thea Lindquist, Juha Törnroos and Eetu Mäkelä:
History on the Semantic Web as Linked Data - An Event Gazetteer and Timeline for World War I.
Proceedings of CIDOC 2012 - Enriching Cultural Heritage, Helsinki, Finland, CIDOC, http://www.cidoc2012.fi/en/cidoc2012/programme, June, 2012.
bib pdf Events are an essential component of cultural heritage (CH) Linked Data (LD): they link actors, places, times, objects, and other events into larger narrative structures, providing a rich basis for semantic searching, recommending, analysis, and visualization of CH data. This paper argues that shared vocabularies (gazetteers, ontologies) of events, such as the “Battle of Normandy” or “Crucifixion of Jesus”, are necessary to facilitate the aggregation and linking of heterogeneous content from various collections. For example, biographies, histories, photos, and paintings often reference or depict events. A set of general requirements for an event gazetteer is presented, based on the needs of publishing, aggregating, and reusing cultural heritage content as Linked Data. After this, a metadata model addressing the presented requirements for representing historical events is outlined. The model is being applied in a case study aimed at developing an event ontology for World War I (WWI). Our goals from an end-user perspective are twofold: 1) Facilitate event-based cataloging for curators in memory organizations; 2) Utilize semantic event descriptions and narrative event structures in end-user applications for searching and linking documents and other content about WWI, and for structuring and visualizing them.
Jouni Tuominen, Kim Viljanen and Eero Hyvönen:
Ontologiapalvelut semanttisessa webissä (Ontology services on the Semantic Web). (in Finnish), Tietojenkäsittelytiede, no. 34, pp. 17-36, Tietojenkäsittelytieteen Seura ry, April, 2012.
bib pdf Ontologiat ovat keskeinen osa semanttista webiä: ne toimivat yhteisinä jaettuina käsitteistöinä, joiden avulla tietokoneet voivat käsitellä tietoa älykkäämmin. Jotta eri toimijat voivat hyödyntää yhteisiä käsitteistöjä sovelluksissaan, ontologiat on julkaistava heidän käyttöönsä. Yksinkertaisimmillaan ontologiat voidaan julkaista tiedostomuodossa. Tällöin jokainen toimija joutuu toteuttamaan itse toiminnallisuuksia ontologioiden hyödyntämiseen. Koska osa toiminnallisuuksista on yleisiä, useissa järjestelmissä toistuvia, niiden toteuttaminen valmiina palveluina on mielekästä. Palveluita voidaan tarjota ihmiskäyttäjille käyttöliittymäkomponentteina sekä ohjelmalliseen käyttöön rajapintoina, joita käyttämällä toiminnallisuudet voidaan integroida asiakasjärjestelmiin. Tässä artikkelissa kuvataan ontologioiden käyttäjäryhmien tarpeita sekä ontologiapalveluiden toteutuksia. Yleisten ontologioiden käyttämiseen liittyvien toiminnallisuuksien tarjoamiseksi esitetään ontologiapalvelu ONKI, joka on osa Suomalaiset semanttisen webin ontologiat -hankesarjassa (FinnONTO, 2003–2012) kehitettyä ontologiainfrastruktuuria. Artikkeli perustuu Jouni Tuomisen pro gradu -työhön, jolle Tietojenkäsittelytieteen Seura ry myönsi lukuvuoden 2009–2010 pro gradu -palkinnon. Tutkimustyöhön ovat osallistuneet myös Kim Viljanen ja Eero Hyvönen.
Eetu Mäkelä, Eero Hyvönen and Tuukka Ruotsalo:
How to deal with massively heterogeneous cultural heritage data – lessons learned in CultureSampo. Semantic Web – Interoperability, Usability, Applicability, vol. 3, no. 1, January, 2012.
bib pdf link This paper presents the CultureSampo system for publishing heterogeneous linked data as a service. Discussed are the problems of converting legacy data into linked data, as well as the challenge of making the massively heterogeneous yet interlinked cultural heritage content interoperable on a semantic level. Novel user interface concepts for then utilizing the content are also presented. In the approach described, the data is published not only for human use, but also as intelligent services for other computer systems that can then provide interfaces of their own for the linked data. As a concrete use case of using CultureSampo as a service, the BookSampo system for publishing Finnish fiction literature on the semantic web is presented.
2011
Eetu Mäkelä, Kaisa Hypén and Eero Hyvönen:
BookSampo--Lessons Learned in Creating a Semantic Portal for Fiction Literature.
The Semantic Web - ISWC 2011 - 10th International Semantic Web Conference, Bonn, Germany, pp. 173-188, Springer-Verlag, 2011.
bib pdf link BookSampo is a semantic portal in use, covering metadata about practically all Finnish fiction literature of Finnish public libraries on a work level. The system introduces a variety of semantic web novelties deployed into practise: The underlying data model is based on the emerging functional, content-centered metadata indexing paradigm using RDF. Linked Data (LD) principles are used for mapping the metadata with tens of interlinked ontologies in the national FinnONTO ontology infrastructure. The contents are also linked with the large LD metadata repository of related cultural heritage content of CultureSampo. BookSampo is actually based on using CultureSampo as a semantic web service, demonstrating the idea of re-using semantic content from multiple perspectives without the need for modifications. Most of the content has been transformed automatically from existing databases, with the help of ontologies derived from thesauri in use in Finland, but in addtion tens of volunteered librarians have participated in a Web 2.0 fashion in annotating and correcting the metadata, especially regarding older litarature. For this purpose, semantic web editing tools and public ONKI ontology services were created and used. The paper focuses on lessons learned in the process of creating the semantic web basis of BookSampo.
Matias Frosterus, Eero Hyvönen and Joonas Laitio:
Creating and Publishing Semantic Metadata about Linked and Open Datasets.
AAAI Fall Symposium 2011, Open Government Knowledge: AI Opportunities and Challenges, Arlington, USA, November, 2011.
bib pdf We present a comprehensive system for producing interoperable metadata for Linked Open datasets and governmental datasets published in various formats.
Eero Hyvönen:
Linked Open Aalto, Project Proposal. Aalto University, Department of Media Technology, November, 2011.
bib pdf Linked Open Aalto is a research project aiming at developing a semantic web approach for creating and publishing interlinked educational, research, and managerial contents produced at different communities, schools, departments, research groups, and persons in Aalto. By using semantic Linked (Open) Data principles, technologies, and open datasets available, Aalto contents can be interlinked with related teaching and research materials in Finland and internationally. By aggregating and combining local contents from separate incompatible data silos and systems, the end-user can be provided with a global, cross-disciplinary perspective to knowledge produced in Aalto and other universities. For example, a web page describing a course can be interlinked automatically with related research results, publications, projects, Wikipedia pages, research groups, researchers, internationally available video lectures, open course materials, events in Aalto, conferences, blog discussions, and so on.
Alexander García Castro, Ken Baclawski, John Bateman, Christoph Lange and Kim Viljanen (eds.):
.
Proceedings of the ISWC 2011 Workshop Ontologies Come of Age in the Semantic Web (OCAS), CEUR Workshop Proceedings, Vol 809, http://ceur-ws.org, ISSN 1613-0073, October, 2011.
bib link
Matias Frosterus, Eero Hyvönen and Mika Wahlroos:
Extending Ontologies with Free Keywords in a Collaborative Annotation Environment.
Proceedings of the ISWC 2011 Workshop Ontologies Come of Age in the Semantic Web (OCAS), CEUR Workshop Proceedings, Vol 809, http://ceur-ws.org, ISSN 1613-0073, Bonn, Germany, October, 2011.
bib pdf Semantic web technologies have introduced the idea of annotating content in terms of concepts taken from ontologies. Since concepts are defined in terms of properties and relations to other concepts, descriptions grow up into larger RDF graphs that can be used as a basis for data integration and intelligent information retrieval. Since ontologies do not typically contain all the possible concepts needed for annotation, it is usually necessary to offer the annotator the possibility to introduce new free keywords or tags in addition to the predefined ontology concepts. The problem then is that free keywords/tags do not have ontological connections to the rest of the RDF graph, unless such relations are defined by the annotator.We present a process for integrating free keywords into the ontological framework, and a practical tool implementation of it, discussing the challenges and possibilities introduced by the system. We also describe a case study performed for the Finnish Defence Forces, where the tool is used for creating a faceted semantic search portal featuring the free keywords and the ontological concepts at the same time.
Joonas Laitio:
Semantic Web Data Quality Control. MSc Thesis, Aalto University, School of Electrical Engineering, Degree Programme of Automation and Systems Technology, October, 2011.
bib pdf Data quality is a growing concern on the Semantic Web. The amount of data available is growing faster than ever, and the emphasis thus far has been on creating and interlinking data without much regard to how good the data actually is. The trend is shifting from creating new data to refining what already exists. Data quality is a subjective concept and a formal representation for it is often troublesome. First, we must define what is meant by data quality - what are the different facets of the concept. Second, a way for representing this quality must be found. Third, actual processes to refine data and improve its quality and ways to take data quality into account on the Semantic Web must be developed. This work presents some solutions to the problem. Many ways to annotate quality metadata as RDF are first discovered, along with their pros and cons. A framework for managing RDF-based quality metadata is presented, with a set of tools for specifically managing the quality annotations. Additionally, an automatic annotation system and a schema validation system, within the restraints of the open world assumption, have been designed, implemented and integrated into the framework. The system has been tested using real life datasets with promising first results.
Sini Pessala, Katri Seppälä, Osma Suominen, Matias Frosterus, Jouni Tuominen and Eero Hyvönen:
MUTU: An Analysis Tool for Maintaining a System of Hierarchically Linked Ontologies.
ISWC 2011 - Ontologies come of Age Workshop (OCAS), Bonn, Germany, October, 2011.
bib pdf We consider ontology evolution in a system of light-weight Linked Data ontologies, aligned with each other to form a larger ontology system. When one ontology changes, the human editor must keep track of the actual changes and of the modifications needed in the related ontologies in order to keep the system consistent. This paper presents an analysis tool MUTU, by which such changes and their potential effects on other ontologies can be found. Such an analysis is useful for the ontology editors for understanding the differences between ontology versions, and for updating linked ontologies when changes occurred in other components of an ontology system.
Jouni Tuominen, Nina Laurenne and Eero Hyvönen:
Biological Names and Taxonomies on the Semantic Web - Managing the Change in Scientific Conception.
Proceedings of the 8th Extended Semantic Web Conference (ESWC 2011), Springer-Verlag, Heraklion, Greece, June, 2011.
bib pdf Biodiversity management requires the usage of heterogeneous biological information from multiple sources. Indexing, aggregating, and finding such information is based on names and taxonomic knowledge of organisms. However, taxonomies change in time due to new scientific findings, opinions of authorities, and changes in our conception about life forms. Furthermore, organism names and their meaning change in time, different authorities use different scientific names for the same taxon in different times, and various vernacular names are in use in different languages. This makes data integration and information retrieval difficult without detailed biological information. This paper introduces a meta-ontology for managing the names and taxonomies of organisms, and presents three applications for it: 1) publishing biological species lists as ontology services (ca. 20 taxonomies including more than 80,000 names), 2) collaborative management of the vernacular names of vascular plants (ca. 26,000 taxa), and 3) management of individual scientific name changes based on research results, covering a group of beetles. The applications are based on the databases of the Finnish Museum of Natural History and are used in a living lab environment on the web.
Matias Frosterus, Eero Hyvönen and Joonas Laitio:
DataFinland - A Semantic Portal for Open and Linked Dataset.
Proceedings of the 8th Extended Semantic Web Conference (ESWC 2011), pp. 243-254, Springer-Verlag, Heraklion, Greece, June, 2011.
bib pdf link The number of open datasets available on the web is increasing rapidly with the rise of the Linked Open Data (LOD) cloud and various governmental efforts for releasing public data in different formats, not only in RDF. The aim in releasing open datasets is for developers to use them in innovative applications, but the datasets need to be found first and metadata available is often minimal, heterogeneous, and distributed making the search for the right dataset often problematic. To address the problem, we present DataFinland, a semantic portal featuring a distributed content creation model and tools for annotating and publishing metadata about LOD and non-RDF datasets on the web. The metadata schema for DataFinland is based on a modified version of the voiD vocabulary for describing linked RDF datasets, and annotations are done using an online metadata editor SAHA connected to ONKI ontology services providing a controlled set of annotation concepts. The content is published instantly on an integrated faceted search and browsing engine HAKO for human users, and as a SPARQL endpoint and a source file for machines. As a proof of concept, the system has been applied to LOD and Finnish governmental datasets.
Reetta Sinkkilä, Osma Suominen and Eero Hyvönen:
Automatic Semantic Subject Indexing of Web Documents in Highly Inflected Languages.
Proceedings of the 8th Extended Semantic Web Conference (ESWC 2011), pp. 215-229, Springer-Verlag, Heraklion, Greece, June, 2011.
bib pdf Structured semantic metadata about unstructured web documents can be created using automatic subject indexing methods, avoiding laborious manual indexing. A succesful automatic subject indexing tool for the web should work with texts in multiple languages and be independent of the domain of discourse of the documents and controlled vocabularies. However, analyzing text written in a highly inflected language requires word form normalization that goes beyond rule-based stemming algorithms. We have tested the state-of-the art automatic indexing tool Maui on Finnish texts using three stemming and lemmatization algorithms and tested it with documents and vocabularies of different domains. Both of the lemmatization algorithms we tested performed significantly better than a rule-based stemmer, and the subject indexing quality was found to be comparable to that of human indexers.
Kim Viljanen, Jouni Tuominen, Eetu Mäkelä and Eero Hyvönen:
Combining Distributed Ontology Repositories into a Global Service. June, 2011. Draft paper.
bib pdf Ontologies and vocabularies are a key resource for creating interoperable metadata on the Semantic Web. To make finding and using ontologies easier, the idea of Ontology Repositories has been introduced with current implementations including e.g. the NCBO Bioportal, ONKI and Cupboard. There is a genuine need for different kinds of Ontology Repositories, each focusing on different kinds specific user-needs, different ontologies and different organizational requirements which cannot be addressed by a single general implementation. However, at the moment each Ontology Repository is a separate island with its own user interfaces and APIs. They also use varying ontology languages such as OWL, SKOS, and RDF Schema. Due to this, global search, browsing, and inference over the repositories is difficult and generally not done which means that, for example, finding and reusing existing ontologies becomes difficult. To address the problems, we have developed a loosely coupled Network of Ontology Repositories (NOR) architecture that makes the repositories globally interoperable while maintaining their unique functionalities and strengths. To participate in the network, each ontology repository is required to implement a shared API. As a proof-of-concept evaluation, we present three case implementations demonstrating different aspects of the NOR approach: 1) internal distributed architecture of ONKI, 2) global search of ONKI and NCBO Bioportal, 3) publishing non-ontological concept collections as NOR endpoints, demonstrated with the semantic portal CultureSampo and the metadata editor SAHA.
Kaisa Hypén and Eetu Mäkelä:
An ideal model for an information system for fiction and its application: Kirjasampo and Semantic Web. Library Review, vol. 60, no. 4, April, 2011.
bib link Purpose – Library Director Jarmo Saarti introduced a wide or ideal model for fiction in literature in his dissertation, published in 1999. It introduces those aspects that should be included in an information system for fiction. Such aspects include literary prose and its intertextual references to other works, the writer, readers and critics receptions of the work as well as a researcher s view. It is also important to note how libraries approach a literary work by means of inventory, classification and content description. The most ambiguous of the aspects relates to that context in cultural history, which the work reflects and is a part of. The paper aims to discuss these issues. Design/methodology/approach – Since the model consists of several components which are not found in present library information systems and cannot be implemented by them, a new way had to be found to produce, save, process and present fiction‐related metadata. The Semantic Computing Research Group of Aalto University has developed several Semantic Web services for use in the field of culture, so cooperation with it and the use of Semantic Web tools were a natural starting point for the construction of the new service. Kirjasampo will be based on the Semantic Web RDF data model. The model enables a flexible linking of metadata derived from different sources, and it can be used to build a Semantic Web that can be approached contextually from different angles. Findings – The “semantically enriched” ideal model for fiction has hence been realised, at least to some extent: Kirjasampo supports literature‐related metadata that is more varied than earlier and aims to account for different contexts within literature and connections with regard to other cultural phenomena. It also includes contemporary reviews of works and, as such, readers receptions as well. Modern readers can share their views on works, once the user interface of the server is completed. It will include several features from the Kirjasto 2.0‐application, which enables the evaluation, description and recommendations of works. The service should be online by the end of Spring 2011. Research limitations/implications – The project involves novel collaboration between a public library and a computer science research unit, and utilises a novel approach to the description of fiction. Practical implications – The system encourages user participation in the description of fiction and is of practical benefit to librarians in understanding both how fiction is organised and how users interpret the same. Originality/value – Upon completion, the service will be the first Finnish information system for libraries built with the tools of the Semantic Web which offers a completely new user environment and application for data produced by libraries. It also strives to create a new model for saving and producing data, available to both library professionals and readers. The aim is to save, accumulate and distribute literary knowledge, experiences and silent information.
Lyndon Nixon, Stamatia Dasiopoulou, Jean-Pierre Evain, Eero Hyvönen, Ioannis Kompatsiaris and Raphael Troncy:
Multimedia, Broadcasting and eCulture.
Handbook of Semantic Web Technologies (John Domingue, Dieter Fensel and James Hendler (eds.)), Springer-Verlag, January, 2011.
bib link
Katariina Nyberg:
Document Classification Using Machine Learning and Ontologies. MSc Thesis, Aalto University, School of Science, Degree Programme of Information Networks, January, 2011.
bib pdf This master s thesis explores a way in which documents can be automatically classified based on their contents. Automatic classification of data is one of the main applications of machine learning. With the help of already classified data a model for the most likely class can be learned. Whether adding background knowledge from ontologies can be added to the model in order to improve the classification accuracy, is also explored in this master s thesis. A new machine learning model is introduced that incorporates ontology information. The proposed method for learning a classification model and enhancing it with ontology information is used in a case study for the Finnish National Archives and a set of digital documents that have been manually classified. An RDF schema for representing documents, sentences and words is created in order to prepare tha data for the machine learning analysis. The words are put into base form and matched semi-automatically with concepts of the General Finnish Ontology YSO. Then the ontology enhanced model is applied on the data and the most likely classes for documents are learned. The master s thesis shows that the classification accuracy of the model increases when ontology information is added to it.
2010
Tomi Kauppinen, Panu Paakkarinen, Eetu Mäkelä, Heini Kuittinen, Jari Väätäinen and Eero Hyvönen:
Geospatio-temporal Semantic Web for Cultural Heritage.
Digital Culture and E-Tourism: Technologies, Applications and Management Approaches, 2010.
bib pdf link People frequently need to find knowledge related to places when they plan a leisure trip, when they are executing that plan in a certain place, or when they want to virtually explore a place they have visited in the past. In this chapter we present and discuss a set of methods for searching and browsing spatiotemporally referenced knowledge related to cultural objects, e.g. artifacts, photographs and visiting sites. These methods have been implemented in the semantic cultural heritage portal CULTURESAMPO that offers map-based interfaces for a user to explore hundreds of thousands of content objects and points of interest in Finland. Our goal is to develop and demonstrate novel ways to help the user 1) to decide where to go for a trip, and 2) to learn more about the neighborhoods and points of interest during the visit.
Kim Viljanen, Jouni Tuominen and Eero Hyvönen:
A Network of Ontology Repositories. December, 2010. Draft paper.
bib pdf Ontologies and vocabularies are a key resource for creating interoperable metadata on the Semantic Web. To make the finding and using ontologies easier, the idea of Ontology Repositories have been introduced with current implementations including e.g. the NCBO Bioportal, ONKI and Cupboard. However, at the moment each ontology repository is a separate island with its own user interfaces and APIs. They also use varying ontology languages such as OWL, SKOS, RDF Schema and others. Due to this, global search, browsing, and inference over the repositories is difficult and generally not done. At the same time, there is a genuine need for different kinds of Ontology Repositories, each focusing on different kinds specific user-needs, different ontologies and different organizational requirements which can not be addressed by a single global implementation. Since there are benefits of having interoperability among the repositories, we have developed a loosely coupled Network of Ontology Repository (NOR) architecture that makes the repositories globally interoperable while maintaining their unique functionalities and strengths. To participate in the network, each ontology repository is required to implement a shared API. As a proof-of-concept, we present a global metasearch prototype for searching simultaneously hundreds of ontologies in the ONKI and NCBO Bioportal repositories.
Eetu Mäkelä:
View-Based User Interfaces for the Semantic Web. Dissertation, Aalto University, School of Science and Technology, Espoo, November, 2010. D.Sc. dissertation.
bib pdf This thesis explores the possibilities of using the view-based search paradigm to create intelligent user interfaces on the Semantic Web. After surveying several semantic search techniques, the view-based search paradigm is explained, and argued to fit in a valuable niche in the field. To test the argument, numerous portals with different user interfaces and data were built using the paradigm. Based on the results of these experiments, this thesis argues that the paradigm provides a strong, extendable and flexible base on which to built semantic user interfaces. Designing the actual systems to be as adaptable as possible is also discussed.
Kim Viljanen, Jouni Tuominen, Mikko Salonoja and Eero Hyvönen:
Global Access to Distributed Ontology Repositories.
Poster Papers, the Extended Semantic Web Conference ESWC 2010, Heraklion, Greece, June, 2010.
bib pdf Ontology repository systems are used for publishing and sharing ontologies. However, currently the repositories form separate islands of ontologies, which hinders the user from finding and utilizing the most suitable ontological concepts and ontologies on a global level. In contrast, this paper presents the idea of creating a network of Linked Open Ontology Services (LOOS) based on a set of ontology services that publish their content via a shared API. This facilitates global search and browsing over all ontologies in the network. LOOS has been implemented in the National Finnish Ontology Service ONKI serving currently 79 ontologies.
Kim Viljanen, Jouni Tuominen, Mikko Salonoja and Eero Hyvönen:
Linked Open Ontology Services.
Workshop on Ontology Repositories and Editors for the Semantic Web (ORES 2010), the Extended Semantic Web Conference ESWC 2010, CEUR Workshop Proceedings, Vol. 596, Heraklion, Greece, June, 2010.
bib pdf link Ontology repository systems are used for publishing and sharing ontologies and vocabularies for content indexing, information retrieval, content integration, and other purposes. However, interlinking these distributed repositories to provide global search and browsing over the repositories has not been made. In the spirit of Linked Open Data, we propose creating a network of Linked Open Ontology Services (LOOS) consisting of ontology repositories that publish their content using a shared API. To test the approach, we have defined an HTTP API and present a proof-of-concept implementation consisting of three client applications that are used for accessing a LOOS network of over 50 ontology servers, part of the Ontology Library Service ONKI.
Jouni Tuominen, Mikko Salonoja, Kim Viljanen and Eero Hyvönen:
A User Interface for Ontology Repositories.
Workshop on Ontology Repositories and Editors for the Semantic Web (ORES 2010), the Extended Semantic Web Conference ESWC 2010, CEUR Workshop Proceedings, Vol. 596, Heraklion, Greece, June, 2010.
bib pdf link Finding ontologies and concepts from a collection of ontologies is a recurring task in many use cases, such as content indexing, searching, and ontology developing. To facilitate this, efficient search and browsing methods are needed. This paper introduces ONKI2, an ontology browser providing a user interface for a repository of ontologies. The system provides a multi-facet search facility for finding an ontology. Finding concepts is supported by autocompletion-based text search that can be refined with additional restrictions. ONKI2 is in use in the Finnish Ontology Library Service ONKI for a collection of 79 ontologies and vocabularies.
Osma Suominen and Eero Hyvönen:
Expressing and Aggregating Rich Event Descriptions.
Proceedings of the 6th Workshop on Scripting and Development on the Semantic Web, Heraklion, Greece, May, 2010.
bib pdf Publishing information about upcoming events such as concerts and discussion group meetings in a structured format allows the event information to be aggregated, filtered and delivered to potential participants. Making automatic personalized recommendations about events requires structured metadata such as machine-understandable locations and semantic descriptions about the topic and audience of the event. We present a survey of the state of current semantic representation formats for events, including iCalendar and its RDFa and microformat representations, and show that their support for expressing rich structured metadata is limited. We have also tested how well different tools support and understand the formats. Based on the surveys we have implemented a rich event information schema for a health-oriented activity portal and developed an aggregation and validation tool for gathering and processing event information.
Jouni Tuominen:
Helppokytkentäiset ontologiapalvelut semanttisessa webissä. MSc Thesis (in Finnish), University of Helsinki, Department of Computer Science, May, 2010.
bib pdf link Ontologiat luovat semanttisen webin perustan: ne toimivat yhteisinä jaettuina käsitteistöinä, joiden avulla tietokoneet voivat käsitellä tietoa älykkäämmin. Jotta eri toimijat voivat hyödyntää yhteisiä käsitteistöjä sovelluksissaan, ontologiat on julkaistava heidän käyttöönsä. Yksinkertaisimmillaan ontologiat voidaan julkaista datana, tiedostomuodossa. Tällöin jokainen toimija joutuu toteuttamaan itse toiminnallisuuksia ontologioiden hyödyntämiseen. Osa toiminnallisuuksista on yleisiä, useissa järjestelmissä toistuvia, kuten ontologian visualisointi, selaaminen ja käsitehaku. On kuitenkin kustannustehokkaampaa toteuttaa yleisiä ontologiatoiminnallisuuksia valmiina palveluina. Palveluita voidaan tarjota ihmiskäyttäjille käyttöliittymäkomponentteina sekä ohjelmalliseen käyttöön rajapintoina, joita käyttämällä toiminnallisuudet voidaan integroida asiakasjärjestelmiin. Lisäksi käytettäessä ontologioita palveluina toimijoiden käytössä on aina ontologioiden ajantasaiset versiot. Tässä tutkielmassa kuvataan ontologioiden käyttäjäryhmien -- ontologioiden kehittäjien, tiedon annotoijien, tiedon hakijoiden ja semanttisen webin sovellusten kehittäjien -- tarpeita sekä esitellään ontologioiden hyödyntämiseen kehitettyjä sovelluksia. Yleisten ontologioiden käyttämiseen liittyvien toiminnallisuuksien tarjoamiseksi esitetään ontologiapalvelu ONKI, joka julkistettiin virallisesti käyttöön syyskuussa 2008.
2009
Osma Suominen, Eero Hyvönen, Kim Viljanen and Eija Hukka:
HealthFinland-a National Semantic Publishing Network and Portal for Health Information. Journal of Web Semantics, vol. 7, no. 4, pp. 287-297, Dec, 2009.
bib pdf Providing citizens with reliable, up-to-date and individually relevant health information on the web is done by governmental, non-governmental, business and other organizations. Currently the information is published with little co-ordination and co-operation between the publishers. For publishers, this means duplicated work and costs due to publishing same information twice on many websites. Also maintaining links between websites requires work. From the citizens point of view, finding content is difficult due to e.g. differences in layman’s vocabularies compared to medical terminology and difficulties in aggregating information from several sites. To solve these problems, we propose as a solution a national scale semantic publishing system HealthFinland which consists of a 1) a centralized content infrastructure of health ontologies and services with tools, 2) a distributed semantic content creation channel based on several health organizations, and 3) an intelligent semantic portal aggregating and presenting the contents from intuitive and health promoting end-user perspectives for human users as well as for other web sites and portals.
Mikko Salonoja:
Palveluiden semanttinen kuvailu ja haku. MSc Thesis, Helsinki University of Technology, Department of Automation and Systems Technology, September, 2009.
bib pdf Ihminen tarvitsee kuluttajana ja yrittäjänä usein toisten ihmisten ja organisaatioden tukea palveluiden muodossa. Aiemmin tuntemattomien palveluiden löytäminen voi kuitenkin olla vaikeaa Internetin tiedon määrän lisääntyessä, koska tiedonhakijalle mielekäs sisältö hautautuu helposti epäolennaisen tiedon joukkoon. Tämän ongelman helpottamiseksi semanttisen webin tekniikat tarjoavat uusia mahdollisuuksia. Tässä diplomityössä tutkittiin millaisia palveluiden haun kannalta hyödyllisiä ratkaisuja on aiemmin toteutettu. Tämän jälkeen nämä ratkaisut peilattiin kahden Internetissä sijaitsevan palvelun, PKT-säätiön ylläpitämä Yrityksen palveluhakemisto ja Suomen Asiakastiedon ylläpitämä Aarrepalvelu, parannusehdotuksiksi. Tutkittuja palveluita ehdotettiin tässä diplomityössä parannettavan useilla erilaisilla tavoilla. Näistä osalla ei ollut suoraan mitään tekemistä semanttisen webin tekniikoiden kanssa ja osa taas liittyi semanttiseen webiin hyvin kiinteästi. Erityisesti maantieteellisen tiedon käsittelyssä havaittiin olevan runsaasti kehitettävää molempien tutkittujen palveluiden kohdalla. Myös eroja parannusehdotusten välillä oli havaittavissa. Johtuen Aarre-palvelun suuremmasta tietokannasta ja suuremmasta käyttäjäjoukosta siinä sanojen välisen verkon hyödyntäminen näytti olevan merkityksellisempää kuin Yrityksen palveluhakemiston kohdalla.
Matias Frosterus and Eero Hyvönen:
Bridging the Search Gap between the Web of Pages and Web of Data by Combining Ontological Document Expansion with Text Search.
Proceedings of the International Conferences on Digital Libraries and the Semantic Web 2009 (ICSD2009), Trento, Italy, September, 2009.
bib pdf The Semantic Web extends traditional web documents, i.e. the Web of Pages, with conceptual structures based on ontologies and metadata, i.e. the Web of Data. This paper presents a hybrid document search approach combining the benefits of the traditional text search of literal documents and the semantic search based on their underlying conceptual structures. The approach is based on document expansion, where documents are automatically annotated with not only the concepts explicitly present in a given document, but also with the ontologically related concepts using smaller weights. Our test results using the CLEF Test Suite suggest that document expansion alone achieves better recall than text search at the expense of precision. As a solution, a method of combining document expansion with text search is presented in which better recall was obtained without sacrificing precision. This approach seems promising when integrating unstructured, textual content with the Semantic Web of Data.
Jussi Kurki and Eero Hyvönen:
Authority Control of People and Organizations on the Semantic Web.
Proceedings of the International Conferences on Digital Libraries and the Semantic Web 2009 (ICSD2009), Trento, Italy, September, 2009.
bib pdf Authors and documents with identical titles are common in the digital library environment. In order to manage identities correctly, authority control is used by library and information scientists for disam- biguating and cross-referencing entity names. We argue that the benefits of traditional authority control can be enhanced by using techniques and technologies of the Semantic Web, leading to simpler management of multiple languages, better linkability of resources, simpler reuse of au- thority registries in applications, and less work in indexing. To demon- strate our propositions, we have created a prototype of an ontology server and service called ONKI People that is used in two ways: First, it is a centralized authority service providing human end-users with efficient and easy to use authority finding and disambiguation services based on faceted semantic search and visualizations. The services are available on- line also as AJAX and Web Services API for machines to use. Second, the underlying RDF triple store can be used as a content resource in ap- plications such as semantic cultural heritage portals. The paper discusses and demonstrates both use cases in a real life setting.
Eero Hyvönen, Eetu Mäkelä, Tomi Kauppinen, Olli Alm, Jussi Kurki, Tuukka Ruotsalo, Katri Seppälä, Joeli Takala, Kimmo Puputti, Heini Kuittinen, Kim Viljanen, Jouni Tuominen, Tuomas Palonen, Matias Frosterus, Reetta Sinkkilä, Panu Paakkarinen, Joonas Laitio, Katariina Nyberg:
CultureSampo - A National Publication System of Cultural Heritage on the Semantic Web 2.0.
Proceedings of the 6th European Semantic Web Conference (ESWC2009), Heraklion, Greece, May 31 - June 4, 2009. Springer-Verlag.
bib pdf CULTURESAMPO is an application demonstration of a national level publication system of cultural heritage contents on the Web, based on ideas and technologies of the Semantic (Web and) Web 2.0. On the semantic side, the system presents new solutions to interoperability problems of dealing with multiple ontologies of different domains, and to problems of integrating multiple metadata schemas and cross-domain content into a homogeneous semantic portal. A novelty of the system is to use semantic models based on events and narrative process descriptions for modeling and visualizing cultural phenomena, and for semantic recommendations. On the Web 2.0 side, CULTURESAMPO proposes and demonstrates a content creation process for collaborative, distributed ontology and content development including different memory organizations and citizens. The system provides the cultural heritage contents to end-users in a new way through multiple (nine) thematic perspectives, based on semantic visualizations. Furthermore, CULTURESAMPO services are available for external web-applications to use through semantic AJAX widgets.
Jouni Tuominen, Tomi Kauppinen, Kim Viljanen and Eero Hyvönen:
Ontology-Based Query Expansion Widget for Information Retrieval.
Proceedings of the 5th Workshop on Scripting and Development for the Semantic Web (SFSW 2009), 6th European Semantic Web Conference (ESWC 2009), CEUR Workshop Proceedings, Vol. 449, Heraklion, Greece, May 31 - June 4, 2009.
bib pdf link In this paper we present an ontology-based query expansion widget which utilizes the ontologies published in the ONKI Ontology Service. The widget can be integrated into a web page, e.g. a search system of a museum catalogue, enhancing the page by providing a query expansion functionality. We have tested the system with general, domain-specific and spatio-temporal ontologies.
Jouni Tuominen, Matias Frosterus, Kim Viljanen and Eero Hyvönen:
ONKI SKOS Server for Publishing and Utilizing SKOS Vocabularies and Ontologies as Services.
Proceedings of the 6th European Semantic Web Conference (ESWC 2009), pp. 768-780, Springer-Verlag, Heraklion, Greece, May 31 - June 4, 2009.
bib pdf Vocabularies are the building blocks of the Semantic Web providing shared terminological resources for content indexing, information retrieval, data exchange, and content integration. Most semantic web applications in practical use are based on lightweight ontologies and, more recently, on the Simple Knowledge Organization System (SKOS) data model being standardized by W3C. Easy and cost-efficient publication, integration, and utilization methods of vocabulary services are therefore highly important for the proliferation of the Semantic Web. This paper presents the ONKI SKOS Server for these tasks. Using ONKI SKOS, a SKOS vocabulary or a lightweight ontology can be published on the web as ready-to-use services in a matter of minutes. The services include not only a browser for human usage, but also Web Service and AJAX interfaces for concept finding, selecting and transporting resources from the ONKI SKOS Server to connected systems. Code generation services for AJAX and Web Service APIs are provided automatically, too. ONKI SKOS services are also used for semantic query expansion in information retrieval tasks. The idea of publishing ontologies as services is analogous to Google Maps. In our case, however, vocabulary services are provided and mashed-up in applications. ONKI SKOS was published in the beginning of 2008 and is to our knowledge the first generic SKOS server of its kind. The system has been used to publish and utilize some 60 vocabularies and ontologies in the National Finnish Ontology Service ONKI www.yso.fi.
Tomi Kauppinen, Kimmo Puputti, Panu Paakkarinen, Heini Kuittinen, Jari Väätäinen and Eero Hyvönen:
Learning and Visualizing Cultural Heritage Connections between Places on the Semantic Web.
Proceedings of the Workshop on Inductive Reasoning and Machine Learning on the Semantic Web (IRMLeS2009), The 6th Annual European Semantic Web Conference (ESWC2009), May 31 - June 4, 2009.
bib pdf Semantic web techniques can be used to relate two things together. However, usually this relation is not accompanied with a measure that would tell how interesting the relation is. Data mining tradition provides interestingness measures; it is natural to try and fit semantic web and data mining traditions together. In this paper we use support and confidence values provided by association rule mining as interestingness measures for relations. The presented method is tailored to location ontologies in order to find out what interesting mutual relations two places have based on annotations in the cultural heritage domain. The method also uses ontology-based reasoning to group places together. We present tests of running the method against a set of over 60,000 annotations in order to find out cultural heritage connections between places.
Kim Viljanen, Jouni Tuominen and Eero Hyvönen:
Ontology Libraries for Production Use: The Finnish Ontology Library Service ONKI.
Proceedings of the 6th European Semantic Web Conference (ESWC 2009), pp. 781-795, Springer-Verlag, Heraklion, Greece, May 31 - June 4, 2009.
bib pdf This paper discusses problems of creating and using ontology library services in production use. One approach to a solution is presented with an online implementation--the Finnish Ontology Library Service ONKI--that is in pilot use on a national level in Finland. ONKI contributes to previous research on ontology libraries in many ways: First, mashup and web service support with various tools is provided for cost-efficient utilization of ontologies in indexing and search applications. Second, services covering the different phases of the ontology life cycle are provided. Third, the services are provided and used in real world applications on a national scale. Fourth, the ontology framework is being developed by a collaborative effort by organizations representing different application domains, such as health, culture, and business.
Eero Hyvönen, Eetu Mäkelä, Tomi Kauppinen, Olli Alm, Jussi Kurki, Tuukka Ruotsalo, Katri Seppälä, Joeli Takala, Kimmo Puputti, Heini Kuittinen, Kim Viljanen, Jouni Tuominen, Tuomas Palonen, Matias Frosterus, Reetta Sinkkilä, Panu Paakkarinen, Joonas Laitio, Katariina Nyberg:
CultureSampo - Finnish Culture on the Semantic Web 2.0. Thematic Perspectives for the End-user.
Proceedings, Museums and the Web 2009, Indianapolis, USA, April 15-18, 2009.
bib pdf We present an overview of CultureSampo, an ambitious system for creating a collective semantic memory of the cultural heritage of a nation on the Semantic Web 2.0, combining ideas underlying the Semantic Web and the Web 2.0. The system addresses the semantic web challenge of aggregating highly heterogeneous, cross-domain cultural heritage collections and other contents into a semantically rich intelligent system for human and machine users. At the same time, CultureSampo is an approach to solve the social and practical Web 2.0 challenge of organizing the underlying collaborative ontology development and content creation work of memory organizations and citizens. This paper focuses on CultureSampo’s search, recommendation, and visualization services for the end-users. The key idea here is to access cultural heritage on the Semantic Web through nine “thematic perspectives”, such as places on the maps, the social network of cultural persons, timelines, and narrative texts, e.g. biographies and literary works.
Eero Hyvönen, Eetu Mäkelä, Tomi Kauppinen, Olli Alm, Jussi Kurki, Tuukka Ruotsalo, Katri Seppälä, Joeli Takala, Kimmo Puputti, Heini Kuittinen, Kim Viljanen, Jouni Tuominen, Tuomas Palonen, Matias Frosterus, Reetta Sinkkilä, Panu Paakkarinen, Joonas Laitio, Katariina Nyberg:
CultureSampo - Finnish Cultural Heritage Collections on the Semantic Web 2.0.
Proceedings of the 1st International Symposium on Digital Humanities for Japanese Arts and Cultures (DH-JAC-2009), Ritsumeikan University, Kyoto, Japan, March, 2009.
bib pdf This paper presents an overview of the SemanticWeb 2.0 application CultureSampo, an ambitious system for creating a collective semantic memory of the cultural heritage of a nation on the Semantic Web 2.0, combining ideas underlying the Semantic Web and the Web 2.0. The system addresses the semantic web challenge of aggregating highly heterogeneous, cross-domain cultural heritage content into a semantically rich intelligent system for human and machine users. At the same time, CultureSampo is an approach to solve the social and practical Web 2.0 challenge of organizing the underlying collaborative ontology development and content creation work of memory organizations and citizens.
Tomi Kauppinen, Heini Kuittinen, Jouni Tuominen, Katri Seppälä and Eero Hyvönen:
Extending an Ontology by Analyzing Annotation Co-occurrences in a Semantic Cultural Heritage Portal.
Proceedings of the ASWC 2008 Workshop on Collective Intelligence (ASWC-CI 2008) organized as a part of the 3rd Asian Semantic Web Conference (ASWC 2008), Bangkok, Thailand, February 2-5, 2009.
bib pdf Ontologies aim to capture knowledge about things and their relationships. Publishing ontologies on the Semantic Web enables people and organizations to use shared ontologies in annotating e.g. photographs, videos, music, and other types of cultural objects. Search engines also use relationships provided by ontologies in semantic search, e.g. for query expansion or for view-based search. However, building ontologies is a time-consuming process, and it should be helped by automatic finding of interesting, possible relationships. Finding the correct concept for annotation purposes is helped by subsumption and partonomy hierarchies and associative relationships. In this paper we show how an analysis of co-occurrences of concepts in annotations can be used to provide interesting relationships for enriching ontological structures. We use association rule mining techniques and test the idea using a set of annotations of cultural objects in CULTURESAMPO portal and the Finnish General Upper Ontology YSO. The results are visualized in the ONKI SKOS browser to give an additional layer on top of the original relationships of the YSO ontology. An analysis shows that best ranked relationships should also be included in the ontology as subclassof or associative relationships.
2008
Tuukka Ruotsalo, Katri Seppälä, Kim Viljanen, Eetu Mäkelä, Jussi Kurki, Olli Alm, Tomi Kauppinen, Jouni Tuominen, Matias Frosterus, Reetta Sinkkilä and Eero Hyvönen:
Ontology-based Approach for Interoperability of Digital Collections. Signum, no. 5, 2008.
bib pdf This paper presents solutions and lessons learned in FinnONTO project carried out in Finland in 2003–2007. The paper focuses on three aspects of interoperability of digital collections. First, transforming thesauri to ontologies. Second, publishing ontologies for the use of indexers and content providers. Third, ontology based methods for improving end user access to digital collections. The first aspect is analysed through case studies done with Finnish thesauri. The second is discussed by presenting the ONKI ontology server. The last aspect is demonstrated in the scope of the semantic portal CultureSampo for publishing cultural heritrage on the Semantic Web.
Eero Hyvönen, Suvi Kettula:
Kulttuurisampo (CultureSampo. Museo-lehti, no. 4, Museoliitto, Helsinki, Finland, Nov, 2008.
bib pdf Kulttuurisampo kokoaa suomalaisen kulttuurin palapeliä aivan uudella tavalla, kirjoittavat Eero Hyvönen ja Suvi Kettula
Eero Hyvönen:
Semanttinen web ja paikkatietoihin perustuvat palvelut (Semantic web and services based on geographical data.
Historiaa kunnioittaen, tulevaisuuteen suunnaten. Maanmittaustieteen päivät 2008, Maanmittaustieteiden Seura, julkaisu n:o 45, ss. 8-16, Espoo, Finland, Nov, 2008.
bib pdf Artikkelissa luodaan katsaus kansallisessa Suomalaiset semanttisen webin ontologiat hankkeessa FinnONTO 2003-2007 ja FinnONTO 2.0 2008-2010 kehitettyihin paikkaontologioihin SUO (Suomalainen paikkaontologia) ja SAPO (Suomen ajallinen paikkaontologia), näiden julkaisemiseen AJAX-palveluina Kansallisessa ontologiapalvelussa ONKI, sekä paikkaontologioiden avulla Kulttuurisampo-portaaliin kehitettyihin palveluhin.
Eero Hyvönen, Kim Viljanen, Jouni Tuominen, Katri Seppälä, Tomi Kauppinen, Matias Frosterus, Reetta Sinkkilä, Jussi Kurki, Olli Alm, Eetu Mäkelä and Joonas Laitio:
National Ontology Infrastructure Service ONKI. Oct 1, 2008.
bib pdf This paper presents the national level cross-domain ontology and ontology service infrastructure ONKI used in Finland. The novelty of ONKI is based on two ideas. First, the core ontologies are developed collaboratively by experts transforming thesauri into mutually aligned lightweight ontologies, based on a large top ontology that is extended by various domain specific ontologies. Second, the National Ontology Service ONKI has been implemented for publishing ontologies cost-efficiently as ready to use services. ONKI provides legacy and other applications with ready to use functionalities for using ontologies on the HTML level by Ajax and semantic widgets. ONKI has been used in various applications for creating mash-up applications in a way analogous to using Google Maps, but in our case external applications are mashed-up with ontology support for indexing and information retrieval.
Eero Hyvönen, Eetu Mäkelä, Tomi Kauppinen, Olli Alm, Jussi Kurki, Tuukka Ruotsalo, Katri Seppälä Kim Viljanen, Jouni Tuominen, Tuomas Palonen, Matias Frosterus, Reetta Sinkkilä, Panu Paakkarinen, Joonas Laitio, Katariina Nyberg:
CultureSampo - A Collective Memory of Finnish Cultural Heritage on the Semantic Web 2.0.
Semantic Computing Research Group, Helsinki University of Technology and University of Helsinki, Sept 29, 2008.
bib pdf This paper presents the Semantic Web 2.0 application CULTURESAMPO, an ambitious system of creating a collective semantic memory of the cultural heritage of a nation on the Semantic Web 2.0, combining ideas underlying the Semantic Web and the Web 2.0. The system addresses the semantic challenge of aggregating highly heterogeneous, cross-domain cultural heritage into a semantically rich intelligent system for human and machine users. At the same time, CULTURESAMPO is an approach to solve the social and practical Web 2.0 challenge of organizing the underlying collaborative ontology development and content creation work of memory organizations and citizens.
Jouni Tuominen, Matias Frosterus, Kim Viljanen and Eero Hyvönen:
ONKI-SKOS - Publishing and Utilizing Thesauri in the Semantic Web.
AI and Machine Consciousness - Proceedings of the 13th Finnish Artificial Intelligence Conference STeP 2008, Espoo, Finland, August 20-22, 2008.
bib pdf Thesauri and other controlled vocabularies act as building blocks of the Semantic Web by providing shared terminology for facilitating information retrieval, data exchange and integration. Representation and publishing methods are needed for utilizing thesauri efficiently, e.g., in content indexing and searching. W3C has provided the Simple Knowledge Organization System (SKOS) data model for expressing concept schemes, such as thesauri. A standard representation format for thesauri eliminates the need for implementing thesaurus specific rules or applications for processing them. However, there do not exist general tools which provide out of the box support for publishing and utilizing SKOS vocabularies in applications, without needing to implement application specific user interfaces for end users. For solving this problem the ONKI-SKOS server is presented.
Jussi Kurki:
Finding People and Organizations on the Semantic Web.
AI and Machine Consciousness - Proceedings of the 13th Finnish Artificial Intelligence Conference STeP 2008, Espoo, Finland, August 20-22, 2008.
bib pdf Finding people is essential in finding information. Librarians and information scientists have studied authority control - psychologists and sociologists social networks. In aforementioned, authors link to documents (and co-authors) creating access points to information. In latter, social paths serve as channels for rumours as well as expertise. Key problems include identification and disambiguation of individuals followed by difficulties of tracking the social connections. With semantic web, these aspects can be approached simultaneously. In this paper, we define a simple ontology for describing people and organizations. The model is based on FOAF and other existing vocabularies. We also demonstrate search and visualization tools for finding people.
Kim Viljanen, Jouni Tuominen and Eero Hyvönen:
Publishing and Using Ontologies as Mash-Up Services.
Proceedings of the 4th Workshop on Scripting for the Semantic Web (SFSW2008), 5th European Semantic Web Conference 2008 (ESWC 2008), CEUR Workshop Proceedings, Vol. 368, Tenerife, Spain, June 1-5, 2008.
bib pdf link The Semantic Web is based on using ontologies for enabling semantically disambiguated data exchange between distributed systems on the web. This requires efficient means for publishing ontologies on the web to ensure the availability, sharing and acceptance of the ontologies. Support services are needed for utilizing ontologies easily and cost-effectively in applications and legacy systems lacking ontology support. To address these vital needs, this paper presents the ONKI ontology service which provides ready-to-use mash-up functionalities, such as semantic disambiguation, concept finding and concept fetching as ready-to-use web widgets for adding ontology support to e.g. HTML forms using JavaScript. Two implementations of the ONKI Server are presented: ONKI-SKOS for ontologies presented in the Simple Knowledge Organization System (SKOS) language and ONKI-Geo for geographical ontologies with a map interface. The presented ONKI systems are operational on the web, used in the National Finnish Ontology Service. They have been successfully used in several pilot applications.
Tomi Kauppinen, Jari Väätäinen and Eero Hyvönen:
Creating and Using Geospatial Ontology Time Series in a Semantic Cultural Heritage Portal.
S. Bechhofer et al.(Eds.): Proceedings of the 5th European Semantic Web Conference 2008 ESWC 2008, LNCS 5021, Tenerife, Spain, pp. 110-123, Springer-Verlag, June 1-5, 2008.
bib pdf Content annotations in semantic cultural heritage portals commonly make spatiotemporal references to historical regions and places using names whose meanings are different in different times. For example, historical administrational regions such as countries, municipalities, and cities have been renamed, merged together, split into parts, and annexed or moved to and from other regions. Even if the names of the regions remain the same (e.g., “Germany”), the underlying regions and their relationships to other regions may change (e.g., the regional coverage of “Germany” at different times). As a result, representing and finding the right ontological meanings for historical geographical names on the semantic web creates severe problems both when annotating contents and during information retrieval. This paper presents a model for representing the meaning of changing geospatial resources. Our aim is to enable precise annotation with temporal geospatial resources and to enable semantic search and browsing using related names from other historical time periods. A simple model and metadata schema is presented for representing and maintaining geospatial changes from which an explicit time series of temporal part-of ontologies can be created automatically. The model has been applied successfully to representing the complete change history of municipalities in Finland during 1865–2007, and the resulting ontology time series is used in the semantic cultural heritage portal CULTURESAMPO to support faceted semantic search of contents and to visualizing historical regions on overlaying maps originating from different historical eras.
Reetta Sinkkilä, Eetu Mäkelä, Tomi Kauppinen and Eero Hyvönen:
Combining Context Navigation with Semantic Autocompletion to Solve Problems in Concept Selection.
First International Workshop on Semantic Metadata Management and Applications, SeMMA 2008, Located at the Fifth European Semantic Web Conference (ESWC 2008), Tenerife, Spain, June 2nd, 2008. Proceedings (Khalid Belhajjame, Mathieu d Aquin, Peter Haase and Paolo Missier (eds.)), CEUR Workshop Proceedings, vol. 346, pp. 61-68, CEUR-WS.org, Tenerife, Spain, June 1-5, 2008.
bib pdf Many tasks on the semantic web require the user to choose concepts from a limited vocabulary e.g. for describing an indexed resource or for use in semantic search. Semantic autocompletion interfaces offer an efficient way for concept selection. However, these interfaces usually do not expose the semantic context of the matched concepts, thereby making it hard to know if a matched concept is the right one, as well as hiding possibly more appropriate choices. Ontology browsers, on the other hand, show context but do not allow quick discovery or embedding into other applications. To lessen these problems, we present an interface combining semantic autocompletion with in-place ontological context navigation. Because required context differs between ontologies, the implementation was designed to make it easy to add different contexts and visualizations. To test the applicability of our idea and implementation the, system was tested on three ontologies with different requirements and structure.
Eero Hyvönen, Eetu Mäkelä, Tuukka Ruotsalo, Tomi Kauppinen, Olli Alm, Jussi Kurki, Joeli Takala, Kimmo Puputti and Heini Kuittinen:
CultureSampo-Finnish Culture on the Semantic Web.
Posters of the 5th European Semantic Web Conference 2008 (ESWC 2008), Tenerife, Spain, June 1-5, 2008.
bib pdf This paper presents the semantic portal CULTURESAMPO---Finnish Culture on the Semantic Web . The portal provides memory organizations and other cultural content publishers with a national, shared semantic publication channel for heteroge- nous cultural contents. The content comes from over ten organizations and is annotated using various ontologies of the FinnONTO infrastructure. For the end-user, intel- ligent semantic search, recommendation, and visualization services for accessing and learning about cultural heritage are provided.
Tomi Kauppinen, Riikka Henriksson, Reetta Sinkkilä, Robin Lindroos, Jari Väätäinen and Eero Hyvönen:
Ontology-based Disambiguation of Spatiotemporal Locations.
Proceedings of the 1st international workshop on Identity and Reference on the Semantic Web (IRSW2008), 5th European Semantic Web Conference 2008 (ESWC 2008), CEUR Workshop Proceedings, ISSN 1613-0073, June 1-5, 2008.
bib pdf Geographic place names are semantically often highly ambiguous. For example, there are 491 places in Finland sharing the same name ”Isosaari” (great island) that are instances of several geographical classes, such as Island, Forest, Peninsula, Inhabited area, etc. Referencing unambiguously to a particular ”Isosaari”, either when annotating content or during information retrieval, can be quite problematic and requires usage of advanced search methods and maps for semantic disambiguation. Historical places introduce even more challenges, since historical metadata commonly make spatiotemporal references to historical regions and places using names whose meanings are non-existing or different in different times. This paper presents how these problems have been addressed in a large Finnish place ontology SUO and a historical geo-ontology SAPO. A location ontology server ONKI-Geo has been created for publishing the ontologies and utilizing them as mashup services. To demonstrate the usability of our ontologies, two case applications in the cultural heritage domain are presented.
Eero Hyvönen, Kim Viljanen, Jouni Tuominen and Katri Seppälä:
Building a National Semantic Web Ontology and Ontology Service Infrastructure - The FinnONTO Approach.
Proceedings of the European Semantic Web Conference ESWC 2008, pp. 95-109, Springer, Tenerife, Spain, June, 2008.
bib pdf This article presents the vision and results of creating a national level cross-domain ontology service infrastructure in Finland in the FinnONTO project. The novelty of the infrastructure is based on two ideas. First, a system of open source core ontologies is being developed by transforming thesauri into mutually aligned lightweight ontologies, including a top ontology of 20,000 concepts that is extended by various domain specific ontologies. Second, the ONKI Ontology Server framework for publishing ontologies as ready to use services has been designed and implemented. ONKI provides legacy and other applications with ready to use functionalities for using ontologies on the user interface level as semantic widgets. The idea is to use ONKI for creating mash-up applications in a way analogous to using Google or Yahoo Maps, but in our case external applications are mashed-up with ontology support. The ontology framework presented is operational on the web and is being used in creating the application demonstrations.
Robin Lindroos:
Paikkatiedon ontologiapalvelu. MSc Thesis, Helsinki University of Technology (TKK), May, 2008.
bib pdf Tämä diplomityö käsittelee menetelmiä, joilla paikkatietoaineistoja muunnetaan ontologiseen muotoon sekä esittelee palvelun, ONKI-Paikan, jolla ontologisessa muodossa olevaa paikkatietoa voidaan tuottaa, ylläpitää ja hakea. Palvelu perustuu paikkatiedon mallintamiseen Suomalaisen paikkaontologian SUO:n mukaisesti. Työ koostuu neljästä vaiheesta. Ensimmäisessä vaiheessa selvitetään menetelmä, jolla SUO-ontologia populoidaan paikkainstansseilla. Erityistä huomiota kiinnitetään paikkojen uniikkien tunnisteiden, URI:en luomiseen. Toisessa vaiheessa selvitetään, miten ontologian populointivaiheessa tuotetut paikkojen RDF-kuvaukset on tallennettava. Kolmannessa vaiheessa ratkotaan ontologisessa muodossa olevan paikkatietoaineiston suuren määrän tuomia ongelmia muun muassa kehittämällä paikkatiedon RDF-varastolle indeksointitietokanta nopeita hakuja varten. Neljännessä vaiheessa kehitetään rajapinta hakujen suorittamista varten sekä hakurajapintaa hyödyntävä graafinen, selaimessa toimiva käyttöliittymä. Työ on tehty osana FinnONTO-projektia, jossa kehitettiin suomalaisiin olosuhteisiin räätälöityjä semanttisen webin ontologioita sekä näitä hyödyntäviä palveluita.
Reetta Sinkkilä:
Käsitteen kontekstiperustainen valinta semanttisessa webissä. MSc Thesis, University of Helsinki, Department of Computer Science, May, 2008.
bib pdf Semanttisen webin ideana on kuvailla tietoa siten, että koneet pystyvät ymmärtämään sitä, ja käyttämään älykkäitä tekniikoita tiedon hyödyntämiseksi. Tiedon kuvailemise käytetään ontologioita jotka muodostavat laajoja käsiteverkkoja. Kuvailua varten ontologiasta on läydettävä tarkimmin kohdetta kuvaavat käsitteet. Tässä työssä tutkittiin minkälaisia menetelmiä käsitteenvalintaan on kehitetty ja millä tavoin käsitteenvalintaa voidaan tukea visuaalisin keinoin. Lisäksi perehdyttiin joihinkin semanttisen webin sovelluksiin niiden käsitteenvalinnan osalta. Käsitevalitsimissa havaittiin puutteita liittyen käsitteiden merkityksen disambiguointiin ja siihen, kuinka helppoa niiden avulla on valita paras käsite kuvaamaan tietoa. Ongelmia oli myös tavoissa esittää useiden ontologioiden yhdistelmiä. Näiden havaintojen pohjalta suunniteltiin ja toteutettiin yleinen ja monentyyppisen aineiston käsittelyyn soveltuva käsitevalitsin IRMA
Riikka Henriksson, Tomi Kauppinen and Eero Hyvönen:
Core Geographical Concepts: Case Finnish Geo-Ontology. Location and the Web (LocWeb) 2008 workshop, 17th International World Wide Web Conference WWW 2008, ACM International Conference Proceeding Series; Vol. 300, Pages 57-60, Beijing, China, April 21-25, 2008.
bib pdf In this paper we examine 1) the scope of geo-ontologies used especially for the purposes of information retrieval on the Web, 2) the core geographical concepts and their mutual relations, and 3) the properties the concepts have. Furthermore, we present the Finnish geo-ontology (Suomalainen paikkaontologia, SUO) and discuss the theories and principles that have governed the development process, as well as the limitations and requirements the use of geographical dictionaries as an instance data source have imposed to the content and the structure of SUO.
Kim Viljanen, Jouni Tuominen, Teppo Känsälä and Eero Hyvönen:
Distributed Semantic Content Creation and Publication for Cultural Heritage Legacy Systems.
Proceedings of the 2008 IEEE International Conference on Distributed Human-Machine Systems, IEEE Press, Athens, Greece, March 9-12, 2008.
bib pdf Cultural heritage is by nature strongly interlinked, e.g. thematically and historically, but at the same time distributed in heterogeneous collections of different memory organizations at different locations. In order to provide the end-users with aggregated homogeneous views to distributed heterogeneous contents, semantic portals have been created successfully based on metadata and shared (or aligned) ontologies. This paper discusses two problems encountered in such a distributed semantic content creation environment. First, during the content creation work, how could a publisher start using shared ontologies in legacy cataloguing and annotation systems that do not support ontologies. Second, during content publication, how could a publisher re-use the aggregated content in its own legacy publication system, e.g., on the ordinary web pages of a museum or in a collection browser. As a solution, we present the ONKI Ontology Server for adding shared ontological annotation functionalities to legacy cataloguing systems in a practical, cost-efficient and lightweight way. For distributed publishing of the aggregated semantic portal services, we introduce the lightweight mash-up web widget components called floatlets . A major idea behind both the ONKI functionalities and floatlets is that they can be easily integrated with legacy systems on the user interface level, in the same spirit as e.g. Google Maps.
Matias Frosterus:
Tekstiaineiston ontologiaperustainen indeksointi ja haku. MSc Thesis, Helsinki University of Technology, Department of Automation and Systems Technology, March, 2008.
bib pdf Informaation lisääntyessä yhteiskunnassa vaaditaan sen tehokasta käsittelyä yhä enemmän ammattilaisten lisäksi myös tavallisilta käyttäjiltä. Tällöin luonnollinen pyrkimys on yksinkertaistaa ja automatisoida tiedonhakuprosessia mahdollisimman paljon, johon semanttisen webin tekniikat tarjoavat uusia mahdollisuuksia. Tässä diplomityössä tutkittiin mahdollisuuksia dokumentin laajentamisen ja ontologisten käsitteiden hyödyntämisen kautta parantaa tiedonhakuprosessia tekstipohjaiseen aineistoon, kuten sanomalehtiarkistoon. Tätä tarkoitusta varten luotiin automaattinen annotointi ja hakusovellus Airo, joka suorittaa jonkin annetun ontologian pohjalta dokumentin laajennuksen. Tämä tapahtuu ontologisella käsiteklusteroinnilla, jossa jonkin käsitteen esiintyminen tekstissä nostaa myös ontologian hierarkiassa läheisten käsitteiden painoa kyseistä dokumenttia indeksoitaessa ja haettaessa. Järjestelmän testit osoittivat, että käsitehaku yhdistettynä sanahakuun laskee haun tarkkuutta, mutta nostaa saantia. Sen sijaan hybridimenetelmä dokumentin- ja kyselyn laajennuksesta, jossa perinteisen sanahaun tuottamien dokumenttien käsitteillä suoritetaan laajentava haku, nosti saantia tarkkuuden kärsimättä. Luotu järjestelmä on ontologiariippumaton ja jokaisen ontologian tuottamat käsitteistykset talletetaan omaan indeksiinsä, jolloin niitä voidaan hakea erikseen.
Osma Suominen:
Käyttäjäkeskeinen moninäkymähaku semanttisessa portaalissa. MSc Thesis, University of Helsinki, Department of Computer Science, February, 2008.
bib pdf Tiedonhakuun webissä on kehitetty sanahaun lisäksi rikkaampia tiedonhaku- ja selausmenetelmiä, jotka mahdollistavat tutkivan tiedonhaun. Niistä on hyötyä silloin, kun käyttäjä ei etukäteen tiedä täsmälleen, mitä hän on etsimässä. Yksi tällainen hakukäyttöliittymätyyppi on moninäkymähaku, jossa haun kohteena oleva tietosisältö luokitellaan moniulotteiseksi avaruudeksi fasettiluokituksen periaattein. Käyttöliittymä mahdollistaa aineiston haun ja selaamisen minkä tahansa ulottuvuuden tai niiden yhdistelmän suhteen. Moninäkymähaun kehitys lähti liikkeelle käyttöliittymätutkimuksen piiristä. Moninäkymähakuun perustuvia käyttöliittymiä käytettiin myöhemmin semanttisen webin sovelluksissa ja portaaleissa, joissa kuitenkaan ei samassa määrin huomioitu käyttäjiä suunnitteluprosessin aikana. Tutkielmassa käyttäjäkeskeisiä suunnittelu- ja tiedonjäsennysmenetelmiä sovellettiin terveysaiheista materiaalia sisältävän tervesuomi.fi-portaalin suunnitteluun sekä toteutettiin moninäkymähakua käyttävä portaalin prototyyppi semanttisen webin teknologioiden avulla. Portaalin informaatioarkkitehtuuri suunniteltiin korttienjärjestämismenetelmän avulla ja sen käyttöliittymä rakennettiin käyttäjäkeskeisellä suunnittelu- ja arviointiprosessilla. Tulosten arviointi osoittaa, että käyttäjäkeskeisistä menetelmistä oli merkittävää hyötyä portaalin suunnittelussa. Valmiin prototyypin käytettävyyden arviointi osoitti, että portaalin käytettävyys on käyttäjäkeskeisten suunnittelu- ja arviointimenetelmien ansiosta saatu hyvälle tasolle. Arviointi myös paljasti portaalin aineistoissa, käyttöliittymässä ja tiedon jäsennystavassa ongelmia, joihin terveysportaalin jatkokehityksessä voidaan puuttua.
2007
Thomas Häggström:
Toimintakeskeisen semanttisen moninäkymähaun toteutus ja evaluointi kulttuurialan portaalisovelluksessa. MSc Thesis, Helsinki University of Technology (TKK), December, 2007.
bib pdf Diplomityön tavoitteena on tutkia semanttisen webin tarjoamia mahdollisuuksia tiedonhaussa. Tähän tavoitteeseen pyrin toteuttamalla tiedonhakujärjestelmän, jossa lukuisista eri museoista peräisin olevat heterogeeniset aineistot ovat haettavissa yhdellä käyttöliittymällä. Työni kirjallisuusosassa käsittelen tiedonhaun teoriaa, metadatan käyttöä ja semanttisen webin toimintaperiaatteita. Kirjallisuusosan yhteydessä paneudun tutkimusryhmässä aikaisemmin tehtyyn tutkimukseen erityisesti sisällönkuvailun ja moninäkymähaun osalta. Työssäni kehitin heterogeenisen aineiston yhdistämisen mahdollistavaa toimintakeskeistä sisällönkuvailun menetelmää ja tietomallia. Mallin varaan toteutin moninäkymähaun periaatteella toimivan tiedonhakujärjestelmän, jolla toimintakeskeisesti kuvailtua aineistoa voidaan hakea. Saadakseni tarkkaa tietoa tiedonhakujärjestelmän toimivuudesta ja soveltuvuudesta suunniteltuun käyttötarkoitukseen, evaluoin valmista tiedonhakujärjestelmää käyttäjäkeskeisen evaluointimallin mukaisesti. Evaluointia varten suunnittelin kokeen, johon kuuluivat hakutehtävät ja tiedon keruun menetelmät, kuten kyselylomakkeet, transaktioloki sekä videokuvan ja äänen kaappaus. Toimintakeskeinen tietomalli osoittautui tiedonhakujärjestelmän sovelluskehityksen aikana toimivaksi ja heterogeenisiä aineistoja yhdistäväksi tietomalliksi. Työssä kehittämäni toimintakeskeinen semanttinen moninäkymähaku sekä aineiston kartta- ja aikajanaprojisointi toimivat täysin toimintakuvausten ja temaattisten roolien varassa. Käyttäjätestit puolestaan todistivat tiedonhakujärjestelmän toimivaksi. Hakujärjestelmä tuki jokaista tehtävätyyppiä ja käyttäjät pitivät arvioissaan järjestelmää uudentyyppisenä, tehokkaana, toimivana ja hyödyllisenä. Vaikka käyttöliittymä tarjosi perinteisen vapaatekstihaun, käytettiin toimintakeskeistä semanttista moninäkymähakua jokaisen tehtävätyypin suorituksen yhteydessä. Käyttäjät olivat melko tyytyväisiä hakujärjestelmän palauttamien dokumenttien relevanssiin. Kartta- ja aikajanaprojektioita pidettiin innovatiivisina ja toimivina lisäominaisuuksina. Käyttäjätestien aikana kritisoitiin moninäkymähaun käytön vaikeutta ja näkymien sisältöä pidettiin vaikeasti ymmärrettävänä. On selvää, että moninäkymähakua on edelleen kehitettävä. Evaluoinnin tulosten perusteella näyttää siltä, että moninäkymähaku ja vapaatekstihaku soveltuvat erityyppisten tehtävien suorittamiseen ja ovat näin ollen toisiaan täydentäviä hakutapoja. Moninäkymähakua hyödynnettiin tehtävissä, joissa lähtötiedot eivät olleet tarkkoja. Tehtävissä, joissa tehtävänannossa annettiin tarkkaa metatietoa, oli tekstihaku hyödyllisempi.
Eetu Mäkelä, Osma Suominen and Eero Hyvönen:
Automatic Exhibition Generation Based on Semantic Cultural Content.
Proceedings of the Cultural Heritage on the Semantic Web Workshop at the 6th International Semantic Web Conference (ISWC 2007), Busan, Korea, November 12, 2007.
bib pdf In this paper, we argue for a need to shift focus in semantic search from the items themselves to using them as lenses to wider topics. A system for doing this in the cultural heritage domain is presented, duplicating on the web the way exhibitions in the real world are organized. An interface for specifying such exhibitions is presented, combining a general narrative pattern with semantic autocompletion and the novel concept of domain-centric view-based search. This also solves a number of problems view-based search has previously encountered in the cultural heritage domain. Presented also are multiple visualizations for the exhibition, supporting the user in making sense of the data and in doing exploratory search.
Eero Hyvönen, Olli Alm and Heini Kuittinen:
Using an Ontology of Historical Events in Semantic Portals for Cultural Heritage.
Proceedings of the Cultural Heritage on the Semantic Web Workshop at the 6th International Semantic Web Conference (ISWC 2007), Busan, Korea, November 12, 2007.
bib pdf We argue that an ontology of historical events is needed in semantic portals for cultural heritage due to three reasons. First, ontological identifiers (URIs) of events, such as the World War II or coronation of Napoleon, are needed in order to make collection metadata mutually interoperable in terms of related events---in the vein as identifiers are needed for identifying artifact types, persons, and geolocations when annotating collection items. Second, events are of central importance in creating semantic links between cultural contents in applications such as recommendation systems. Third, historical events are important as content items of their own, forming the backbone of chronological histories.
Tuukka Ruotsalo and Eero Hyvönen:
An Event-based Approach for Semantic Metadata Interoperability.
Proceedings of the 6th International Semantic Web Conference (ISWC 2007), Busan, Korea, Springer-Verlag, November 11-15, 2007.
bib pdf This paper presents a method for making metadata conforming to heterogeneous schemas semantically interoperable. The idea is to make the knowledge embedded in the schema structures interoperable and explicit by transforming the schemas into a shared, event-based representation of knowledge about the real world. This enables and simplifies accurate reasoning services such as cross-domain semantic search, browsing, and recommending. A case study of transforming three different schemas and datasets is presented. An implemented knowledge-based recommender system utilizing the results in the semantic portal \CS\ was found useful in a preliminary user study.
Eetu Mäkelä, Tuukka Ruotsalo and Eero Hyvönen:
Automatic Exhibition Generation Based on Semantic Cultural Content.
Poster proceedings of the 6th International Semantic Web Conference, Busan, Korea, November 11-15, 2007.
bib pdf This paper shortly presents an automatic exhibition generation interface that turns the focus of semantic search from search items to the concepts they are annotated with.
Eetu Mäkelä, Kim Viljanen, Olli Alm, Jouni Tuominen, Onni Valkeapää, Tomi Kauppinen, Jussi Kurki, Reetta Sinkkilä, Teppo Känsälä, Robin Lindroos, Osma Suominen, Tuukka Ruotsalo and Eero Hyvönen:
Enabling the Semantic Web with Ready-to-Use Web Widgets.
Proceedings of the First Industrial Results of Semantic Technologies Workshop, ISWC2007, pp. 56-69, CEUR Workshop Proceedings, Vol. 293, November 11, 2007.
bib pdf link A lot of functionality is needed when an application, such as a museum cataloguing system, is extended with semantic capabilities, for example ontological indexing functionality or multi-facet search. To avoid duplicate work and to enable easy and cost-efficient integration of information systems with the Semantic Web, we propose a web widget approach. Here, data sources are combined with functionality into readyto-use software components that allow adding semantic functionality to systems with just a few lines of code. As a proof of the concept, we present a collection of general semantic web widgets and case applications that use them, such as the ontology server ONKI, the annotation editor SAHA and the culture portal CultureSampo.
Eetu Mäkelä, Reetta Sinkkilä and Eero Hyvönen:
Combining Cross-ontology Navigation with Semantic Autocompletion.
Poster proceedings of the 6th International Semantic Web Conference, Busan, Korea, November 11-15, 2007.
bib pdf Semantic autocompletion interfaces offer an efficient way for concept selection useful in both search and annotation applications. However, these interfaces usually do not expose the semantic context of the matched concepts, thereby making it hard to know if a matched concept is the right one, as well as hiding possibly more appropriate choices. To lessen these problems, we present an in-place ontological context navigation interface to be used with semantic autocompletion.
Kim Viljanen, Jouni Tuominen, Eero Hyvönen, Eetu Mäkelä and Osma Suominen:
Extending Content Management Systems with Ontological Annotation Capabilities.
Poster proceedings of the 6th International Semantic Web Conference, Busan, Korea, November 11-15, 2007.
bib pdf Producing semantic metadata requires efficient methods, e.g., concept finding, for accessing and using ontologies. To add such functionalities to metadata applications such as cataloging systems in museums, we propose a \emphmash-up approach where ready-to-use user interface components for using specific ontologies are made available to be integrated into applications. As a proof-of-concept, we present the \emphOntology Service ONKI wich implements semantic autocompletion concept search and concept browsing for ontologies as shared mash-up components.
Eetu Mäkelä, Tuukka Ruotsalo and Eero Hyvönen:
Domain-Centric View-Based Search.
Poster proceedings of the 6th International Semantic Web Conference, Busan, Korea, November 11-15, 2007.
bib pdf In current Semantic Web view-based search systems views are formed by selecting properties and enumerating all their values as selections. This approach breaks down with multiple content types, such as in the cultural heritage domain, because the number of differing properties, and therefore views becomes unmanageable. We propose a novel solution termed Domain-Centric View-Based Search, in which views are created based on common property ranges and domain ontologies.
Jussi Kurki and Eero Hyvönen:
Relational Semantic Search: Searching Social Paths on the Semantic Web.
Poster Proceedings of the International Semantic Web Conference (ISWC 2007), Busan, Korea, Nov, 2007.
bib pdf This paper presents a system for searching semantic relations between web resources, in our case significant persons of art history. The system is based on the Union List of Artists Names (ULAN) metadata of some 120,000 persons and organizations.
Eero Hyvönen, Robin Lindroos, Tomi Kauppinen and Riikka Henriksson:
An ontology service for geographical content.
Poster Proceedings of the International Semantic Web Conference (ISWC 2007), Busan, Korea, Nov, 2007.
bib pdf Geographic place names are widely used but are semantically often highly ambiguous. For example, there are 491 places in Finland sharing the same name Isosaari (great island) that are instances of several geographical classes, such as Island, Forest, Peninsula, Inhabited area, etc. Referencing unambiguously to a particular Isosaari , either when annotating content or during information retrieval, can be quite problematic and requires usage of advanced search methods and maps for semantic disambiguation. This paper presents an ontology server, ONKI-Paikka, for solving the place finding and place name disambiguation problem. In ONKI-Paikka, places can be found by a faceted search engine, combined with semantic autocompletion and a map service for constraining search and for visualizing results. The service can be connected to legacy applications cost-effectively by using Ajax-technology in the same spirit as Google Maps that is used in ONKI-Paikka as a subservice.
Eero Hyvönen, Tuukka Ruotsalo, Thomas Häggström, Mirva Salminen, Miikka Junnila, Mikko Virkkilä, Mikko Haaramo, Eetu Mäkelä, Tomi Kauppinen and and Kim Viljanen:
CultureSampo-Finnish Culture on the Semantic Web: The Vision and First Results (based on the STeP 2006 paper below).
In: K. Robering (ed.): Information Technology for the Virtual Museum. LIT Verlag, Berlin., Nov, 2007.
bib pdf This paper concerns the idea of publishing heterogenous cultural content on the Semantic Web. By heterogenous content we mean metadata describing potentially any kind of cultural objects, including artifacts, photos, paintings, videos, folklore, cultural sites, cultural process descriptions, biographies, history etc. The metadata schemas used are different and the metadata may be represented at different levels of semantic granularity. This work is an extension to previous research on semantic cultural portals, such as MuseumFinland, that are usually based on a shared homogeneous schema, such as Dublin Core, and focus on content of similar kinds, such as artifacts. Our experiences suggest that a semantically richer event-based knowledge representation scheme than traditional metadata schemas is needed in order to support reasoning when performing semantic search and browsing. The new key idea is to transform different forms of metadata into event-based knowledge about the entities and events that take place in the world or in fiction. This approach facilitates semantic interoperability and reasoning about the world and stories at the same time, which enables implementation of intelligent services for the end-user. These ideas are addressed by presenting the vision and solution approaches taken in two prototype implementations of a new kind of cross-domain semantic cultural portal “CULTURESAMPO—Finnish Culture on the Semantic Web”.
Kim Viljanen, Jouni Tuominen and Eero Hyvönen:
ONKI Ontology Server--Extending Legacy Systems with Ontology Mash-up Services. November, 2007. Draft paper.
bib pdf The Semantic Web is based on using shared ontologies for enabling semantically disambiguated data exchange between distributed systems on the web. This requires, from the ontology publisher s viewpoint, efficient means for publishing ontologies on the web to ensure the availability and acceptance of the ontologies. From the ontology user s viewpoint, support services are needed for utilizing ontologies easily and cost-effectively in the users own systems that are typically legacy systems without ontology support. This paper presents the ONKI ontology server for addressing these vital needs. For the publisher, ONKI provides a server and a Simple Knowledge Organization (SKOS) compatible light-weight ontology browser with ready-made web interfaces for making ontologies available both for human and machine users. For external legacy and other applications, ONKI provides centralized ontology services for semantic disambiguation, concept finding, and concept fetching. A major contribution of ONKI is to provide these services as ready-to-use functionalities for creating mash-up applications very cost-efficiently. Two prototypes of the system---ONKI-SKOS for all kinds of ontologies and ONKI-Geo for geographical ontologies with a map mash-up interface---are operational on the web and are currently being successfully used in several pilot applications.
Eero Hyvönen, Joeli Takala, Olli Alm, Tuukka Ruotsalo and Eetu Mäkelä:
Semantic Kalevala - Accessing Cultural Contents Through Semantically Annotated Stories.
Proceedings of the Cultural Heritage on the Semantic Web Workshop at the 6th International Semantic Web Conference (ISWC 2007), Busan, Korea, Nov, 2007.
bib pdf An event-based approach is presented for annotating events and narrative structures underlying texts and stories semantically. The idea is applied to using the Finnish national epic Kalevala for accessing related cultural contents, such as artifacts, paintings etc. in a semantic portal.
Eero Hyvönen, Kim Viljanen and Osma Suominen:
HealthFinland - Finnish Health Information on the Semantic Web.
Proceedings of the 6th International Semantic Web Conference (ISWC 2007), Busan , Korea, Springer-Verlag, Nov, 2007.
bib pdf This paper shows how semantic web techniques can be applied to solving problems of distributed content creation, discovery, linking, aggregation, and reuse in health information portals, both from end-users and content publishers viewpoints. As a case study, the national semantic health portal \HF\ is presented. It provides citizens with intelligent searching and browsing services to reliable and up-to-date health information created by various health organizations in Finland. The system is based on a shared semantic metadata schema, ontologies, and ontology services. The content includes metadata about thousands of web documents such as web pages, articles, reports, campaign information, news, services, and other information related to health.
Eero Hyvönen, Kim Viljanen, Osma Suominen and Eija Hukka:
HealthFinland - Publishing Health Promotion Information on the Semantic Web. International Journal of Health Care Engineering, vol. 15, no. 5, pp. 325, oct, 2007. Abstract of a longer paper.
bib
Olli Alm:
Tekstidokumenttien automaattinen ontologiaperustainen annotointi. MSc Thesis, University of Helsinki, Department of Computer Science, September, 2007.
bib pdf Semanttisen Webin perustavana ajatuksena on tuoda Internetiin – tai suppeammassa mielessä hyperlinkitettyyn aineistoon – järjestystä määrittelemällä eksplisiittisiä, koneluettavia käsitteistöjä ja kuvaamalla Internetin sisältämää aineistoa tällä käsitteistöllä. Nämä kaksi työvaihetta kuuluvat keskeisesti Semanttisen Webin ydinalueisiin. Tässä tutkielmassa määritellään Semanttisen Webin liittyvän aineiston kuvailun eli ontologiaperustaisen annotoinnin piirteitä ja toisaalta myös rajoja. Ontologiaperustainen annotointi on aineiston kuvailua, jonka määrittävänä piirteenä on tietomalli. Annotoinnin automatisointi on keskeinen haaste ontologiaperustaisten järjestelmien tuottamisessa, sillä manuaalisesti tehtävä annotointi on yleensä hidasta ja aikaa vievää. Automaattista annotointia edustavien järjestelmien joukko on kirjava, eikä täsmällistä määrittelyä automaattisen annotoinnin ongelmakentästä esiinny kirjallisuudessa. Työssä määritellään automaattisille annotointijärjestelmille malli, jonka avulla voidaan vertailla järjestelmiä toisiinsa ja mallintaa uusia. Mallia sovelletaan työssä ontologiaperustaisten järjestelmien vertailuun ja automaattisen annotointijärjestelmän Pokan, toteuttamisessa.
Eero Hyvönen, Kim Viljanen, Eetu Mäkelä, Tomi Kauppinen, Tuukka Ruotsalo, Onni Valkeapää, Katri Seppälä, Osma Suominen, Olli Alm, Robin Lindroos, Teppo Känsälä, Riikka Henriksson, Matias Frosterus, Jouni Tuominen, Reetta Sinkkilä and Jussi Kurki:
Elements of a National Semantic Web Infrastructure - Case Study Finland on the Semantic Web (Invited paper).
Proceedings of the First International Semantic Computing Conference (IEEE ICSC 2007), Irvine, California, September, 2007. IEEE Press.
bib pdf This article presents the vision and results of creating the basis for a national semantic web content infrastructure in Finland in 2003-2007. The main elements of the infrastructure are shared and open metadata schemas, core ontologies, and public ontology services. Several practical applications testing and demonstrating the usefulness of the infrastructure are overviewed in the fields of eCulture, eHealth, eGovernment, eLearning, and eCommerce.
Kim Viljanen, Eero Hyvönen, Eetu Mäkelä, Osma Suominen and Jouni Tuominen:
Mash-up Ontology Services for the Semantic Web. Demo track at the European Semantic Web Conference ESWC 2007, Innsbruck, Austria, June 4-5, 2007.
bib pdf We present ONKI ontology server, a mash-up approach for integrating ontology library services with semantic web applications. The idea of ONKI is to provide applications with ready-to-use ontology service functionalities, such as semantic autocompletion, browsing, and annotation support, at the user interface level using AJAX mash-up technologies. The system is being integrated with various semantic web applications.
Jari Väätäinen:
Ajallisesti muuttuvan paikkatiedon hallinta. Mediatekniikka, EVTEK, May, 2007.
bib pdf Paikannimiä käytetään monissa arkistoissa, museoissa ja tietokannoissa ainoana paikka-tietona. Paikannimistössä ja rajoissa tapahtuu kuitenkin ajan kuluessa muutoksia. Valtioiden rajat muuttuvat, läänien ja maakuntien alueita määritetään uudestaan ja kunnat jakaantuvat tai yhdistyvät toisiinsa tai vaihtavat nimeään. Tietojen hakeminen tämäntyyppisistä kohteista on tuottanut puutteellisia tai virheellisiä tuloksia, ellei hakijalla ole ollut tietoa tapahtuneista aluemuutoksista. Insinöörityön tavoitteena on löytää keinoja hallita ajallisesti muuttuvaa paikkatietoa niin, että tietoihin tehtävien hakujen tarkkuus paranee.Insinöörityöraportissa kuvataan erityyppiset muutokset, joita Suomen kunnissa on niiden olemassaolon aikana tapahtunut. Erilaiset muutokset luokitellaan seitsemään tyyppiin, jotka ovat perustaminen, yhdistyminen, jakaantuminen, nimenmuutos, aluesaanti toisesta maasta ja alueluovutus toiselle maalle ja aluesiirto kahden kunnan välillä. Samaa jakoa voidaan käyttää myös muissa hallinnollisissa alueissa tapahtuneisiin muutoksiin. Kuntien välisten muutosten tietoja käytettiin Geologian tutkimuskeskuksen (GTK) valokuvatietokannassa. Tietokanta toteutettiin Imatch-ohjelman avulla. Kuvat voidaan tietokannassa merkitä siihen kuntaan, jossa ne on kuvattu, mutta kuvia voidaan hakea myös ajallisesti myöhemmän kuntajaon perusteella. Eri aikoina olleita kuntia kuvaavien kuntakategorioiden väliset yhteydet toteutettiin yhdistävien ja poissulkevien loogisten operaattoreiden avulla.Kuntamuutostietoja käytettiin myös valmistettaessa Suomen ajallista paikkaontologiaa osana FinnONTO-projektia. Työssä kuvataan ontologioiden valmistuksen perusteet ja SAPO-ontologian valmistuksessa käytetyt menetelmät, mm. ontologiaversioiden väliset muutossillat ja sijainnin todennäköisyyden laskevan päättelykoneen periaate. Suomen ajallinen ontologia tulee vapaaseen käyttöön ONKI-ontologiakirjaston kautta, ja ensimmäisenä sitä käytetään FinnONTO-projektin semanttisen webin tekniikoita esittelevässä KulttuuriSampo-portaalissa.Alustavien kokeiden perusteella paikkatiedon ajallisten muutosten huomioonottaminen parantaa selvästi hakujen osuvuutta sekä GTK:n valokuvatietokannassa että paljon historiallista aineistoa sisältävässä KulttuuriSampo-portaalissa. Näin ollen näiden uusien ajallisen paikkatiedon hallintaan soveltuvien menetelmien käyttöönottoa voidaan suositella kaikkeen paikannimistön perusteella tapahtuvaan tiedon annotointiin eli nimeämiseen ja hakuun.
Tomi Kauppinen and Eero Hyvönen:
Modeling and Reasoning about Changes in Ontology Time Series.
Ontologies: A Handbook of Principles, Concepts and Applications in Information Systems (Rajiv Kishore, Ram Ramesh and Raj Sharman (eds.)), Integrated Series in Information Systems, pp. 319-338, Springer-Verlag, New York (NY), January 15, 2007.
bib pdf
Onni Valkeapää, Olli Alm and Eero Hyvönen:
A Framework for Ontology-based Adaptable Content Creation on the Semantic Web. Journal of Universal Computer Science, 2007.
bib pdf Creation of rich, ontology-based metadata is one of the major challenges in developing the Semantic Web. Emerging applications utilizing semantic web techniques, such as semantic portals, cannot be realized if there are no proper tools to provide metadata for them. This paper discusses how to make provision of metadata easier and cost-effective by an annotation framework comprising of annotation editor combined with shared ontology services. We have developed an annotation system supporting distributed collaboration in creating annotations, and hiding the complexity of the annotation schema and the domain ontologies from the annotators. Our system adapts flexibly to different metadata schemas, which makes it suitable for different applications. Support for using ontologies is based on ontology services, such as concept searching and browsing, concept URI fetching, semantic autocompletion and linguistic concept extraction. The system is being tested in various practical semantic portal projects.
2006
Eetu Mäkelä, Eero Hyvönen and Samppa Saarela:
Ontogator -- A Semantic View-based Search Engine Service for Web Applications.
Proceedings of the 5th International Semantic Web Conference (ISWC 2006), Nov, 2006.
bib pdf View-based search provides a promising paradigm for formulating complex semantic queries and representing results on the Semantic Web. A challenge for the application of the paradigm is the complexity of providing view-based search services through application programming interfaces (API) and web services. This paper presents a solution on how semantic view-based search can be provided efficiently through an API or as web service to external applications. The approach has been implemented as the open source tool Ontogator, that has been applied successfully in several practical semantic portals on the web.
Eero Hyvönen, Tuukka Ruotsalo, Thomas Häggström, Mirva Salminen, Miikka Junnila, Mikko Virkkilä, Mikko Haaramo, Eetu Mäkelä, Tomi Kauppinen and and Kim Viljanen:
CultureSampo-Finnish Culture on the Semantic Web: The Vision and First Results.
Developments in Artificial Intelligence and the Semantic Web - Proceedings of the 12th Finnish AI Conference STeP 2006, October 26-27, 2006.
bib pdf This paper concerns the idea of publishing heterogenous cultural content on the Semantic Web. By heterogenous content we mean metadata describing potentially any kind of cultural objects, including artifacts, photos, paintings, videos, folklore, cultural sites, cultural process descriptions, biographies, history etc. The metadata schemas used are different and the metadata may be represented at different levels of semantic granularity. This work is an extension to previous research on semantic cultural portals, such as MuseumFinland, that are usually based on a shared homogeneous schema, such as Dublin Core, and focus on content of similar kinds, such as artifacts. Our experiences suggest that a semantically richer event-based knowledge representation scheme than traditional metadata schemas is needed in order to support reasoning when performing semantic search and browsing. The new key idea is to transform different forms of metadata into event-based knowledge about the entities and events that take place in the world or in fiction. This approach facilitates semantic interoperability and reasoning about the world and stories at the same time, which enables implementation of intelligent services for the end-user. These ideas are addressed by presenting the vision and solution approaches taken in two prototype implementations of a new kind of cross-domain semantic cultural portal “CULTURESAMPO—Finnish Culture on the Semantic Web”
Kim Viljanen, Teppo Känsälä, Eero Hyvönen and Eetu Mäkelä:
ONTODELLA - A Projection and Linking Service for Semantic Web Applications.
Proceedings of the 17th International Conference on Database and Expert Systems Applications (DEXA 2006), Krakow, Poland, pp. 370-376, IEEE, September 4-8, 2006.
bib pdf ps Content in semantic web portals is often projected along application specific navigational taxonomies and linked semantically. This paper presents a logic-based method and a server ONTODELLA for these tasks. We argue that logic rules between the content layer and the application layer add flexibility and better architectural separation of content and functionality. The system has been implemented and applied succesfully in several semantic portals.
Markus Holi and Eero Hyvönen:
Fuzzy View-Based Semantic Search.
Proceedings of the 1st Asian Semantic Web Conference (ASWC2006), Beijing, China, Springer-Verlag, September 3-7, 2006.
bib pdf
Eero Hyvönen and Eetu Mäkelä:
Semantic Autocompletion.
Proceedings of the first Asia Semantic Web Conference (ASWC 2006), Beijing, Springer-Verlag, New York, August 4-9, 2006.
bib pdf This paper generalizes the idea of traditional syntactic text autocompletion onto the semantic level. The idea is to autocomplete typed text into ontological categories instead of words in a vocabulary. The idea has been implemented and its application for semantic indexing and content-based information retrieval in multi-facet search is proposed. Four operational semantic portals on the web using the implementation are presented as application cases.
Eetu Mäkelä:
View-Based Search Interfaces for the Semantic Web. MSc Thesis, University of Helsinki, June, 2006.
bib pdf This thesis explores the possibilities of using the view-based search paradigm to create intelligent search interfaces on the Semantic Web. After surveying several current semantic search techniques, the view-based search paradigm is explained, and argued to fit in a valuable niche in the field. To test the argument, OntoViews, a semantic view-based search portal creation tool was designed and implemented, and eight portals with five vastly different user interfaces were built using it. Based on the results of these experiments, this thesis argues that the paradigm, particularly as implemented in the OntoViews tool provides a strong, extensible and flexible base on which to built semantic search applications. The particular problems faced in applying view-based search for semantic interfaces are noted, along with explanations on how they were solved in the OntoViews architecture. Finally, directions and ideas for future research are presented for both the paradigm and the implementation architecture, respectively.
2005
Markus Holi and Eero Hyvönen:
Modeling Degrees of Overlap in Semantic Web Ontologies.
Proceedings of the ISWC Workshop Uncertainty Reasoning for the Semantic Web (Paulo C. G. da Costa, Kathryn B. Laskey, Kenneth J. Laskey and Michael Pool (eds.)), CEUR Workshop Proceedings, Galway, Ireland, Nov, 2005.
bib pdf
Eetu Mäkelä, Kim Viljanen, Petri Lindgren, Mikko Laukkanen and Eero Hyvönen:
Semantic Yellow Page Service Discovery: The Veturi Portal.
Poster paper, 4th International Semantic Web Conference, Nov, 2005.
bib pdf A prototype semantic yellow page service portal is described. Our idea is to represent service offerings as events and processes in terms of ontologies. Based on versatile semantic descriptions, users can be provided with a flexible view-based search engine enhanced with semantic text autocompletion.
2004
Tomi Kauppinen and Eero Hyvönen:
Bridging the Semantic Gap between Ontology Versions.
Proceedings of the 11th Finnish AI Conference, Web Intelligence Symposium, Conference Series - No 20, vol. 2, pp. 63-72, Finnish Artificial Intelligence Society, Vantaa, Finland, September 1-3, 2004.
bib pdf
Eero Hyvönen, Tomi Kauppinen, Mirva Salminen, Kim Viljanen and Pekka Ala-Siuru (eds.):
Web Intelligence-Proceedings of the 11th Finnish AI Conference. September 1-3, 2004.
bib