BookSampo - Semantic Portal for Finnish Fiction Literature
What is BookSampo?
BookSampo is a semantic portal in use, covering metadata about
practically all Finnish fiction literature of Finnish public libraries on
a work level. The system introduces a variety of semantic web novelties
deployed into practice: the underlying data model is based on the emerging functional, content-centered
metadata indexing paradigm using RDF. Linked Data (LD) principles are used for mapping the
metadata with tens of interlinked ontologies in the national FinnONTO ontology infrastructure.
The contents are also linked with the large LD metadata repository of related
cultural heritage content of CultureSampo. BookSampo is actually based on using CultureSampo
as a semantic web service, demonstrating the idea of re-using semantic content from multiple
perspectives without the need for modifications.
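As a rough illustration of work-level, content-centered indexing in RDF, the sketch below stores content descriptions on a work and lets every edition inherit them through a link to that work. This is one plausible reading of the model, not the actual BookSampo schema: the `http://example.org/` namespace, the `theme` and `editionOf` properties, and all resource names are hypothetical.

```python
# Minimal sketch of work-level indexing with RDF-style triples.
# Every URI and property name here is hypothetical, not BookSampo's schema.
EX = "http://example.org/booksampo/"
RDF_TYPE = "http://www.w3.org/1999/02/22-rdf-syntax-ns#type"

TRIPLES = [
    # Content descriptions (themes) are attached to the work itself ...
    (EX + "work/1", RDF_TYPE, EX + "Work"),
    (EX + "work/1", EX + "theme", EX + "concept/loneliness"),
    # ... while editions only point back to the work they manifest.
    (EX + "edition/1a", EX + "editionOf", EX + "work/1"),
    (EX + "edition/1b", EX + "editionOf", EX + "work/1"),
    (EX + "work/2", RDF_TYPE, EX + "Work"),
    (EX + "work/2", EX + "theme", EX + "concept/war"),
    (EX + "edition/2a", EX + "editionOf", EX + "work/2"),
]


def match(triples, s=None, p=None, o=None):
    """Return all triples matching the pattern; None acts as a wildcard."""
    return [
        t for t in triples
        if (s is None or t[0] == s)
        and (p is None or t[1] == p)
        and (o is None or t[2] == o)
    ]


def editions_with_theme(triples, theme):
    """Find editions whose underlying work is indexed with the given theme."""
    works = {s for s, _, _ in match(triples, p=EX + "theme", o=theme)}
    return sorted(
        s for s, _, o in match(triples, p=EX + "editionOf") if o in works
    )
```

Because the theme is stated once on the work, a new edition needs only a single `editionOf` triple to become findable by that theme.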
Public Portal On the Web
The portal has been online since autumn 2011 at:
The BookSampo portal interface application was implemented by the Finnish public library consortium Kirjastot.fi.
The system is based on the fiction literature ontology KAUNO (developed from the Kaunokki thesaurus) and other ontologies developed in the FinnONTO project, together with metadata from library databases, biographies, review articles, and other sources. The portal's content, search, and recommendation services are provided through the APIs of the
CultureSampo system, which contains all semantic content of the system interlinked in RDF with other cultural materials.
Collaborative Content Creation
Most of the BookSampo content has been transformed automatically from existing databases,
with the help of ontologies derived from thesauri in use in Finland,
but in addition dozens of volunteer librarians have participated in a Web 2.0
fashion in annotating and correcting the metadata, especially for older literature, using the SAHA metadata editor connected to the ONKI ontology services of FinnONTO.
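The automatic transformation step can be pictured roughly as follows: a legacy record's free-text keywords are looked up in a thesaurus-derived mapping to ontology concepts, and terms with no match are set aside for manual annotation. This is only a sketch under stated assumptions. The record layout, the `TERM_TO_CONCEPT` table, the `http://example.org/` namespace, and the property names are all hypothetical and do not reflect the actual conversion pipeline.

```python
# Sketch: convert a legacy library record into RDF triples (N-Triples).
# The namespace, properties, record fields and mapping are hypothetical.
BASE = "http://example.org/"


class URI(str):
    """Marks a string as a URI so it is not serialized as a literal."""


# Hypothetical thesaurus-term -> ontology-concept lookup table.
TERM_TO_CONCEPT = {
    "rakkaus": URI(BASE + "onto/love"),
    "sota": URI(BASE + "onto/war"),
}


def record_to_triples(record):
    """Return (triples, unmapped_terms) for one legacy record."""
    work = URI(BASE + "work/" + str(record["id"]))
    triples = [(work, URI(BASE + "title"), record["title"])]
    unmapped = []
    for term in record["keywords"]:
        concept = TERM_TO_CONCEPT.get(term.strip().lower())
        if concept is None:
            unmapped.append(term)  # left for a human annotator to resolve
        else:
            triples.append((work, URI(BASE + "theme"), concept))
    return triples, unmapped


def to_ntriples(triples):
    """Serialize triples as N-Triples lines (minimal literal escaping only)."""
    lines = []
    for s, p, o in triples:
        if isinstance(o, URI):
            obj = "<" + o + ">"
        else:
            escaped = (
                o.replace("\\", "\\\\").replace('"', '\\"').replace("\n", "\\n")
            )
            obj = '"' + escaped + '"'
        lines.append("<" + s + "> <" + p + "> " + obj + " .")
    return "\n".join(lines)
```

Returning the unmapped terms separately mirrors the division of labour described above: what the mapping cannot resolve is handed over for manual correction rather than guessed.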
Dr Eetu Mäkelä
Helsinki University of Technology, Department of Media Technology
first.last [ at ] aalto.fi
Professor Eero Hyvönen
Helsinki University of Technology, Department of Media Technology and University of Helsinki
first.last [ at ] aalto.fi
Publications about BookSampo
Eetu Mäkelä, Kaisa Hypén and Eero Hyvönen: Fiction Literature as Linked Open Data - the BookSampo Dataset. Semantic Web – Interoperability, Usability, Applicability, vol. 4, no. 3, pp. 299-306, 2013.
The BookSampo dataset provides information as linked data on fiction literature published in Finland going back to the 15th century, along with rich descriptions of both its content and context. The dataset contains data on nearly 400,000 subjects, including literary works, authors, book covers, reviews, awards, images, and movies, over 3 million triples in total. The data has been applied as the basis of the BookSampo portal in public use in Finland, and is aligned with the cross-domain cultural heritage contents and ontologies of CultureSampo, another in-use semantic portal. The data has been used to answer complex questions, such as which topics one should write about in order to win a literary award (based on statistics). The metadata was transformed into RDF from legacy library databases, then enriched manually by dozens of librarians in a Web 2.0 fashion in Finnish public libraries, and is constantly updated at a rate of some 90,000 new triples monthly.
Eetu Mäkelä, Kaisa Hypén and Eero Hyvönen: Improving Fiction Literature Access by Linked Open Data-Based Collaborative Knowledge Storage - the BookSampo Project. World Library and Information Congress: 78th IFLA General Conference and Assembly, Helsinki, IFLA, http://conference.ifla.org/ifla78, August 2012.
BookSampo is a joint project between the Finnish public libraries and semantic web researchers, to improve fiction literature search and recommendation. In the project, dozens of librarians around Finland have used a collaborative web-based metadata editor to input diverse knowledge about fiction literature into a shared database. Particularly, the project has sought to improve access by indexing not only bibliographical information about the books, but focusing on the content and context of the works. In order to do this, the database employs advanced techniques such as functional, content-centered indexing, ontological vocabularies and the networked data model of linked open data. To demonstrate the functionality this makes possible, the fiction literature portal http://www.kirjasampo.fi/ was created. This portal uses the knowledge created in the project to offer advanced semantic search and recommendation based on the database created. In addition, web services exposing direct access to the data have been used for example in culture hack events to answer more complex questions, such as where in Finland are the most crimes committed in fiction literature.
Eetu Mäkelä, Kaisa Hypén and Eero Hyvönen: BookSampo - Lessons Learned in Creating a Semantic Portal for Fiction Literature. Proceedings of ISWC-2011, Bonn, Germany, Springer-Verlag, 2011.
BookSampo is a semantic portal in use, covering metadata about practically all Finnish fiction literature of Finnish public libraries on a work level. The system introduces a variety of semantic web novelties deployed into practice: the underlying data model is based on the emerging functional, content-centered metadata indexing paradigm using RDF. Linked Data (LD) principles are used for mapping the metadata with tens of interlinked ontologies in the national FinnONTO ontology infrastructure. The contents are also linked with the large LD metadata repository of related cultural heritage content of CultureSampo. BookSampo is actually based on using CultureSampo as a semantic web service, demonstrating the idea of re-using semantic content from multiple perspectives without the need for modifications. Most of the content has been transformed automatically from existing databases, with the help of ontologies derived from thesauri in use in Finland, but in addition dozens of volunteer librarians have participated in a Web 2.0 fashion in annotating and correcting the metadata, especially regarding older literature. For this purpose, semantic web editing tools and public ONKI ontology services were created and used. The paper focuses on lessons learned in the process of creating the semantic web basis of BookSampo.
Kaisa Hypén and Eetu Mäkelä: An ideal model for an information system for fiction and its application: Kirjasampo and Semantic Web. Library Review, vol. 60, no. 4, April 2011.
Purpose – Library Director Jarmo Saarti introduced a wide or ideal model for fiction in literature in his dissertation, published in 1999. It introduces those aspects that should be included in an information system for fiction. Such aspects include literary prose and its intertextual references to other works, the writer, readers' and critics' receptions of the work, as well as a researcher's view. It is also important to note how libraries approach a literary work by means of inventory, classification and content description. The most ambiguous of the aspects relates to the context in cultural history that the work reflects and is a part of. The paper aims to discuss these issues. Design/methodology/approach – Since the model consists of several components which are not found in present library information systems and cannot be implemented by them, a new way had to be found to produce, save, process and present fiction-related metadata. The Semantic Computing Research Group of Aalto University has developed several Semantic Web services for use in the field of culture, so cooperation with it and the use of Semantic Web tools were a natural starting point for the construction of the new service. Kirjasampo will be based on the Semantic Web RDF data model. The model enables a flexible linking of metadata derived from different sources, and it can be used to build a Semantic Web that can be approached contextually from different angles. Findings – The "semantically enriched" ideal model for fiction has hence been realised, at least to some extent: Kirjasampo supports literature-related metadata that is more varied than earlier and aims to account for different contexts within literature and connections with regard to other cultural phenomena. It also includes contemporary reviews of works and, as such, readers' receptions as well. Modern readers can share their views on works, once the user interface of the server is completed. It will include several features from the Kirjasto 2.0 application, which enables the evaluation, description and recommendation of works. The service should be online by the end of Spring 2011. Research limitations/implications – The project involves novel collaboration between a public library and a computer science research unit, and utilises a novel approach to the description of fiction. Practical implications – The system encourages user participation in the description of fiction and is of practical benefit to librarians in understanding both how fiction is organised and how users interpret the same. Originality/value – Upon completion, the service will be the first Finnish information system for libraries built with the tools of the Semantic Web which offers a completely new user environment and application for data produced by libraries. It also strives to create a new model for saving and producing data, available to both library professionals and readers. The aim is to save, accumulate and distribute literary knowledge, experiences and silent information.
Kirjasampo (BookSampo) is a portal that makes use of semantic web technology.
It is based on technology, methods, and publishing concepts developed in the national
FinnONTO project (2003-2012), led by the Semantic Computing Research Group (SeCo)
of Aalto University and the University of Helsinki.
The project was funded by Tekes and a consortium of about 40 organizations, including Kirjastot.fi (Helsinki City Library - Central Library for Public Libraries).
The core of the material originates from the Helmet database, which was converted into the semantic web RDF format.
After this, the material was enriched, especially for older works, by dozens of volunteer librarians
using the SAHA metadata editor.
Following Linked Data principles, the material was bridged into a large linked repository
that contains hundreds of thousands of different Finnish cultural objects linked to one another, now including Finnish fiction at the work level.
The content and services of Kirjasampo are based on the semantic services provided by Kulttuurisampo (CultureSampo), which
Kirjasampo uses through its APIs. The Kirjasampo user interface was designed and implemented by the Kirjastot.fi community
independently, as a project of its own.
More information and research articles on the work behind Kirjasampo can be found on the project's research page.