» back to normal layout

Tervetuloa julkistustilaisuuteen Aalto-yliopistossa ja online:


Kirjasampo 2.0 – semanttinen haku, selailu ja data-analyysit

Suomalaisen kirjallisuuden päivänä, tiistaina 10.10.2023, klo 13:00–15:00
Aalto-yliopisto, TUAS-talo, sali AS2, Maarintie 8, Otaniemi

Semanttisen laskennan tutkimusryhmä (SeCo), Aalto-yliopiston tietotekniikan laitos, Helsingin yliopiston Digitaalisten ihmistieteiden keskus HELDIG ja yleisten kirjastojen Kirjastot.fi-verkkopalvelut kutsuvat Teidät avoimeen ja maksuttomaan tilaisuuteen, jossa julkistetaan prototyyppi Kirjasampo-palvelun uudesta semanttisesta käyttöliittymästä Kirjasampo 2.0 sekä sen perustana olevasta linkitetyn avoimen datan palvelusta. Kirjasampo 2.0 tarjoaa uuden vaihtoehtoisen tavan hakea ja tutkia suomalaista kirjallisuutta digitaalisten ihmistieteiden menetelmin täydentäen nykyistä Kirjasampo.fi:n käyttöliittymää.


Kirjasampo 2.0:n etusivu ja viisi sovellusnäkymää


Mikä on Kirjasampo 2.0?

Kirjasampo on yleisten kirjastojen tarjoama tietopalvelu, joka julkaisee rikasta linkitettyä tietoa lähes kaikesta Suomessa julkaistusta kaunokirjallisuudesta ja kirjailijoista. Nykyään aineistoon kuuluu myös mm. tietokirjoja. Järjestelmällä on vuosittain n. 1,6 miljoonaa käyttäjää, ja se on osa yleisten kirjastojen valtakunnallisia verkkopalveluita Kirjastot.fi. Järjestelmän semanttisen webin teknologioihin perustuva prototyyppi kehitettiin Aalto-yliopistossa ja Helsingin yliopistossa osana kansallista FinnONTO-hankesarjaa 2003–2012 ja se otettiin julkiseen käyttöön vuonna 2011 yleisten kirjastojen toimesta. Kirjasampo kuuluu laajempaan Sampo-datapalveluiden ja -portaalien sarjaan.

Kirjasammon nykyinen käyttöliittymä toteutettiin perinteisin menetelmin Kirjastot.fi-organisaation toimesta eikä se ole nykyisen Sampo-mallin mukainen, mikä mahdollistaisi aineistojen semanttisen haun, selailun, data-analyysit ja visualisoinnit. Nykyisen käyttöliittymän rikastamiseksi uusilla ominaisuuksilla suunniteltiin Aalto-yliopistossa yhteistyössä yleisten kirjastojen kanssa uusi, Sampo-mallin mukainen käyttöliittymän prototyyppi, Kirjasampo 2.0 – semanttinen haku, selailu ja data-analyysit kirjallisuuden tutkimiseen, joka toteutettiin Sampo-UI-työkalun avulla. Kirjasammon taustalla olevaa tutkimustyötä on esitelty tarkemmin hankkeen kotisivulla, jolta löytyy myös aiheeseen liittyviä artikkeleita.

Kirjasampo on tarkoitus liittää osaksi Suomen Akatemian rahoittamaa kansallista FIN-CLARIAH-tutkimusinfrastruktuuria.


Datapalvelu ja portaali verkossa

Kirjasampo-data julkaistaan verkossa toiminnallisena SPARQL-rajapintana Linked Data Finland -alustalla (LDF.fi):

https://www.ldf.fi/dataset/kirjasampo

Kirjasampo 2.0 -portaali, joka perustuu em. SPARQL-rajapintapalveluun, avataan osoitteessa:

https://analyysi.kirjasampo.fi

Lisätietoa Kirjasammosta


Alustava ohjelma

Tilaisuus alkaa Kirjasampo 2.0 -hankkeen vetäjän prof. Eero Hyvösen esitelmällä. Tämän jälkeen kuullaan yleisten kirjastojen edustajien Matti Sarmelan, Kaisa Hypénin ja Tuomas Aitonurmen puheenvuorot. Sitten projektiryhmän väitöskirjatutkija Heikki Rantala kertoo uuden käyttöliittymän toteutuksessa käytetystä Sampo-UI-työkalusta ja DI Annastiina Ahola esittelee, miten Kirjasampo 2.0:n datapalvelu ja semanttinen portaali luotiin ja miten tätä käytetään. Lopuksi FM Telma Peura esittelee datapalvelun avulla tehtyä kirjallisuuden tutkimusta.


Ilmoittaudu tilaisuuteen Aallossa tai etäyhteydelle Zoomin kautta

Julkistustilaisuus on kaikille avoin ja maksuton. Ilmoittautumalla saat sähköpostitse Zoom-linkin, jolla voit osallistua tilaisuuteen etänä.


Kirjasampoon liittyvät tutkimusartikkelit

2025

Eero Hyvönen, Annastiina Ahola, Petri Leskinen and Jouni Tuominen: Aggregating and Aligning Knowledge Graphs into a Global Service: SampoSampo System for Cross-cultural Data Search, Exploration, and Analysis. 2025. Abstract, submitted for peer review. bib pdf
Eero Hyvönen, Petri Leskinen, Henna Poikkimäki, Heikki Rantala, Jouni Tuominen, Senka Drobac, Ossi Koho, Ilona Pikkanen and Hanna-Leena Paloposki: Searching, exploring, and analyzing historical letters and the underlying networks: LetterSampo Finland (1809–1917) data service and semantic portal. 2025. Abstract, submitted for peer review. bib pdf

2024

Eero Hyvönen: How to Create a National Cross-domain Ontology and Linked Data Infrastructure and Use It on the Semantic Web. Semantic Web - Interoperability, Usability, Applicability, IOS Press, 2024. DOI: 10.3233/SW-243468. bib pdf link

2023

Heikki Rantala, Annastiina Ahola, Esko Ikkala and Eero Hyvönen: How to create easily a data analytic semantic portal on top of a SPARQL endpoint: introducing the configurable Sampo-UI framework. VOILA! 2023 Visualization and Interaction for Ontologies, Linked Data and Knowledge Graphs 2023, CEUR Workshop Proceedings, Vol. 3508, October, 2023. bib pdf link
Annastiina Ahola and Eero Hyvönen: Visualizing Literary Linked Data for Public Library Users in the New User Interface for BookSampo – Finnish Fiction Literature on the Semantic Web. VOILA! 2023 Visualization and Interaction for Ontologies, Linked Data and Knowledge Graphs 2023, CEUR Workshop Proceedings, Vol. 3508, July, 2023. bib pdf link
Annastiina Ahola, Telma Peura and Eero Hyvönen: Interfacing the BookSampo Knowledge Graph of Finnish Literature for Data Analyses in Digital Humanities. DARIAH Annual Event 2023, poster paper, DARIAH-EU, June, 2023. bib link
Eero Hyvönen: Creating and Using a National Linked Open Data Infrastructure for Cultural Heritage Applications and Digital Humanities Research: Lessons Learned. DARIAH Annual Event 2023, Budapest, Hungary, abstracts of papers, DARIAH-EU, June, 2023. bib link
Annastiina Ahola, Eero Hyvönen and Heikki Rantala: A User Interface Model for Digital Humanities Research: Case BookSampo – Finnish Fiction Literature on the Semantic Web. Proceedings of ESWC 2023, poster and demo papers, Springer-Verlag, June, 2023. bib
Annastiina Ahola, Eero Hyvönen and Heikki Rantala: A User Interface Model for Digital Humanities Research: Case BookSampo – Finnish Fiction Literature on the Semantic Web. Proceedings of ESWC 2023, poster and demo papers, Springer-Verlag, June, 2023. bib pdf
Eero Hyvönen: How to Create a National Cross-domain Ontology and Linked Data Infrastructure and Use It on the Semantic Web. Programming and Data Infrastructure in Digital Humanities, Book of Abstracts, pp. 7, High Performance Computing Centre, University of Évora, Portugal, March, 2023. bib link
Annastiina Ahola: Developing a tool for information retrieval and research purposes utilizing BookSampo data. MSc Thesis (in English), Aalto University, Department of Computer Science, February, 2023. bib pdf link
Telma Peura: Suomeksi yli rajojen. Kvantitatiivinen tutkimus suomenkielisten romaanien monimuotoisuudesta 1970-2020. MSc Thesis (in Finnish), University of Helsinki, Department of Digital Humanities, Helsinki Centre for Digital Humanities (HELDIG), January, 2023. bib pdf link
Eero Hyvönen: Digital Humanities on the Semantic Web: Sampo Model and Portal Series. Semantic Web – Interoperability, Usability, Applicability, vol. 14, no. 4, pp. 729-744, IOS Press, 2023. bib pdf link

2022

Telma Peura, Petri Leskinen and Eero Hyvönen: What Linked Data Can Tell about Geographical Trends in Finnish Fiction Literature - Using the BookSampo Knowledge Graph in Digital Humanities. 2022. Abstract under peer review. bib
Eero Hyvönen, Annastiina Ahola and Esko Ikkala: BookSampo Fiction Literature Knowledge Graph Revisited: Building a Faceted Search Interface with Seamlessly Integrated Data-analytic Tools. Theory and Practice of Digital Libraries (TDPL 2022), Accelerating Innovations Track, Padova, Italy, pp. 506–511, Springer, 2022. bib pdf link

2020

Eero Hyvönen: Semantic Sampo Portals for Digital Humanities Based on a National Linked Open Data Infrastructure. 2020. White paper, Aalto University, Semantic Computing Research Group (SeCo). bib pdf
Eero Hyvönen: Sampo Model and Semantic Portals for Digital Humanities on the Semantic Web. DHN 2020 Digital Humanities in the Nordic Countries. Proceedings of the Digital Humanities in the Nordic Countries 5th Conference, pp. 373-378, CEUR Workshop Proceedings, vol. 2612, Riga, Latvia, October, 2020. bib pdf link

2019

Eero Hyvönen: Linked Data in Use: Sampo Portals on the Semantic Web. EuropaNow, Council for European Studies (CES), Columbia University, September, 2019. bib pdf link

2013

Eetu Mäkelä, Kaisa Hypén and Eero Hyvönen: Fiction Literature as Linked Open Data - the BookSampo Dataset. Semantic Web – Interoperability, Usability, Applicability, vol. 4, no. 3, pp. 299-306, 2013. bib pdf link
The BookSampo dataset provides information as linked data on fiction literature published in Finland going back to the 15th century, along with rich descriptions of both their content and context. The dataset contains data on nearly 400,000 subjects, including literary works, authors, book covers, reviews, awards, images, and movies, over 3 million triples in total. The data has been applied as the basis of the BookSampo portal in public use in Finland, and is aligned with the cross-domain cultural heritage contents and ontologies of CultureSampo, another in-use semantic portal. The data has been used to answer complex questions, such as what topics should one write about, if one wants to get a literary award (based on statistics). The metadata was transformed into RDF from legacy library databases, then enriched manually by dozens of librarians in a Web 2.0 fashion in Finnish public libraries, and is constantly updated at a rate of some new 90,000 triples monthly.

2012

Eetu Mäkelä, Kaisa Hypén and Eero Hyvönen: Improving Fiction Literature Access by Linked Open Data -Based Collaborative Knowledge Storage - the BookSampo Project. World Library and Information Congress: 78th IFLA General Conference and Assembly, Helsinki, IFLA, http://conference.ifla.org/ifla78, August, 2012. bib pdf
BookSampo is a joint project between the Finnish public libraries and semantic web researchers, to improve fiction literature search and recommendation. In the project, dozens of librarians around Finland have used a collaborative web-based metadata editor to input diverse knowledge about fiction literature into a shared database. Particularly, the project has sought to improve access by indexing not only bibliographical information about the books, but focusing on the content and context of the works. In order to do this, the database employs advanced techniques such as functional, content-centered indexing, ontological vocabularies and the networked data model of linked open data. To demonstrate the functionality this makes possible, the fiction literature portal http://www.kirjasampo.fi/ was created. This portal uses the knowledge created in the project to offer advanced semantic search and recommendation based on the database created. In addition, web services exposing direct access to the data have been used for example in culture hack events to answer more complex questions, such as where in Finland are the most crimes committed in fiction literature.

2011

Eetu Mäkelä, Kaisa Hypén and Eero Hyvönen: BookSampo--Lessons Learned in Creating a Semantic Portal for Fiction Literature. The Semantic Web - ISWC 2011 - 10th International Semantic Web Conference, Bonn, Germany, pp. 173-188, Springer-Verlag, 2011. bib pdf link
BookSampo is a semantic portal in use, covering metadata about practically all Finnish fiction literature of Finnish public libraries on a work level. The system introduces a variety of semantic web novelties deployed into practise: The underlying data model is based on the emerging functional, content-centered metadata indexing paradigm using RDF. Linked Data (LD) principles are used for mapping the metadata with tens of interlinked ontologies in the national FinnONTO ontology infrastructure. The contents are also linked with the large LD metadata repository of related cultural heritage content of CultureSampo. BookSampo is actually based on using CultureSampo as a semantic web service, demonstrating the idea of re-using semantic content from multiple perspectives without the need for modifications. Most of the content has been transformed automatically from existing databases, with the help of ontologies derived from thesauri in use in Finland, but in addtion tens of volunteered librarians have participated in a Web 2.0 fashion in annotating and correcting the metadata, especially regarding older litarature. For this purpose, semantic web editing tools and public ONKI ontology services were created and used. The paper focuses on lessons learned in the process of creating the semantic web basis of BookSampo.
Kaisa Hypén and Eetu Mäkelä: An ideal model for an information system for fiction and its application: Kirjasampo and Semantic Web. Library Review, vol. 60, no. 4, April, 2011. bib link
Purpose – Library Director Jarmo Saarti introduced a wide or ideal model for fiction in literature in his dissertation, published in 1999. It introduces those aspects that should be included in an information system for fiction. Such aspects include literary prose and its intertextual references to other works, the writer, readers and critics receptions of the work as well as a researcher s view. It is also important to note how libraries approach a literary work by means of inventory, classification and content description. The most ambiguous of the aspects relates to that context in cultural history, which the work reflects and is a part of. The paper aims to discuss these issues. Design/methodology/approach – Since the model consists of several components which are not found in present library information systems and cannot be implemented by them, a new way had to be found to produce, save, process and present fiction‐related metadata. The Semantic Computing Research Group of Aalto University has developed several Semantic Web services for use in the field of culture, so cooperation with it and the use of Semantic Web tools were a natural starting point for the construction of the new service. Kirjasampo will be based on the Semantic Web RDF data model. The model enables a flexible linking of metadata derived from different sources, and it can be used to build a Semantic Web that can be approached contextually from different angles. Findings – The “semantically enriched” ideal model for fiction has hence been realised, at least to some extent: Kirjasampo supports literature‐related metadata that is more varied than earlier and aims to account for different contexts within literature and connections with regard to other cultural phenomena. It also includes contemporary reviews of works and, as such, readers receptions as well. Modern readers can share their views on works, once the user interface of the server is completed. It will include several features from the Kirjasto 2.0‐application, which enables the evaluation, description and recommendations of works. The service should be online by the end of Spring 2011. Research limitations/implications – The project involves novel collaboration between a public library and a computer science research unit, and utilises a novel approach to the description of fiction. Practical implications – The system encourages user participation in the description of fiction and is of practical benefit to librarians in understanding both how fiction is organised and how users interpret the same. Originality/value – Upon completion, the service will be the first Finnish information system for libraries built with the tools of the Semantic Web which offers a completely new user environment and application for data produced by libraries. It also strives to create a new model for saving and producing data, available to both library professionals and readers. The aim is to save, accumulate and distribute literary knowledge, experiences and silent information.
/var/www/html/include/secoweb/utils.php; Thu, 28 Nov 2024 01:09:35 +0000