Semi-automatic knowledge extraction to enrich open linked data

Elena Baralis, Giulia Bruno, Tania Cerquitelli, Silvia Chiusano, Alessandro Fiori, Alberto Grand

Research output: Chapter in Book/Report/Conference proceeding › Chapter

Abstract

In this chapter we present an analysis of the Wikipedia collection by means of the ELiDa framework, with the aim of enriching linked data. ELiDa is based on association rule mining, an exploratory technique for discovering relevant correlations hidden in the analyzed data. A persistent structure is used to store the large volume of extracted knowledge compactly and to retrieve it efficiently for further analysis. The domain expert is in charge of selecting the relevant knowledge by setting filtering parameters, assessing the quality of the extracted knowledge, and enriching it with semantic expressiveness that cannot be inferred automatically. As representative document collections, we consider seven datasets extracted from the Wikipedia collection. Each dataset has been analyzed from two points of view (i.e., transactions by documents and transactions by sentences) to highlight relevant knowledge at different levels of abstraction.
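The abstract does not describe ELiDa's implementation, so the following is only a minimal sketch of the general idea: the same text is turned into transactions either by document or by sentence, and simple one-to-one association rules are filtered by support and confidence thresholds. The function names, the tokenization, and the brute-force pair counting are all assumptions made for illustration, not the chapter's method.

```python
import re
from collections import Counter
from itertools import combinations


def transactions(docs, by="document"):
    """Build one set of lowercase terms per document or per sentence (hypothetical helper)."""
    if by == "document":
        units = docs
    else:
        # Naive sentence split on full stops; an assumption, not ELiDa's tokenizer.
        units = [s for d in docs for s in d.split(".") if s.strip()]
    return [set(re.findall(r"[a-z]+", u.lower())) for u in units]


def pair_rules(txns, min_support=0.5, min_confidence=0.8):
    """Return (antecedent, consequent, support, confidence) for rules x -> y."""
    n = len(txns)
    item_count = Counter(i for t in txns for i in t)
    pair_count = Counter(p for t in txns for p in combinations(sorted(t), 2))
    rules = []
    for (a, b), count in pair_count.items():
        support = count / n
        if support < min_support:
            continue  # filtering parameter: drop infrequent pairs
        for x, y in ((a, b), (b, a)):
            confidence = count / item_count[x]
            if confidence >= min_confidence:
                rules.append((x, y, support, confidence))
    return rules


docs = ["Turin is in Italy. Turin has a university", "Rome is in Italy"]
by_document = transactions(docs, by="document")
by_sentence = transactions(docs, by="sentence")
rules = pair_rules(by_sentence, min_support=0.6, min_confidence=1.0)
```

Switching `by` changes the granularity of co-occurrence: document-level transactions relate terms that appear anywhere in the same page, while sentence-level transactions only relate terms that appear close together. A real pipeline would also remove stop words and use a scalable miner such as Apriori or FP-growth rather than enumerating all pairs.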

Original language: English
Title of host publication: Cases on Open-Linked Data and Semantic Web Applications
Publisher: IGI Global
Pages: 156-180
Number of pages: 25
ISBN (Print): 9781466628274
Publication status: Published - 2013

ASJC Scopus subject areas

  • Computer Science (all)
