Semi-automatic knowledge extraction to enrich open linked data

Elena Baralis, Giulia Bruno, Tania Cerquitelli, Silvia Chiusano, Alessandro Fiori, Alberto Grand

Research output: Chapter in Book/Report/Conference proceedingChapter

1 Citation (Scopus)

Abstract

In this chapter we present the analysis of the Wikipedia collection by means of the ELiDa framework with the aim of enriching linked data. ELiDa is based on association rule mining, an exploratory technique to discover relevant correlations hidden in the analyzed data. To compactly store the large volume of extracted knowledge and efficiently retrieve it for further analysis, a persistent structure has been exploited. The domain expert is in charge of selecting the relevant knowledge by setting filtering parameters, assessing the quality of the extracted knowledge, and enriching the knowledge with the semantic expressiveness which cannot be automatically inferred. We consider, as representative document collections, seven datasets extracted from the Wikipedia collection. Each dataset has been analyzed from two point of views (i.e., transactions by documents, transactions by sentences) to highlight relevant knowledge at different levels of abstraction.

Original languageEnglish
Title of host publicationCases on Open-Linked Data and Semantic Web Applications
PublisherIGI Global
Pages156-180
Number of pages25
ISBN (Print)9781466628274
DOIs
Publication statusPublished - 2013

Fingerprint

Association rules
Semantics

ASJC Scopus subject areas

  • Computer Science(all)

Cite this

Baralis, E., Bruno, G., Cerquitelli, T., Chiusano, S., Fiori, A., & Grand, A. (2013). Semi-automatic knowledge extraction to enrich open linked data. In Cases on Open-Linked Data and Semantic Web Applications (pp. 156-180). IGI Global. https://doi.org/10.4018/978-1-4666-2827-4.ch008

Semi-automatic knowledge extraction to enrich open linked data. / Baralis, Elena; Bruno, Giulia; Cerquitelli, Tania; Chiusano, Silvia; Fiori, Alessandro; Grand, Alberto.

Cases on Open-Linked Data and Semantic Web Applications. IGI Global, 2013. p. 156-180.

Research output: Chapter in Book/Report/Conference proceedingChapter

Baralis, E, Bruno, G, Cerquitelli, T, Chiusano, S, Fiori, A & Grand, A 2013, Semi-automatic knowledge extraction to enrich open linked data. in Cases on Open-Linked Data and Semantic Web Applications. IGI Global, pp. 156-180. https://doi.org/10.4018/978-1-4666-2827-4.ch008
Baralis E, Bruno G, Cerquitelli T, Chiusano S, Fiori A, Grand A. Semi-automatic knowledge extraction to enrich open linked data. In Cases on Open-Linked Data and Semantic Web Applications. IGI Global. 2013. p. 156-180 https://doi.org/10.4018/978-1-4666-2827-4.ch008
Baralis, Elena ; Bruno, Giulia ; Cerquitelli, Tania ; Chiusano, Silvia ; Fiori, Alessandro ; Grand, Alberto. / Semi-automatic knowledge extraction to enrich open linked data. Cases on Open-Linked Data and Semantic Web Applications. IGI Global, 2013. pp. 156-180
@inbook{f3b31b2e041745309d235a2b08aa1e85,
title = "Semi-automatic knowledge extraction to enrich open linked data",
abstract = "In this chapter we present the analysis of the Wikipedia collection by means of the ELiDa framework with the aim of enriching linked data. ELiDa is based on association rule mining, an exploratory technique to discover relevant correlations hidden in the analyzed data. To compactly store the large volume of extracted knowledge and efficiently retrieve it for further analysis, a persistent structure has been exploited. The domain expert is in charge of selecting the relevant knowledge by setting filtering parameters, assessing the quality of the extracted knowledge, and enriching the knowledge with the semantic expressiveness which cannot be automatically inferred. We consider, as representative document collections, seven datasets extracted from the Wikipedia collection. Each dataset has been analyzed from two point of views (i.e., transactions by documents, transactions by sentences) to highlight relevant knowledge at different levels of abstraction.",
author = "Elena Baralis and Giulia Bruno and Tania Cerquitelli and Silvia Chiusano and Alessandro Fiori and Alberto Grand",
year = "2013",
doi = "10.4018/978-1-4666-2827-4.ch008",
language = "English",
isbn = "9781466628274",
pages = "156--180",
booktitle = "Cases on Open-Linked Data and Semantic Web Applications",
publisher = "IGI Global",

}

TY - CHAP

T1 - Semi-automatic knowledge extraction to enrich open linked data

AU - Baralis, Elena

AU - Bruno, Giulia

AU - Cerquitelli, Tania

AU - Chiusano, Silvia

AU - Fiori, Alessandro

AU - Grand, Alberto

PY - 2013

Y1 - 2013

N2 - In this chapter we present the analysis of the Wikipedia collection by means of the ELiDa framework with the aim of enriching linked data. ELiDa is based on association rule mining, an exploratory technique to discover relevant correlations hidden in the analyzed data. To compactly store the large volume of extracted knowledge and efficiently retrieve it for further analysis, a persistent structure has been exploited. The domain expert is in charge of selecting the relevant knowledge by setting filtering parameters, assessing the quality of the extracted knowledge, and enriching the knowledge with the semantic expressiveness which cannot be automatically inferred. We consider, as representative document collections, seven datasets extracted from the Wikipedia collection. Each dataset has been analyzed from two point of views (i.e., transactions by documents, transactions by sentences) to highlight relevant knowledge at different levels of abstraction.

AB - In this chapter we present the analysis of the Wikipedia collection by means of the ELiDa framework with the aim of enriching linked data. ELiDa is based on association rule mining, an exploratory technique to discover relevant correlations hidden in the analyzed data. To compactly store the large volume of extracted knowledge and efficiently retrieve it for further analysis, a persistent structure has been exploited. The domain expert is in charge of selecting the relevant knowledge by setting filtering parameters, assessing the quality of the extracted knowledge, and enriching the knowledge with the semantic expressiveness which cannot be automatically inferred. We consider, as representative document collections, seven datasets extracted from the Wikipedia collection. Each dataset has been analyzed from two point of views (i.e., transactions by documents, transactions by sentences) to highlight relevant knowledge at different levels of abstraction.

UR - http://www.scopus.com/inward/record.url?scp=84898254104&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=84898254104&partnerID=8YFLogxK

U2 - 10.4018/978-1-4666-2827-4.ch008

DO - 10.4018/978-1-4666-2827-4.ch008

M3 - Chapter

SN - 9781466628274

SP - 156

EP - 180

BT - Cases on Open-Linked Data and Semantic Web Applications

PB - IGI Global

ER -