Combining semantics and social knowledge for news article summarization

Elena Baralis, Luca Cagliero, Saima Jabeen, Alessandro Fiori, Sajid Shah

Research output: Chapter in Book/Report/Conference proceedingChapter

Abstract

With the diffusion of online newspapers and social media, users are becoming capable of retrieving dozens of news articles covering the same topic in a short time. News article summarization is the task of automatically selecting a worthwhile subset of news' sentences that users could easily explore. Promising research directions in this field are the use of semantics-based models (e.g., ontologies and taxonomies) to identify key document topics and the integration of social data analysis to also consider the current user's interests during summary generation. The chapter overviews the most recent research advances in document summarization and presents a novel strategy to combine ontology-based and social knowledge for addressing the problem of generic (not query-based) multi-document summarization of news articles. To identify the most salient news articles' sentences, an ontology-based text analysis is performed during the summarization process. Furthermore, the social content acquired from real Twitter messages is separately analyzed to also consider the current interests of social network users for sentence evaluation. The combination of ontological and social knowledge allows the generation of accurate and easy-to-read news summaries. Moreover, the proposed summarizer performs better than the evaluated competitors on real news articles and Twitter messages.

Original languageEnglish
Title of host publicationData Mining and Analysis in the Engineering Field
PublisherIGI Global
Pages209-230
Number of pages22
ISBN (Print)9781466660878, 1466660864, 9781466660861
DOIs
Publication statusPublished - May 31 2014

ASJC Scopus subject areas

  • Computer Science(all)
  • Engineering(all)

Fingerprint Dive into the research topics of 'Combining semantics and social knowledge for news article summarization'. Together they form a unique fingerprint.

Cite this