Everyday online communities and social networks are accessed by millions of Web users, who produce a huge amount of user-generated content (UGC). The UGC and its publication context typically evolve over time and reflect the actual user interests and behaviors. Thus, the application of data mining techniques to discover the evolution of common user behaviors and topic trends is becoming an appealing research issue. Dynamic association rule mining is a well-established technique to discover correlations, among data collected in consecutive time periods, whose main quality indexes (e.g., support and confidence) exceed a given threshold and possibly vary from one time period to another. This Chapter presents the DyCoM (Dynamic Context Miner) data mining system. It entails the discovery of a novel and extended version of dynamic association rules, namely the dynamic generalized association rules, from both the content and the contextual features of the user-generated messages posted on Twitter. A taxonomy over contextual data features is semi-automatically built and exploited to discover dynamic correlations among data at different abstraction levels and their temporal evolution in a sequence of tweet collections. Experiments, performed on both real Twitter posts and synthetic datasets, show the effectiveness and the efficiency of the proposed DyCoM framework in supporting user behavior and topic trend analysis from Twitter.
|Title of host publication||Behavior Computing: Modeling, Analysis, Mining and Decision|
|Publisher||Springer-Verlag London Ltd|
|Number of pages||21|
|ISBN (Print)||9781447129691, 9781447129684|
|Publication status||Published - Jan 1 2012|
ASJC Scopus subject areas
- Computer Science(all)