Open Access
December 2018 Marked self-exciting point process modelling of information diffusion on Twitter
Feng Chen, Wai Hong Tan
Ann. Appl. Stat. 12(4): 2175-2196 (December 2018). DOI: 10.1214/18-AOAS1148

Abstract

Information diffusion occurs on microblogging platforms like Twitter as retweet cascades. When a tweet is posted, it may be retweeted and henceforth further retweeted, and the retweeting process continues iteratively and indefinitely. A natural measure of the popularity of a tweet is the number of retweets it generates. Accurate predictions of tweet popularity can assist Twitter to rank contents more effectively and facilitate the assessment of potential for marketing and campaigning strategies. In this paper, we propose a model called the Marked Self-Exciting Process with Time-Dependent Excitation Function, or MaSEPTiDE for short, to model the retweeting dynamics and to predict the tweet popularity. Our model does not require expensive feature engineering but is capable of leveraging the observed dynamics to accurately predict the future evolution of retweet cascades. We apply our proposed methodology on a large amount of Twitter data and report substantial improvement in prediction performance over existing approaches in the literature.

Citation

Download Citation

Feng Chen. Wai Hong Tan. "Marked self-exciting point process modelling of information diffusion on Twitter." Ann. Appl. Stat. 12 (4) 2175 - 2196, December 2018. https://doi.org/10.1214/18-AOAS1148

Information

Received: 1 August 2017; Revised: 1 February 2018; Published: December 2018
First available in Project Euclid: 13 November 2018

zbMATH: 07029451
MathSciNet: MR3875697
Digital Object Identifier: 10.1214/18-AOAS1148

Keywords: B-spline , forecast , Hawkes process , integral equation , nonstationary self-exciting point process , popularity prediction , simulation

Rights: Copyright © 2018 Institute of Mathematical Statistics

Vol.12 • No. 4 • December 2018
Back to Top