SMP Challenge News: SMP Challenge dataset was released at Here!


People are interested in predicting the future. For example, which films will bomb or who will win the upcoming Grammy awards? Making predictions about the future in many aspects is not only fun matters but can bring real value to those who correctly predict the course of world events, such as which stocks are the best purchases for short-term gains. Predictive analytics is thus a field that has attracted major attention in both academia and the industry.

As social media has become an inseparable part of modern life, there has been increasing interest in research of leveraging and exploiting social media as an information source for inferring rich social facts and knowledge. Right now, a large number of social media datasets have been established for various research tasks and helped lead to great advancements in social media technology and applications.

Therefore, as a joint activity with the research teams from the Chinese Academy of Sciences (CAS), Academia Sinica (AS), and Microsoft Research Asia (MSRA), we are releasing a large-scale social media dataset for sociological understanding and predictions, namely Social Media Prediction (SMP) dataset, with over 770K posts and 80K users in total. Our goal is to make the SMP dataset as varied and rich as possible to thoroughly represent the social media “world”.


The Challenge is a team-based contest. Each team can have one or more members, and an individual can not be a member of multiple teams. At the end of the Challenge, all teams will be ranked based on both objective evaluation and human evaluation. The top performing teams will receive award certificates and/or cash prizes. At the same time, all accepted submissions are qualified for the conference’s grand challenge award competition.


If you intend to publish results that use the information and resources provided by this challenge, please include the following references:

  title={Sequential Prediction of Social Media Popularity with Deep Temporal Context Networks},
  author={Wu, Bo and Cheng, Wen-Huang and Zhang, Yongdong and Qiushi, Huang and Jintao, Li and Mei, Tao},
  booktitle={International Joint Conference on Artificial Intelligence (IJCAI)},
  location = {Melbourne, Australia}}
  author = {Wu, Bo and Mei, Tao and Cheng, Wen-Huang and Zhang, Yongdong},
  title = {Unfolding Temporal Dynamics: Predicting Social Media Popularity Using Multi-scale Temporal Decomposition},
  booktitle = {Proceedings of the Thirtieth AAAI Conference on Artificial Intelligence (AAAI)}
  year = {2016},
  location = {Phoenix, Arizona}}
  author = {Wu, Bo and Cheng, Wen-Huang and Zhang, Yongdong and Mei, Tao},
  title = {Time Matters: Multi-scale Temporalization of Social Media Popularity},
  booktitle = {Proceedings of the 2016 ACM on Multimedia Conference (ACM MM)},
  year = {2016},
  location = {Amsterdam, The Netherlands}}


Unless otherwise stated, all deadlines are at 23:59 Anywhere on Earth (AoE), UTC-12.
  • April 15, 2017

    Dataset available for download (training set)

  • June 8, 2017

    Test set available for download

  • June 25, 2017

    Results submission (for Task1 and Task2)

  • June 28, - June 30, 2017

    Objective evaluation and human evaluation (for Task1 and Task2)

  • July 1, 2017

    Evaluation results announce (for Task1 and Task2)

  • July 14, 2017

    Paper (all tasks) submission deadline (please follow the instructions on the main conference website)

  • September 5, 2017

    Dataset released (training and testing dataset)


  • wenhuangcheng
  • wubo
  • QiushiHuang
  • YongdongZhang
  • meitao