DATASET

The SMP dataset contains two subsets collected from Flickr (a photo-sharing platform), SMP-T1 and SMP-T2, one for each of the two tasks. For each task, we split the data in time order, giving a train-to-test ratio of 10:1. The tables below show the statistics of the SMP-T1 and SMP-T2 subsets.
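As an illustration of what a time-ordered 10:1 split looks like, here is a minimal sketch. It is not the organizers' actual procedure: the function name `time_ordered_split` and the `post_time` field are hypothetical, and the records are toy data.

```python
from typing import Dict, List, Tuple


def time_ordered_split(
    posts: List[Dict],
    time_key: str = "post_time",  # hypothetical field name, not from the dataset docs
    train_parts: int = 10,
    test_parts: int = 1,
) -> Tuple[List[Dict], List[Dict]]:
    """Sort posts by timestamp; the earliest go to train, the latest to test."""
    ordered = sorted(posts, key=lambda p: p[time_key])
    # Integer cut point so that train:test is approximately train_parts:test_parts.
    cut = len(ordered) * train_parts // (train_parts + test_parts)
    return ordered[:cut], ordered[cut:]


# Toy example: 11 posts with shuffled timestamps.
posts = [{"pid": i, "post_time": t}
         for i, t in enumerate([5, 3, 9, 1, 7, 2, 8, 4, 6, 0, 10])]
train, test = time_ordered_split(posts)
print(len(train), len(test))   # 10 1
print(test[0]["post_time"])    # 10 (the most recent post)
```

Splitting by time rather than at random means every test post is newer than every training post, which matches the setting of predicting the popularity of future posts.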

DOWNLOAD LINK

You can download the image URLs and their associated metadata here. The training data (including popularity labels) is available now.

Readme Document

SMP-T1 Train Image URLs (Path Sample: train/3@N58/1373.jpg)
SMP-T1 Train Data (includes image paths, metadata and labels)
SMP-T1 Additional Train Data for Time Zone

SMP-T1 Test Data (includes image download links, metadata and time zone)

SMP-T2 Train Image URLs (Path Sample: train/77@N93/551891.jpg)
SMP-T2 Train Data (includes image paths, metadata and labels)
SMP-T2 Additional Train Data for Time Zone

SMP-T2 Test Data (includes image download links, metadata and time zone)
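The path samples listed for the train image files (for example train/3@N58/1373.jpg) have three components. The sketch below splits such a path into its parts. The helper `parse_image_path` and the key names are hypothetical, and reading the middle directory as a per-user folder and the file stem as a photo identifier is an assumption; the Readme Document is the authoritative description.

```python
from pathlib import PurePosixPath
from typing import Dict


def parse_image_path(path: str) -> Dict[str, str]:
    """Split a '<split>/<directory>/<file>' image path into its components."""
    parts = PurePosixPath(path).parts
    if len(parts) != 3:
        raise ValueError(f"expected 3 path components, got {len(parts)}: {path!r}")
    split, owner_dir, filename = parts
    name = PurePosixPath(filename)
    return {
        "split": split,          # e.g. "train"
        "owner_dir": owner_dir,  # assumed to identify the user; not confirmed
        "stem": name.stem,       # assumed to identify the photo; not confirmed
        "suffix": name.suffix,   # file extension, e.g. ".jpg"
    }


print(parse_image_path("train/3@N58/1373.jpg"))
# {'split': 'train', 'owner_dir': '3@N58', 'stem': '1373', 'suffix': '.jpg'}
```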

Note that the datasets will ONLY be released to participants who have registered for the challenge during the competition. After the challenge completes, we will make the data publicly available to the whole research community.

SMP-T1 Statistics

Statistic                   Value
#Post                       432K
#User                       135
Temporal Range (Years)      6
Avg. Title Length           20
Avg. Tag Count              9
Avg. Description Length     114
Avg. Views                  131

SMP-T2 Statistics

Statistic                   Value
#Post                       340K
#User                       80K
#Categories                 11
Temporal Range (Months)     16
Avg. Title Length           26
#Tags                       669
#POIs                       103K
Avg. Views                  306

*In the SMP-T2 dataset, we provide the category information for each photo.