Unless specified otherwise, the following datasets are released under a Creative Commons Attribution-NonCommercial-ShareAlike 3.0 Unported License.

xHamster

This in an exhaustive dataset of metadatas of all videos published on the site from its creation - 2007 - until february 2013. This represents almost 800,000 entries.



For each entry, the following metadatas are available:

Metadata Description Example % of Dataset upload_date Day when the video was uploaded 4/30/2011 NA title Title of the video "Tea party at Dick's house" NA channels List of the video's tags ['Tea', 'Spoon', 'Sugar'] NA description Description of the video "What a spoon !" NA nb_views Number of times the video has been displayed 69 NA nb_votes Number of users who voted for or against this video 42 NA nb_comments Number of comments posted on this video 666 NA runtime Length of the video in seconds 4815 NA uploader Anonymized identifier of the uploader's username 6f60cbef5b891f80 NA

Download

JSON | CSV - 786,121 entries (50M)



Xnxx

This is a non-exhaustive dataset of metadatas for approximately one third of all videos published on the site until february 2013. This represents almost 1,200,000 entries.

For each entry, the following metadatas are available:

Metadata Description Example % of Dataset title Title of the video "Tea party at Dick's house" NA nb_comments Number of comments posted on this video 666 NA tags List of the video's tags ['Tea', 'Spoon', 'Sugar'] NA

The interest of this dataset is its Tag ecosystem. Unlike other pornographic sites, Uploaders can tag the videos at will. Xnxx has got more than 6,000 tags for describing its videos.

Download

JSON | CSV - 1,166,278 entries (50M)



Derivated Datasets

Category Rankings

Various ranking methods for all categories in xHamster and XNXX

xHamster | XNXX

Links Over/Under representation

Matrix of all categories links strengh

xHamster | XNXX