The Pornography database contains nearly 80 hours of 400 pornographic and 400 non-pornographic videos. For the pornographic class, we have browsed websites which only host that kind of material (solving, in a way, the matter of purpose). The database consists of several genres of pornography and depicts actors of many ethnicities, including multi-ethnic ones. For the non-pornographic class, we have browsed general-public purpose video network and selected two samples: 200 videos chosen at random (which we called "easy") and 200 videos selected from textual search queries like "beach", "wrestling", "swimming", which we knew would be particularly challenging for the detector (called "difficult"). In the figure below, we illustrate the diversity of the pornographic videos (top row) and the challenges of the “difficult” non-pornographic ones (middle row). The "easy" cases are shown at bottom row. The huge diversity of cases in both pornographic and non pornographic videos makes this task very challenging.
A summary of the Pornography database.
Ethnic diversity on the pornographic videos.
We preprocess the database by segmenting videos into shots. An industry-standard segmentation software, the STOIK Video Converter, has been used. As it is often done in video analysis, a key frame is selected to summarize the content of the shot into a static image. Although there are sophisticated ways to choose the key frame, we opted to simply selected the middle frame of each video shot. In total, there are 16,727 video segments.
The experimental evaluation is a classical 5-fold cross-validation. We report the image classification performance by using the Mean Average Precision (MAP), and the video classification by Accuracy Rate, where the final video label is obtained by majority voting over the images. A confusion table is also used to illustrate the results.
THIS DATABASE IS PROVIDED "AS IS" AND WITHOUT ANY EXPRESS OR IMPLIED WARRANTIES, INCLUDING, WITHOUT LIMITATION, THE IMPLIED WARRANTIES OF MERCHANTABILITY AND FITNESS FOR A PARTICULAR PURPOSE. The videos, segments, and images provided were produced by third-parties, who may have retained copyrights. They are provided strictly for non-profit research purposes, and limited, controlled distributed, intended to fall under the fair-use limitation. We take no guarantees or responsibilities, whatsoever, arising out of any copyright issue. Use at your own risk.
It is necessary to sign a license agreement to get access to the data (videos, video segments and frames), you can find the license agreement here. Please print it, sign it and send a scanned copy to Arnaldo Araújo <arnaldo [at] dcc [dot] ufmg [dot] br>; see also the instructions page in the document for more information.
We have computed several visual features. They are freely available for download:
In order to make the comparison possible, the training and test folds are available for download:
If you make use of the Pornography database, please cite the following reference:
Papers reporting results on the Pornography database:
Please feel free to contact us if you have any questions or comments.
This work is supported by CAPES, CNPq, FAPEMIG, FAPESP.
Last updated on February 2017.