Data Sets

We have been collecting data sets and conducting baseline and advanced personal identification studies using biometric measurements.  We are committed to releasing all data collated to eligible research groups, with appropriate controls to forbid on-line distribution outside the research community.  Data is distributed using rsync.

If you are interested in obtaining any of the biometric datasets described below, please follow these instructions:

  • Download all applicable license agreements.  Several of our datasets require more than one license agreement.
  • Have the license agreement reviewed and signed BY AN INDIVIDUAL AUTHORIZED TO MAKE LEGAL COMMITMENTS ON BEHALF OF YOUR INSTITUTION.  WE CANNOT ACCEPT LICENSES SIGNED BY STUDENTS OR FACULTY MEMBERS.  YOUR INSTITUTION'S LEGAL OFFICE MUST REVIEW AND EXECUTE THE LICENSE.
  • Return the properly signed license agreement via your INSTITUTIONAL  e-mail address (we cannot accept license agreements sent through third party e-mail providers) to cvrl@nd.edu or fax, attention D. Wright, to 1-574-631-9260. 
  • Include in the e-mail/cover page the full name, title, address and phone number of the institution and institutional point of contact.
Upon our receipt and approval of the license agreement, download instructions will be sent to the licensee via e-mail.


Biometric Data Sets

IJCB 2017 Challenge
Data type: Face Still image
Approximate Download Size: 200 GB
To obtain this data set, retrieve the license agreement and follow instructions above.  Publications using this database must cite the paper listed in the license agreement.

EFCT 2017
Data type: Face Still image
Approximate Download Size: 2.2 GB
To obtain this data set, retrieve the license agreement and follow instructions above.  Publications using this database must cite the paper listed in the license agreement.

ND-IIITD Retouched Face Database
Data type: Face Still image 
Approximate Download Size: 4.3 GB
The ND-IIITD Retouched Faces databaseis a dataset of original face images and retouched versions of those face images. The database contains 2600 original images and 2275 altered
images.  It is meant for use in the problem of developing methods to classify a face image as original or retouched.
To obtain this data set, retrieve the license agreement and follow instructions above.  Publications using this database must cite the paper listed in the license agreement.

The Point and Shoot Face and Person Recognition Challenge (PaSC)
Data type: Face Still image and Face Video
Approximate Download Size: 71 GB
Information about the PaSC effort may be found at http://www.nist.gov/itl/iad/ig/face.cfm and http://www.cs.colostate.edu/pasc.  To obtain this data set, retrieve the license agreement and follow instructions above.  Publications using this database must cite the paper listed in the license agreement.

The ND-Near Infrared and Visible-Light (ND-NIVL) 
Data type: IR & Visible Face Still 
Approximate Download Size: 404 GB
The image collection comprises visible-light and near-IR face images of 574 subjects acquired from fall 2011 to spring 2012.  There are a total of 2,341 visible-light face images of the 574 persons.  There are a total of 22,264 near-IR face images, coming from two 230 subject-sessions.  A total of 402 subjects had both visible-light and near-IR face images collected in one or more sessions in fall 2011 and also one or more sessions in spring 2012.  To obtain this data set, retrieve the license agreement and follow instructions above.  Publications using this database must cite the paper listed in the license agreement.

The Gender from Iris Dataset (ND-GFI)
Data type: IR Iris Still 
Approximate Download Size:1 GB
This is a set of iris images acquired using an LG 4000 sensor.  It is divided into left and right iris images, and it includes the gender of each subject.  In the basic part of the dataset there is one image per left and right iris of each of 750 males and 750 females, for 3000 total images.  An additional part of the dataset, for a different set of subjects, includes three images per iris.  Publications using this database must cite the paper listed in the license agreement.

The Notre Dame Contact Lense Dataset 2015(NDCLD15)
Data type: IR Iris Still 
Approximate Download Size:2.7 GB
As detailed in Robust Detection of Textured Contact Lenses in Iris Recognition Using BSIF, James S. Doyle and Kevin W. Bowyer, IEEE Access, Volume 3, 2015. Digital Object Identifier 10.1109/ACCESS.2015.2477470, this 
 database of 7300 images was constructed to evaluate contact lens detection under various experimental scenarios. The main dataset is composed of 6000 images for model training and 1200 images for model evaluation. Images were acquired using either an IrisAccess LG4000 or an IrisGuard AD100 sensor; both sensors are equally represented. The dataset is composed of images from one of three equally-represented classes: No Lens, Soft Lens, and Textured Lens. 

FACE Features Set
Data type: feature patterns
Approximate Download Size: 192 KB
The FACE Features Set comprises feature patterns for imagery that is amenable to human-assisted face clustering. The features were computed for faces observed in blurry point-and-shoot videos, images of women seen before and after the application of makeup, and photographs of twins.  Please follow the instructions in the README file regarding citation.

SN-Flip Crowd Video Data Set
Data type: Group Video
Approximate Download Size: 2.1 GB
Comprising 190 subjects recorded in 28 crowd videos over a two year period, SN-Flip captures variations in illumination, facial expression, scale, focus, and pose.  The videos were recorded with point-and-shoot camcorders from the Cisco Flip family of products, so the image quality is representative of typical web videos.  Ground truth information for subject identities and social groups is included to facilitate future research in vision-driven social network analysis.  To obtain this data set, retrieve the license agreement and follow instructions above.  Publications using this database must cite the paper listed in the license agreement.

ND-CrossSensor-Iris-2013 Data Set
Data type: IR Iris still
Approximate Download Size: 122 GB
This data set was initially released for the Cross Sensor iris Recognition Challenge associated with the BTAS 2013 Conference. This dataset consists of 27 sessions of data with 676 unique subjects.  An average session contains 160 unique subjects which have multiple images from both the LG2200 and LG4000 iris sensors.  There are 29,986 images from the LG4000 and 116,564 images from the LG2200.  Every subject occurs in at least two sessions across the entire data set.  This data set spans three years, 2008 to 2010.  The initial images are taken from both sensors and are 640 by 480.  There are additional images included in this data set, known as the modified LG2200 images.  The original images have been stretched vertically by 5% to compensate for the non-unit aspect ratio of the digitizer used in the LG2200 computer-hosted runtime acquisition system (this elongation was suggested by Imad Malhas of IrisGuard inc.  in 2009).  Hence these additional images are of size 640 by 504.  To obtain this data set, retrieve the license agreement and follow instructions above.  Publications using this database must cite the paper listed in the license agreement.

ND_Cosmetic_Contact_Lenses_2013 Data Set
Data type: IR Iris still
Approximate Download Size: 3.9 GB
This data set contains iris images of subjects without contact lenses, with soft contact lenses, and with cosmetic contact lenses, acquired using an LG4000 and an IrisGuard AD100 iris sensor.  The data set contains 4,200 TIFF files from the LG4000 sensor, 900 TIFF images from the AD100 sensor, and four metadata files describing the images.  For a more thorough description, please see the README document.  This data set corresponds to that in:  Doyle, J.S; Bowyer, K.W.; Flynn, P.J., "Variation in accuracy of textured contact lens detection based on sensor and lens pattern," Biomentrics: Theory, Applications and Systems (BTAS), 2013 IEEE Sixth International Conference on, vo.,no.,pp1,7, Sept. 29 2013-Oct.2 2013.  To obtain this data set, retrieve the license agreement and follow instructions above.  Publications using this database must cite the paper listed in the license agreement.

ND-CrossSensor-Iris-2012 Data Set
Data type: IR Iris still
Approximate Download Size: 83.5 GB
This data set was initially released for the Cross Sensor Iris Recognition Challenge associated with the BTAS 2012 conference.  This dataset consists of 27 sessions of data with 676 unique subjects.  An average session contains 160 unique subjects which have multiple images from both the LG2200 and LG4000 iris sensors.  There are 29,939 images from the LG4000 and 117,503 images from the LG2200.  Every subject occurs in at least two sessions across the entrée data set.  This data set spans three years, 2008 to 2010.  The initial images are taken from both sensors and are 640 by 480.  There are additional images included in this data set, known as the modified LG2200 images.  The original images have been stretched vertically by 5% to compensate for the non-unit aspect ration of the digitizer used in the LG2200 computer-hosted runtime acquisition system (this elongation was suggested by Imad Malhas of IrisGuard Inc. in 2009).  Hence these additional images are of size 640 by 504.  To obtain this data set, retrieve the license agreement and follow instructions above.  Publications using this database must cite the paper listed in the license agreement.

ND-TimeLapseIris-2012
Data type: IR Iris still
Approximate Download Size: 2.2 GB
This data set contains 6797 images collected from 23 subjects (46 different irises) between January 2004 and October 2008.  It corresponds to the data set used in a chapter in the book: Template Aging in Iris Biometrics:  Evidence of Increased False Reject Rate in ICE 2006, Sarah Baker, Kevin W. Bowyer, Patrick K. Flynn and P. Johathon Phillips, in Handbook of Iris Recognition, Mark Burge and Kevin W. Bowyer, editors, Springer, 2012.  to demonstrate the effects of elapsed time between probe and gallery image acquisition on iris recognition system performance.  To obtain this data set, retrieve the license agreement and follow instructions above.  Publications using this database must cite the paper listed in the license agreement.

ND-2006 Data Set
Data Type: Face 3D
Approximate Download Size: 29 GB
The ND-2006 data set contains a total of 13,450 images containing 6 different types of expressions (Neutral, Happiness, Sadness, Surprise, Disgust and Other).  A total of 888 distinct persons, with as many as 63 images per subject are available in this data set.  The data set corresponds exactly to the data set described in: Faltemier, T.C.; Bowyer, K.W.; Flynn P.J.; Using a Multi-Instance Enrollment Representation to Improve 3D Face Recognition, Proc. First IEEE International Conference on Biometrics: Theory, Applications, and Systems, September 2007, pp.1 - 6.  To obtain this data set, retrieve the license agreement and follow instructions above.  Publications using this database must cite the paper listed in the license agreement.

3D Twins Expression Challenge (3D-TEC) Data Set
Data Type: Face 3D
Approximate Download Size: 1.5 GB
This data set contains 3D face scans for 107 pairs of twins.  There are 107 x 2 = 214 individuals, each with a 3D face scan with a smiling expression and a scan with a neutral expression, and so 214 x 2 = 428 scans.  The scans were acquired with a Minolta Vivid 910.  To obtain this data set, retrieve the license agreement and follow instructions above.  Publications using this database must cite the paper listed in the license agreement.

ND-Iris-Template-Aging-2008-2010
Data Type: IR Iris
Approximate Download Size: 4 GB
 This data set contains sequences of iris images of different persons, acquired using an LG4000 iris sensor.  Images are from Spring 2008, Spring 2009, and Spring 2010.  This allows two different one-year template aging studies, 2008-2009 and 2009-2010, and one two-year template aging study, 2008- 2010.  The data set contains 11,776 TIFF files.  It is used in the paper: Analysis of Template Aging in Iris Biometrics, Samuel P. Fenker and Kevin W. Bowyer, IEEE Compter Society Biometrics Workshop, June 2012.  To obtain this data set, retrieve the license agreement and follow instructions above.  Publications using this database must cite the paper listed in the license agreement.

ND-TWINS-2009-2010
Data Type: Face Still 
Approximate Download Size: 250 GB
This data set contains 24,050 color photographs of the faces of 435 attendees at the Twins Days Festivals in Twinsburg, Ohio in 2009 and 2010.  All images were captured by Nikon D90 SLR cameras.  Images were captured under natural light in "indoor" and "outdoor" configurations ("indoor" was a tent).  Facial yaw varied from -90 to +90 degrees in steps of 45 degrees (zero degrees was frontal).  To obtain this data set, retrieve the license agreement and follow instructions above.  Publications using this database must cite the paper listed in the license agreement.

ND-QO-Flip Crowd Video Data Set
Data Type: Group Video
Approximate Download Size: 260 MB
The database contains 14 crowd videos of 90 subjects, five of whom appear in multiple videos and 85 of whom appear in one video.  These videos were acquired between November 200 and May 2010.  To obtain this data set, retrieve the license agreement and follow instructions above.  Publications using this database must cite the paper listed in the license agreement.

ND_Iris_Contact_Lenses_2010
Data Type: IR Iris Still
Approximate Download Size: 7 GB
This is a data set of iris images that was used in a study of the effects of wearing contact lenses on the performance of iris recognition: "Degradataion of Iris Recognition Performance Due to Non-Cosmetic Prescription Contact Lenses", Sarah E. Baker, Amanda Hentz, Kevin W. Bowyer, and Patrick J. Flynn, Computer Vision and Image Understandy 114 (9), 1030-1044, September 2010.  To obtain this data set, retrieve the license agreement and follow instructions above.  Publications using this database must cite the paper listed in the license agreement.

ND-Iris-0405 Data Set
Data Type: IR Iris Still
Approximate Download Size: 20.5 GB
A technical report describing this data can be found here.  The data set contains 64,980 iris images obtained from 356 subjects (712 unique irises) between January 2004 and May 2005.  To obtain this data set, retrieve the license agreement and follow instructions above.  Publications using this database must cite the paper listed in the license agreement.

Multiple Biometric Grand Challenge (MBGC ) version 2 data collection
Data Type: IR Face and Iris Video, IR Iris Video, Visible Face Images and Video 
Approximate Download Size: 110 GB ( +  UTD data: 36 GB)
Information about the MBGC program maybe found here.  The MBGC v2 and MBGC v1 data sets are separately licensed; MBGC v1 licensees must complete a new license for MBGC v.2.  Please be sure to complete all three required forms to receive the full MBGC 2.o data collection.  To obtain this data set, retrieve the license agreement and follow instructions above.  Publications using this database must cite the paper listed in the license agreement.

UTD data collection
Data Type: Visible Face Video 
Approximate Download Size:36 GB
Notre Dame distributes a subset of the Database of Moving Faces and People data set, assembled by A.J. O'Toole and H. Abdi at the University of Texas at Dallas. The subset contains between 1 and 9 videos of 297 unique human subjects, with a total of 1019 videos  and a data set size of 36GB.  To obtain this data set, you must agree to, and your institution must execute, both the data license agreement and the permission form.

Multiple Biometric Grand Challenge (MBGC) version 1 data collection
Data Type: IR Face and Iris Video, IR Iris Video, Visible Face Images and Video
Approximate Download Size: N/A
Information about the MBGC program may be found hereTo obtain this data set, retrieve the license agreement and follow instructions above.  Publications using this database must cite the paper listed in the license agreement.

Face Recognition Grand Challenge (FRGC v.2.0) data collection
Data Type:Data Type: 3D Face, Visible Face Images
Approximate Download Size: 72 GB
Information about the FRGC program may be found here.    Note: the FRGC 1.0a data collection has been superseded by the FRGC v2.0 collection and is no longer available.  As of 4/2/2014 the Bee Software has been separated from the FRGC 2.0 dataset.  If you wish to download it as well as the FRGC 2.0 dataset, please download and execute the Bee software license agreement.  To obtain this data set, retrieve the license agreement and follow instructions above.  Publications using this database must cite the paper listed in the license agreement.

Face and Ocular Challenge Series (FOCS) data collection
Data Type: Visible Face Images, Visible Face Video, NIR Iris and Ocular Regi0n
Approximate Download Size: 98.5 GB
Information about the FOCS program may be found here.  Please be sure to execute all three required documents to receive download information.  To obtain this data set, retrieve the license agreement and follow instructions above.  Publications using this database must cite the paper listed in the license agreement.

ND-Collection B
Data Type: Face Still
Approximate Download Size: 57.5 GB
33,287 visible-light frontal face images captured from 487 human subjects from 2002 through 2004.  Each subject was photographed with a high-resolution digital camera (1600 x 1200 or 2272 x 1704) under different lighting and expression conditions.  Many subjects were photographed every week for 10 weeks in the Spring of 2002, 13 weeks in the fall of 2002 and 15 weeks in the spring of 2003.  The number of images per subject ranges from 4 to 227 with an average of 68.  Hence, this database provides a significant amount of "repeat data" to assess performance of face recognition systems with respect to time elapsed since enrollment.

ND-Collection C
Data Type: LWIR Face Still
Approximate Download Size: 360 MB
2,492 LWIR frontal face images from 241 human subjects captured in 2002.  All images are 320 x 240 and were captured by a Merlin-Uncooled camera purchased from Indigo Systems in 2001.  

ND-Collection D
Data Type: 3D + 2D Face Images
Approximate Download Size: 2.5 GB
953 3D ( and corresponding 2D) frontal face images from 277 human subjects captured in 2003.  These images were acquired with a Minolta Vivid 900 3D range scanner.

ND-Collection E
Data Type: Visible Light Ear
Approximate Download Size: 487 MB
464 visible light profile (ear) images from 114 human subjects captured in 2002.

ND-Collection F
Data Type: 3D + 2D Ear Images
Approximate Download Size: 2.5 GB
942 3D (and corresponding 2D) profile (ear) images from 302 human subjects captured in 2003 and 2004.

ND-Collection G
Data Type: 3D + 2D Ear Images
Approximate Download Size: 2 GB
738 3D ( and corresponding 2D) profile (ear) images from 235 human subjects captured between 2003 and 2005.

ND-Collection H
Data Type: 3D + 2D Hand Images
Approximate Download Size: 14.5 GB
1191 3D (and corresponding 2D) images of the back (non-palm) portion or 223 different human hands captured between 2003 and 2005.

ND-Collection J2
Data Type: 3D + 2D Ear Images
Approximate Download Size: 35 GB
1800 3D (and corresponding 2D) profile (ear) images from 415 human subjects captured between 2003 and 2005.  Corresponds to data used in Yan and Bowyer, "Biometric recognition using three-dimensional ear shape, " PAMI 29(8), August 2007.

ND-Collection X1
Data Type: IR + Visible Face Images
Approximate Download Size: 3 GB
2292 IR frontal face images and 2292 visible frontal face images from 82 human subjects captured from 2002-2004.