The NuCLS dataset contains over 220,000 labeled nuclei from breast cancer images from
TCGA. These nuclei were annotated through the collaborative effort of pathologists, pathology residents, and medical students using the
Digital Slide Archive. These data can be used in several ways to develop and validate algorithms for nuclear detection, classification, and segmentation, or as a resource to develop and evaluate methods for interrater analysis.