Dataset
multi-view data
METABRIC-BRCA: gene expression profiles, CNV profiles, and clinical information
multi-modal: text and image [TIP2023_Graph Embedding Contrastive Multi-Modal Representation Learning for Clustering]
Handwritten、Caltech-101、Reuters、NUS-WIDE-Object、Animal with attributes (AWA): Link
Multi-modal Face Dataset : EEG\MEG\fMRI\Structural data
BBCSport、Citeseer、Core、3Sources、100leaves、ORL、texas、winconsin (some introductions)
Corel Image Features: no label
CUHK Face Sketch FERET Database (CUFSF): Face & Sketch
CCV: Columbia Consumer Video (CCV) Database, A Benchmark for Consumer Video Analysis
FCVID: Fudan-Columbia Video Dataset
MNIST: edge view and gray view
n-MNIST handwritten digit dataset
Other data
Reuters、Cora、WebKB、Movies、Newsgroup
SenITVehicle_2views_300samples_3clusters.mat
airlines_raw.csv.bz2、askubuntu_processed.csv.bz2
synthetic dataset for paper 'Auto-weighted Multi-view Constrained Spectral Clustering'
SensIT Vehicle (acoustic, seismic)
Wikipedia articles: Full - 2,866 multimedia documents (image + text) and features (matlab format)
LINQS :CiteSeer for Document Classification , cora, Social Spammer ,Drug-Target Interaction ,Stance Classification ,CiteSeer for Entity Resolution ,ArXiv ,PubMed Diabetes ,WebKB ,Terrorists ,Terrorist Attacks
https://github.com/thuiar/AWESOME-MSA/tree/ce3cc6f805f57a8e92c5a58d23bd73515426316b#related-datasets
(2020) CMU-MOSEAS, access
Multi-label-Multi-view
COREL 5K、IAPR TC-12、ESP GAME、PASCAL VOC 2007、MIR FLICKR
The Extreme Classification Repository: Multi-label Datasets & Code
Single-view
UL-FMTV : thermal infrared face dataset
Visual and Thermal face dataset
ALOI (1000 classes with small objects)
deformity detect
PKU-Market-Phone、PKU-Market-PCB