Methodology

Dataset --> Molecular Fingerprints --> Dimension reduction (PCA, t-SNE, UMAP) --> Clustering (k-means clustering, DBSCAN, HDBSCAN)