The data provided by you will be used in monitoring the traffic and analyzing the need of the researchers like you. We strictly against the use of any third part tools for monitoring your activities. We only use and relies on the data provided by the users.
The provided datasets are in which format?
Main/Parent folder is zipped. If you are using the Linux platform you will not require any special tool for its unzipping. On windows, software like WinZip, WinRAR or 7Zip will do great.
Parent folder have
1. About.pdf - Information about the dataset.
2. Compounds.tsv - All the respective compounds are in SMILES format with PubChem CID.
3. ReadMe - Have link of all the database used, with additional information.
4. sdf.zip - All the respective compounds are in 3D SDF format.
From where you are fetching the data?
Currently we are using only PubChem, so we can assure ourselves that we are providing you with the unified data.
How much data is reliable?
We have provided the link of the databases. With it, all the required information is in the zipped folder. You can cross check the data. If in any case you are not satisfied with the data we are providing, you can contact us.
You can cite our URL - https://sites.google.com/view/mud-data in your research articles or papers. Above it we request you to share it with your fellow researchers, juniors and students. We required all your possible support to grow.