Karami Lab Datasets and Tools

https://github.com/amir-karami/


This page helps researchers and students find data and tools for their projects.

Publicly Available Datasets

  1. RateMyProfessor

  2. Twitter Archive

  3. Exploredata.net

  4. Happiness

  5. Google Dataset Search

  6. FBI Crime Data

  7. WhatsApp Data

  8. Stanford Activity Inequality Project

  9. Drug Overdose CDC Data

  10. SAMHSA

  11. American Time Use Survey

  12. TREC

  13. Zillow

  14. Film Corpus

  15. Some datasets

  16. Amazon Reviews

  17. UberData

  18. DBpedia

  19. NIST Reuters

  20. Natural Language Corpus Data: Beautiful Data

  21. A3 Datasets (Short Text Data)

  22. MARC Open_Access

  23. Big Crisis Data: Social Media in Disasters and Time-Critical Situations

  24. USA Facts

  25. Collaborative Information Seeking

  26. Fake News

  27. HTRC

  28. Penn Positive Psychology Center

  29. Google Science Datasets

  30. A List of Datasets for Practice

  31. MATLAB Datasets

  32. MNIST

  33. Reuters Corpora

  34. Data.gov

  35. US Bureau of Labor Statistics

  36. Federal Reserve Economic Data

  37. Information is Beautiful

  38. Wikipedias

  39. JSTOR Data for Research (DFR)

  40. Weather data

  41. eBird from Cornell Lab of Ornithology

  42. UN Data

  43. ASU DIEGO Lab

  44. ICPSR

  45. KDD Cup 2016

  46. KDD Cup 2015

  47. KDD Cup 2014

  48. Datasets from Dr. Cohen and His Students at CMU

  49. Yahoo Datasets

  50. Information is Beautiful

  51. Film Corpus

  52. Samantha Kleinberg Datasets

  53. Global Health Observatory Data Repository

  54. Princeton OPR

  55. Novel Corpus

  56. What do Democrats do in their Spare Time?

  57. Multi-Relational Data

  58. 13 Machine Learning Datasets

  59. Cornell Public Opinion Surveys

  60. Department of Commerce

  61. Roper Center

  62. Pew Research Center data

  63. UCI Datasets

  64. 1000 Genomes

  65. American Gut (Microbiome Project)

  66. Collaborative Research in Computational Neuroscience (CRCNS)

  67. Gene Expression Omnibus (GEO)

  68. Sequence Read Archive (SRA)

  69. EBI ArrayExpress

  70. ENCODE project

  71. Human Microbiome Project (HMP)

  72. ICOS PSP Benchmark

  73. CrowdFlower

  74. Bureau of Labor Statistics

  75. SWELL Knowledge Work Dataset

  76. Digg2009 Dataset

  77. MIT Cancer Genomics Data

  78. NIH Microarray data (FTP)

  79. OpenSNP genotypes data

  80. Pathguide: Protein-Protein Interactions Catalog

  81. Protein Data Bank

  82. PubChem Project

  83. PubGene (now Coremine Medical)

  84. Stanford Microarray Data

  85. The Personal Genome Project (PGP)

  86. UCSC Public Data

  87. UniGene

  88. The Catalogue of Life

  89. KDnuggets List

  90. Kaggle Datasets

  91. UCLA Health Policy

  92. Academic Torrents

  93. Political Blog Corpora