This page helps researchers and students find data and tools for their projects.
Publicly Available Datasets
Twitter Archive
Exploredata.net
Happiness
Google Dataset Search
FBI Crime Data
WhatsApp Data
Stanford Activity Inequality Project
Drug Overdose CDC Data
SAMHSA
American Time Use Survey
TREC
Zillow
Film Corpus
Some datasets
Amazon Reviews
UberData
DBpedia
NIST Reuters
Natural Language Corpus Data: Beautiful Data
A3 Datasets (Short Text Data)
MARC Open_Access
Big Crisis Data: Social Media in Disasters and Time-Critical Situations
USA Facts
Collaborative Information Seeking
Fake News
HTRC
Penn Positive Psychology Center
Google Science Datasets
A List of Datasets for Practice
MATLAB Datasets
MNIST
Reuters Corpora
Data.gov
US Bureau of Labor Statistics
Federal Reserve economic data
Information is Beautiful
Wikipedias
JSTOR Data for Research (DFR)
Weather data
eBird from Cornell Lab of Ornithology
UN Data
ASU DIEGO Lab
ICPSR
KDD Cup 2016
KDD Cup 2015
KDD Cup 2014
Dr. Cohen and His Students' Datasets at CMU
Yahoo Datasets
Samantha Kleinberg Datasets
Global Health Observatory Data Repository
Princeton OPR
Novel Corpus
What do Democrats do in their Spare Time?
Multi-Relational Data
13 Machine Learning Datasets
Cornell Public Opinion Surveys
Department of Commerce
Roper Center
Pew Research Center data
UCI Datasets
1000 Genomes
American Gut (Microbiome Project)
Collaborative Research in Computational Neuroscience (CRCNS)
Gene Expression Omnibus (GEO)
Sequence Read Archive (SRA)
EBI ArrayExpress
ENCODE project
Human Microbiome Project (HMP)
ICOS PSP Benchmark
CrowdFlower
Bureau of Labor Statistics
SWELL Knowledge Work Dataset
Digg2009 Dataset
MIT Cancer Genomics Data
NIH Microarray data (FTP)
OpenSNP genotypes data
Pathguide: Protein-Protein Interactions Catalog
Protein Data Bank
PubChem Project
PubGene (now Coremine Medical)
Stanford Microarray Data
The Personal Genome Project or PGP
UCSC Public Data
UniGene
The Catalogue of Life
KDnuggets List
Kaggle Datasets
UCLA Health Policy
Academic Torrents
Political Blog Corpora
Useful Tools and Packages
Top Python Libraries for Deep Learning, Natural Language Processing
Python Datasets
R for Pew Research Center Data 1
R for Pew Research Center Data 2
t-SNE
Hatesonar
Google Perspective API
Social Media Research Tools
NLPReViz
Weka
Mallet
R: Text Mining Package (tm)
NodeXL
Gephi
R: Topic Models Package (topicmodels)
MATLAB: TMG
MATLAB: Fuzzy Logic
R: Twitter API Package (twitteR)
R: Web Crawling Package (crawl)
Python: Twitter API (Tweepy)
R: Mallet Package (mallet)
R: Tuning of LDA Model Parameters Package (ldatuning)
R: Fuzzy Clustering Package (fclust)
R: Latent Semantic Analysis Package (lsa)
R: Singular Value Decomposition Package (svd)
R: Time Series Clustering Package (dtwclust)
Mahout
R: Multi-Criteria Decision Making Methods Package (mcdm)
R: Topsis Package (topsis)
R: Rattle for Data Mining
R: rOpenSci
Karami Lab Datasets and Tools
https://github.com/amir-karami/