Data Sets and Softwares
1) Differentially Private Deep Learning: AdLM, pCDBN, and dp-Autoencoder
Link: https://github.com/haiphanNJIT/PrivateDeepLearning
These are the codes used in the papers titled: (1) Adaptive Laplace Mechanism: Differential Privacy Preservation in Deep Learning (https://arxiv.org/abs/1709.05750), (2) Preserving Differential Privacy in Convolutional Deep Belief Networks (https://arxiv.org/abs/1706.08839), and (3) Differential Privacy Preservation for Deep Auto-Encoders: an Application of Human Behavior Prediction (https://dl.acm.org/citation.cfm?id=3016005).
2) GeT_Move: An Efficient and Unifying Object Movement Pattern Mining
Link: https://github.com/jGetMove/jGetMove
Source(Citations)
- NhatHai Phan, Pascal Poncelet, and Maguelonne Teisseire. GeT_Move: An Efficient and Unifying Spatio-Temporal Pattern Mining Algorithm for Moving Objects. IDA 2012, Helsinki, Finland.
- NhatHai Phan, Dino Ienco, Pascal Poncelet, and Maguelonne Teisseire. Mining Representative Movement Patterns through Compression. PAKDD 2013, Goal Coast, Australia.
- NhatHai Phan, Dino Ienco, Pascal Poncelet, and Maguelonne Teisseire. Extracting Trajectories through an Efficient and Unifying Spatio-Temporal Pattern Mining System. ECML-PKDD 2012, Demo Paper, Bristol, UK.
3) SLR Club, a Professonal Protography Social Network
The data set is from a largescale web community called "the SLR Club - http://www.slrclub.com" in which users can post photos and comment on other users’ photos. The data set contains 149,799 posts and 1,909,013 comments by 144,514 users over a period of 1.5 years.
Data Set: 144,514 users - 149,799 posts - 1,909,013 comments - 1.5 years
Link: https://ix.cs.uoregon.edu/~haiphan/DataSets/SLRClub.rar
Source(Citations)
- NhatHai Phan, and Hyoseop Shin. Effective Clustering of Dense and Concentrated Online Communities. APWeb 2010, Busan, Korea.
- NhatHai Phan, Hoang Van Duc Thong, and Hyoseop Shin. Adaptive Combination of Tag and Link-based User Similarity in Flickr. ACM MM 2010, Firenze, Italy.
4) Animal Trajectory Data
Swainsoni dataset includes 43 objects evolving over 764 different timestamps. The dataset was generated from July 1995 to June 1998. Buffalo dataset concerns 165 buffaloes and the tracking time from year 2000 to year 2006. The original data has 26,610 reported locations and 3,001 time stamps.
Link: https://www.dropbox.com/s/64de87g93kie2ir/AnimalDatasets.zip?dl=0
Source(Citations)
- NhatHai Phan, Dino Ienco, Pascal Poncelet, and Maguelonne Teisseire. Mining Representative Movement Patterns through Compression. PAKDD 2013, Goal Coast, Australia.
- NhatHai Phan, Dino Ienco, Pascal Poncelet, and Maguelonne Teisseire. Mining Time Relaxed Gradual Moving Object Clusters. ACM GIS 2012, Redondo Beach, California.
- NhatHai Phan, Dino Ienco, Pascal Poncelet, and Maguelonne Teisseire. Extracting Trajectories through an Efficient and Unifying Spatio-Temporal Pattern Mining System. ECML-PKDD 2012, Demo Paper, Bristol, UK.
- NhatHai Phan, Pascal Poncelet, and Maguelonne Teisseire. GeT_Move: An Efficient and Unifying Spatio-Temporal Pattern Mining Algorithm for Moving Objects. IDA 2012, Helsinki, Finland.