On Finding Hubs in High Dimensions with Sampling
Huiwen Dong, Linghan Zeng, Zhiwen Zhao, Francesco Silvestri, Ninh Pham
AAAI 2025, Code
Scalable DBSCAN with Random Projections
HaoChuan Xu, Ninh Pham
NeurIPS 2024, Arxiv, Code, 5-min talk
This work is an extension of Xu's (honours) bachelor thesis
A Semi-supervised Feature Selection for Anomaly Detection
Jingrui Zhang, Gill Dobbie, Ninh Pham
Weakly Supervised and Cautious Learning (WSCL) Workshop at ECAI 2024
On Deploying Mobile Deep Learning to Segment COVID-19 RT-PCR Test Tube Images
Ting Xiang, Richard Dean, Jiawei Zhao, and Ninh Pham
PSIVT 2023, Data science showcase 2023
A Transductive Forest for Anomaly Detection with Few Labels
Jingrui Zhang, Ninh Pham, Gillian Dobbie
ECML-PKDD 2023, Python Code
Falconn++: A Locality-sensitive Filtering Approach for Approximate Nearest Neighbor Search
Ninh Pham, Tao Liu
NeurIPS 2022 (Spotlight - Top 5%), Arxiv, Online talk at MIT, Code, 5-min teaser at NeurIPS
This work is an extension of Liu's master thesis, Data Science Showcase 2023
Simple Yet Efficient Algorithms for Maximum Inner Product Search via Extreme Order Statistics
Ninh Pham
KDD 2021, Supplementary, Arxiv, Slide, C++ Code
Revisiting Wedge Sampling for Budgeted Maximum Inner Product Search
Stephan Lorenzen, Ninh Pham
ECML-PKDD 2020, Best Data Mining Paper Award, Slide, C++ Code, Datasets
IJCAI 2021 Sister Conference Best Paper Track (invited extended abstract)
L1-Depth Revisited: A Robust Angle-based Outlier Factor in High-dimensional Space
Ninh Pham
ECML-PKDD 2018, Slide, Supplementary, Python Code, C++ Code, Datasets
Hybrid LSH: Faster Near Neighbors Reporting in High-dimensional Space
Ninh Pham
EDBT 2017 (short paper), Python Code
Scalability and Total Recall with Fast CoveringLSH
Ninh Pham, Rasmus Pagh
CIKM 2016, Slide
Rasmus Pagh, Ninh Pham, Francesco Silvestri, Morten Stöckel
ESA 2015
Full version appears in Algorithmica 2017 (invited paper)
Efficient Estimation for High Similarities using Odd Sketches
Michael Mitzenmacher, Rasmus Pagh, Ninh Pham
WWW 2014, Best Paper Award, Selected as one of Notable Books and Articles in Computing of 2014, Slide, Matlab Code
Media: CPH Post, Videnskab, Berlingske
Fast and Scalable Polynomial Kernels via Explicit Feature Maps
Ninh Pham, Rasmus Pagh
KDD 2013 (Oral), Slide, Matlab Code (featured in scikit-learn), Updated version correcting variance analysis in the KDD version
Ninh Pham, Rasmus Pagh
KDD 2012, Slide
Online Discovery of Top-k Similar Motifs in Time Series Data
Hoang Thanh Lam, Ninh Pham, Toon Calders
SDM 2011
Two Novel Adaptive Symbolic Representations for Similarity Search in Time Series Databases
Ninh D. Pham, Quang Loc Le, Tran Khanh Dang
APWeb 2010
HOT aSAX: A Novel Adaptive Symbolic Representation for Time Series Discords Discovery
Ninh D. Pham, Quang Loc Le, Tran Khanh Dang
ACIIDS 2010
Others:
On the Power of Randomization in Big Data Analytics
PhD Thesis 2014, Slide