Robust Recovery via Double Over-parameterization
Over-parameterized models tend to overfit in robust recovery tasks, therefore require carefully fine-tuning early termination conditions.
We provide a double over-parameterization method for robust natural image recovery, which does not require fine-tuning and is easy to use.
[Github]
Deep Isometric Learning
We identify central principles in neural network architectural design that enable effective training of very deep vanilla models. Such principles may help people design new simple networks with much improved performance.
Large Scale Subspace Clustering
Subspace clustering is the problem of clustering a set of data drawn from multiple low-dimensional subspaces into their respective subspaces.
This python package provides implementation of scalable and provably correct subspace clustering methods that can handle 1M data points.
[Github]
Outlier Detection in Subspaces
Outliers are the points that do not lie in the underlying low-dimensional subspaces, which need to be detected and rejected.
This matlab code implements a random walk approach for outlier detection that can handle multiple (and possibly unknown number of) inlier groups.
[Download]
Exemplar Selection from Class-Imbalanced Data
Modern datasets usually contain drastically different number of samples from different classes. which can compromise the performance of existing learning methods by a significant amount.
This matlab code implements an efficient greedy algorithm for generating a balanced subset from imbalanced data.
[Download]