Code

“In theory, theory and practice are the same. In practice, they are not."  - Einstein A


We are slowly moving all codes and packages to github StatsLE.

- Kindly consider to cite our papers if you find any of the packages helpful to your projects.



[Python Package] Python code for the paper on "Directional diffusion models for graph representation learning", [arXiv]. 


References:



[R Package] This R package performs robust convex clustering as described in Liu, Sun and Tan (2019). Classical approaches towards convex clustering solves a convex optimization problem with the cost function being a squared loss plus a fusion penalty that encourages the estimated centroids for observations in the same cluster to be identical. These approaches are not resistant to adversarial samples. This package implemented a robust convex clustering algorithm, which performs well in cases with arbitrary outliers.


References:



[R and Python Package] This R package/Python package realizes the adaptive capped least squares for linear regression models with possible outliers. ACLS is robust against outliers in both the predictor and response space: it achieves optimal breakdown point asymptotically. Statistically, it also achieves fully efficient regression estimators.  


References:

Auto-ARR: Auto-adaptive robust regression


[Python Package] This Python package implements the algorithm by Sun, 2021.


References:


[R Package] This R package realizes the I-LAMM algorithm and uses it to solve regularized adaptive Huber regression. The choice of penalty functions includes the l1-norm, the smoothly clipped absolute deviation (SCAD) and the minimax concave penalty (MCP). Two tuning parameters lambda and tau (for adaptive Huber loss) are calibrated by cross-validation. As a by-product, this package also produces regularized least squares estimators, including the Lasso, SCAD and MCP.


References:


[Python Package] This Python package implements the Huber mean estimator, adaptive Huber regression and $\ell_1$-regularized Huber regression (Huber-Lasso) estimators efficiently. For all these methods, the robustification parameter tau is calibrated by a tuning-free principle.

[R Package] This R package implements the Huber mean estimator, adaptive Huber regression and $\ell_1$-regularized Huber regression (Huber-Lasso) estimators efficiently. For all these methods, the robustification parameter tau is calibrated by a tuning-free principle.


References:


[R Package] This R package performs robust and large-scale multiple testing for millions of possibly dependent tests. The dependence is specified using a latent factor model. It implements a robust procedure to estimate distribution parameters using the adaptive Huber loss (Sun et al., 2020) and accounts for strong dependence among test statistics via an approximate latent factor model (Fan et al., 2019). This method is tailored to heavy-tailed data whose distributions deviate far from Gaussian. Besides hypotheses testing, the software also outputs the estimated underlying factors and diagnostic plots.


References:


[R Package] This R package utilizes an orthogonality constrained optimization algorithm in Wei and Yin (2013) to solve various semiparametric dimension reduction problems, such as those in Ma and Zhu (2012) and Sun et al. (2018). It also serves as a general-purpose R-based optimization solver for problems with orthogonality constraints. Parallel computing is enabled through the OpenMP API.


References: