Github links

2024

🚀 PyPI Packages:

- PyTorch: [torch-cluster-based-norm](https://lnkd.in/g4hS-mCU)

- TensorFlow: [tf-cluster-based-norm](https://lnkd.in/gmwAFbDX)

- Keras: [keras-cluster-based-norm](https://lnkd.in/gSEurRbV)


🔗 GitHub Repository: Explore our codebase on GitHub: [cluster-based-norm](https://lnkd.in/gsJkAuHW)


This repository contains the code for benchmarking verbalizer baselines for text classification problems, published at LREC-COLING 2024.


This repository contains our implementation of the distributed collapsed Gibbs sampler for Dirichlet Process Mixture Model inference, proposed in the paper "Distributed Collapsed Gibbs Sampler for Dirichlet Process Mixture Models in Federated Learning" (Accepted to SIAM International Conference on Data Mining (SDM24)).


This repository contains our implementation of DisNPLBM proposed in the paper "Distributed MCMC inference for Bayesian Non-Parametric Latent Block Model" (accepted to The Pacific-Asia Conference on Knowledge Discovery and Data Mining (PAKDD 2024)), arxiv.


2023 

https://github.com/unsupervise/TabSRA  or

https://github.com/anselmeamekoe/TabSRA


https://github.com/unsupervise/MTS-CGAN or

https://github.com/MadaneA/MTS-CGAN.


2021-2022 

https://github.com/EtienneGof/MultiCoclustering

These models are based on the Dirichlet Process Mixture Model (DPMM, used for univariate dataset clustering) and extends it to multivariate datasets.



https://github.com/FlorentF9/skstab


https://github.com/spark-tss/spark-tss


https://github.com/FlorentF9/DESOM


Its Big Data Clustering Library (API) gathering clustering algorithms and quality indexes in Scala and Spark/Scala. Don't hesitate to ask questions or make recommendations in our Gitter. It is also in SparkPackages.

Some examples using the C4E API  are a avalaible here 

02/2015

         

For more informations about "Data Science & Big data" in LIPN


Since 2022 https://github.com/unsupervise


This repository  includes several codes recently produced in different thesis (in open source and Apache 2 license). 

Since 2012,  we implement our models in Scala under the Spark platform using spark-notebook. Some of our models are also available in C or Matlab.

In 2012,  HUG France: Présentation de Spark par Tugdual Sarazin from HUG France on Vimeo.

Our algorithms G-Stream, Mean-shift ...etc, have been cited at Scala Days Berlin 2016 and  StrataHadoop World 2016 - London, United Kingdom