2005ย
๐๐ง๐๐๐ง๐๐จ๐๐๐ซ โ a lightweight, modular, and efficient framework for multimodal learning.
๐ ๏ธ ๐๐ซ๐ฒ ๐ข๐ญ ๐ฒ๐จ๐ฎ๐ซ๐ฌ๐๐ฅ๐:
๐ Demo: https://lnkd.in/eCut8jYJ
๐ป Code: https://lnkd.in/ehcpeG2W
๐ Notebook: https://lnkd.in/eUrzQjHt
๐ Published paper: https://lnkd.in/eVMxJuHu
2024
For those seeking efficiency, we've developed SCB-Norm-Base, a lightweight version with fewer parameters to tune. Perfect for scenarios where resource constraints are a concern!
๐ PyPI Packages:
- PyTorch: [torch-cluster-based-norm](https://lnkd.in/g4hS-mCU)
- TensorFlow: [tf-cluster-based-norm](https://lnkd.in/gmwAFbDX)
- Keras: [keras-cluster-based-norm](https://lnkd.in/gSEurRbV)
๐ GitHub Repository: Explore our codebase on GitHub: [cluster-based-norm](https://lnkd.in/gsJkAuHW)
Verbalizer benchmarking for text classification: https://github.com/quang-anh-nguyen/verbalizer_benchmark
This repository contains the code for benchmarking verbalizer baselines for text classification problems, published at LREC-COLING 2024.
This repository contains our implementation of the distributed collapsed Gibbs sampler for Dirichlet Process Mixture Model inference, proposed in the paper "Distributed Collapsed Gibbs Sampler for Dirichlet Process Mixture Models in Federated Learning" (Accepted to SIAM International Conference on Data Mining (SDM24)).
This repository contains our implementation of DisNPLBM proposed in the paper "Distributed MCMC inference for Bayesian Non-Parametric Latent Block Model" (accepted to The Pacific-Asia Conference on Knowledge Discovery and Data Mining (PAKDD 2024)), arxiv.
2023ย
TabSRA: An Attention based Self-Explainable Model for Tabular Learning
https://github.com/unsupervise/TabSRAย or
https://github.com/anselmeamekoe/TabSRA
MTS-CGAN: Conditional Generative Adversarial Network for Multivariate Time Series
https://github.com/unsupervise/MTS-CGAN or
https://github.com/MadaneA/MTS-CGAN.
2021-2022ย
This code implements three model based block clustering methods.ย
https://github.com/EtienneGof/MultiCoclustering
These models are based on the Dirichlet Process Mixture Model (DPMM, used for univariate dataset clustering) and extends it to multivariate datasets.
skstab is a module for clustering stability analysis in Python with a scikit-learn compatible APIย ย
https://github.com/FlorentF9/skstab
Spark Time Series Set data analysis
https://github.com/spark-tss/spark-tss
2019: Deep Embedded Self-Organizing Map (DESOM) model,(Unsupervised Deep Learning) Git
https://github.com/FlorentF9/DESOM
Its Big Data Clustering Library (API) gathering clustering algorithms and quality indexes in Scala and Spark/Scala. Don't hesitate to ask questions or make recommendations in our Gitter. It is also in SparkPackages.
Some examples using the C4E APIย are a avalaible hereย
02/2015
ย ย ย ย ย ย ย ย ย
For more informations about "Data Science & Big data" in LIPN
Since 2022 https://github.com/unsupervise
This repositoryย includes several codes recently produced in different thesis (in open source and Apache 2 license).ย
Since 2012, ย we implement our models in Scala under the Spark platform using spark-notebook. Some of our models are also available in C or Matlab.
In 2012,ย HUG France: Prรฉsentation de Spark par Tugdual Sarazin from HUG France on Vimeo.
Our algorithms G-Stream, Mean-shift ...etc, have been cited at Scala Days Berlin 2016 andย StrataHadoop World 2016 - London, United Kingdom