Machine Learning Operations (MLOps): Overview, Definition, and Architecture
A Comprehensive Survey of Continual Learning: Theory, Method and Application
A reading survey on adversarial machine learning: Adversarial attacks and their understanding
Taking the Human out of Learning Applications: A Survey on Automated Machine Learning
Adversarial Machine Learning A Taxonomy and Terminology of Attacks and Mitigations
A Comprehensive Survey of Continual Learning: Theory, Method and Application
PyTorch: A flexible and dynamic framework for building custom AI models; widely used in research and production.
TensorFlow: A comprehensive platform supporting the entire machine learning workflow, from prototyping to deployment.
Keras: A high-level API running on top of TensorFlow, ideal for fast and easy neural network prototyping.
MXNet: A deep learning framework that supports a flexible programming model for different deep learning tasks.
Caffe: A deep learning library focused on speed and modularity, often used in computer vision.
Scikit-learn: A go-to library for data mining, predictive analysis, and classic machine learning algorithms.
XGBoost: A powerful gradient boosting framework designed for structured/tabular data tasks.
LightGBM: A gradient boosting framework optimized for speed and efficiency with large datasets.
CatBoost: A gradient boosting library specifically designed to handle categorical features effectively.
H2O.ai: An open-source platform for scalable machine learning, supporting distributed training.
PyTorch Geometric (PyG): A popular library built on PyTorch for creating and training GNNs.
Deep Graph Library (DGL): A framework designed for building and training GNNs, compatible with PyTorch, TensorFlow, and MXNet.
GraphNets: A library from DeepMind for building GNNs with TensorFlow and Sonnet.
Spektral: A Python library for GNNs built on TensorFlow and Keras.
StellarGraph: A machine learning library for graph-based data using GNNs, built on TensorFlow.
Karate Club: A Python library for unsupervised learning on graph-structured data.
Neo4j Graph Data Science: A graph database platform that includes machine learning capabilities.
PecanPy: A scalable Python library for embedding large-scale graphs.
Transformers: Pre-trained models for NLP tasks such as text generation, translation, and sentiment analysis.
NLTK: A robust toolkit for text processing, including tokenization, tagging, and parsing.
LangChain: A framework for building applications powered by large language models, integrating external resources like databases
OpenAI: Offers cutting-edge AI models, including GPT, for language understanding and generation.
LlamaIndex: Facilitates connecting language models with structured and unstructured external data sources.
SpaCy: An industrial-strength NLP library with support for tokenization, named entity recognition, and dependency parsing.
TextBlob: Simplifies NLP tasks like text classification, translation, and sentiment analysis.
Gensim: A library for topic modeling and document similarity using techniques like word2vec and LDA.
NumPy: Core library for numerical operations and efficient array handling in Python.
Pandas: Essential for data wrangling, manipulation, and analysis of structured data.
Dask: A parallel computing library for scaling NumPy, pandas, and scikit-learn workflows.
Vaex: A library for fast, memory-efficient data manipulation of large datasets.
Matplotlib: A foundational library for creating detailed and customizable visualizations.
Seaborn: Enhances Matplotlib by simplifying statistical data visualization with elegant default styles.
Plotly: An interactive visualization library for creating rich, web-based visualizations.
Bokeh: A library for interactive visualizations, focusing on web applications.
SciPy: A collection of advanced numerical routines for optimization, integration, and signal processing.
SymPy: A symbolic mathematics library useful for algebraic computations and calculus.
Numba: A JIT compiler that accelerates numerical computations with Python.
OpenCV: The industry-standard library for computer vision tasks like object detection, image processing, and video analysis.
PyTorch Lightning: Simplifies the creation of deep learning models, particularly in computer vision.
Fastai: A library for building computer vision and NLP models with PyTorch.
Albumentations: A fast and flexible library for augmenting image data in computer vision tasks.
Stable-Baselines3: A collection of pre-implemented reinforcement learning algorithms for research and development.
Ray RLlib: A scalable library for reinforcement learning applications.
Auto-sklearn: Automated machine learning built on scikit-learn.
TPOT: A genetic programming-based framework for AutoML.
DataRobot: A platform for automating the machine learning pipeline.
MLflow: A comprehensive platform for managing ML experiments, model deployment, and lifecycle tracking.
Kubeflow: A Kubernetes-native platform for deploying and managing machine learning workflows at scale.
Metaflow: A tool developed by Netflix for managing real-world machine learning workflows.
Weights & Biases (W&B): Focused on experiment tracking, hyperparameter optimization, and collaboration.
DVC (Data Version Control): A version control system tailored for machine learning projects.
TensorFlow Serving: A flexible system for serving TensorFlow models in production.
TorchServe: A PyTorch-native tool for deploying and serving PyTorch models.
Seldon Core: An open-source platform for deploying machine learning models on Kubernetes.
FastAPI: A modern web framework for building APIs, commonly used for deploying ML models.
Apache Airflow: A workflow orchestration tool for managing ML pipelines.
Prefect: A modern dataflow automation platform that simplifies pipeline orchestration.
Dagster: A data orchestrator for machine learning, analytics, and ETL pipelines.
Evidently AI: A library for monitoring ML model performance and detecting data drift.
WhyLabs AI Observatory: A platform for monitoring ML model health and data quality.
GitHub Actions: A popular tool for implementing CI/CD pipelines for ML projects.
Jenkins: A flexible automation server for building and deploying ML models.
Tekton: A Kubernetes-native framework for building CI/CD pipelines.
AWS SageMaker: A cloud platform for building, training, and deploying ML models.
Azure ML: Microsoft's MLOps platform for managing ML workflows.
Google Vertex AI: Google's platform for end-to-end ML model lifecycle management.
Optuna: A framework for hyperparameter optimization.
Hyperopt: A Python library for hyperparameter tuning.
gorgias: A framework for conversational AI and chatbot development.
PyCaret: A low-code library for automating machine learning workflows.
Orange Data Mining: A visual programming tool for data analysis and machine learning.
NetworkX: A comprehensive library for creating, analyzing, and manipulating complex networks.
GraphBLAS: A library for graph algorithms using linear algebra operations.
Snap.py: A Python library for large-scale network analysis and graph processing.
Deep Learning Projects: PyTorch, TensorFlow, and Transformers.
Classic Machine Learning: Scikit-learn and XGBoost.
NLP: OpenAI, LangChain, Transformers, NLTK.
Data Preparation: NumPy, Pandas.
Visualization: Matplotlib, Seaborn.
Scientific Research: SciPy, Theano.
Computer Vision: OpenCV.
A Comprehensive Survey of Machine Unlearning Techniques for Large Language Models
Digital Forgetting in Large Language Models: A Survey of Unlearning Methods
Fast Machine Unlearning Without Retraining Through Selective Synaptic Dampening\
Threats, Attacks, and Defenses in Machine Unlearning: A Survey
Researchers Develop New Technique to Wipe Dangerous Knowledge From AI Systems
Selective Forgetting: Advancing Machine Unlearning Techniques and Evaluation in Language Models
In-Context Unlearning: Language Models as Few Shot Unlearners
MQuAKE: Assessing Knowledge Editing in Language Models via Multi-Hop Questions
Knowledge Unlearning for Mitigating Privacy Risks in Language Models
KGA: A General Machine Unlearning Framework Based on Knowledge Gap Alignment
Towards Adversarial Evaluations for Inexact Machine Unlearning
The WMDP Benchmark: Measuring and Reducing Malicious Use With Unlearning
Unlearn What You Want to Forget: Efficient Unlearning for LLMs