List of AI Projects
In recent years, I worked as team lead and tech lead on 40+ AI projects, delivered 70+ product-level AI models.
List of AI Projects
In recent years, I worked as team lead and tech lead on 40+ AI projects, delivered 70+ product-level AI models.
1. Large Foundation Model Projects (7+ Projects, Covering LLM, SD, SORA, CLIP)
1.1 Multimodal LLMs
1.1.1 Video LLM - 9 Months
1.1.2 Chat with Image Pairs - 6 Months
1.2 Video Generation
1.2.1 Stable Video Diffusion (Unet/SVD) - 8 Months
1.2.2 Diffusion Transformer (DiT/SORA) - 4 Months
1.3 Human Image Generation
1.3.1 Stable Diffusion (SD) - 6 months
1.3.2 Avatar View Generation - 3 months
1.4 Multi-lingual CLIP - 6 Months
1.4.1 Multilingual Text Encoder (BERT) + Image Encoder (ViT)
2. Computer Vision Projects (30+ Projects, 50+ Models Delivered into Product)
2.1 Human Search Engine
2.1.1 Face Recognition
2.1.1.1 Large Resnet Models
2.1.1.2 Lightweight Mobilenet Models
2.2.2 Facial Attributes Recognition
2.2.2.1 Gender Recognition
2.2.2.2 Age Group Recognition
2.2.2.3 Ethnicity Recognition
2.2.2.4 Skin Color Clustering
2.2.3 Face Anti-spoofing
2.2.3.1 Still Image Face Anti-spoofing
2.2.3.2 Flash-based Face Anti-spoofing
2.2.4 Facial Virtual Try-on
2.2.4.1 Lipstick & Eyebrow Try-on
2.2.5 Hand Gesture Recognition
2.2.5.1 Uncivil Hand Gesture Recognition
2.2.5.2 VR-glasses Control Gesture Recognition
2.2.6 Human Behavior Understanding
2.2.6.1 Video Based Human Crowd Counting
2.2.6.2 TV Watching Audience Emotion and Attention Understanding
2.2 Large Scale Image Search Engine
2.2.1 20K Class Plant Image Classification
2.2.2 50K Class Animal Image Classification
2.2.3 10K Class Landmark Image Classification
2.2.4 Shopping Item Image Classification
2.2.4.1 Clothes Style recognition
2.2.4.2 Clothes Category Recognition
2.2.4.3 Re-ranking for Clothes Search
2.2.5 Generic Image Representation Learning
2.2.5.1 Supervised Representation Learning(CNN/ViT)
2.2.5.2 Unsupervised Representation Learning (Dino, SwAV, BYOL, MAE)
2.2.6 Mining Scene Object Segmentation (AI Competition, Winner)
2.3 Model based visual dataset cleaning
2.3.1 Plant Image Dataset Cleaning Model
2.3.2 Animal Image Dataset Cleaning Model
2.3.3 Human Image Dataset Cleaning Model
2.4 Video Highlight Detection
2.4.1 Fighting/Explosion Scene Detection
2.4.2 Dancing Scene Detection
2.4.3 Video Summarization
2.5 Model Inference Acceleration
2.5.1 Model Acceleration (TensorRT, Quantization)
2.5.2 Deep Hashing Code Learning Model (32X compact)
2.6 Misc.
2.6.1 MRI Image Segmentation - 4 months (For a Fortune 500 Company)
2.6.2 Video Based Hand Hygiene Gesture Recognition
3. NLP Projects (10+ Projects, 10+ Models Delivered into Products)
3.1 Video Search (BERT)
3.1.1 English Video Search model
3.1.2 Chinese Video Search model
3.1.3 Spanish Video Search model
3.1.4 Arabic Video Search model
3.2 Text Classification (BERT)
3.2.1 Video Query Classification x4 Languages
3.1.2 Video Title Classification languages x4 Languages
3.1.3 Video keywords extraction (English)