Visual Generative AI
Image / Video / 3D Generation
Diffusion Models / Flow Matching
Gaussian Splatting / NeRF
Visual Multimodal AI
Large Vision-Language Models (LVLMs)
Image / Video Retrieval and Detection
Vision-based Dialogue & ReasoningÂ
Visual Generative AI: Image / Video / 3D
Visual Multimodal AI: Large-Vision-Language Models / Vision-Language-Action / Visual Retrieval / Dialogue
Video-grounded Dialogue
Image / Video Retrieval