Know-How's: A Multi-modal Agent System for Converting Manufacturing

We present a multi-modal AI framework that automatically transforms unstructured manufacturing videos into standardized digital manuals, referred to as "Know-how Heritage." Unlike conventional video-to-text approaches, our system integrates Range of Motion (ROM) and Hand-Object Interaction (HOI) analysis to extract quantitative data on ergonomic postures and fine-grained tool manipulations.
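To make the idea of quantitative ROM data concrete, the following is a minimal sketch of how a joint angle and its range over a clip might be computed from body keypoints. It assumes that 2D or 3D keypoints (for example shoulder, elbow, wrist) are already available from some pose estimator; the function names `joint_angle` and `range_of_motion` are hypothetical and are not taken from the system described here.

```python
import math


def joint_angle(a, b, c):
    """Return the angle in degrees at joint b formed by the points a-b-c.

    a, b, c are keypoint coordinates (2D or 3D tuples), assumed to come
    from an upstream pose estimator that is not shown here.
    """
    ba = [p - q for p, q in zip(a, b)]  # vector from joint b to point a
    bc = [p - q for p, q in zip(c, b)]  # vector from joint b to point c
    dot = sum(x * y for x, y in zip(ba, bc))
    norm = math.hypot(*ba) * math.hypot(*bc)
    if norm == 0:
        raise ValueError("keypoints coincide; angle is undefined")
    # Clamp to [-1, 1] to guard against floating-point drift before acos.
    cos_theta = max(-1.0, min(1.0, dot / norm))
    return math.degrees(math.acos(cos_theta))


def range_of_motion(angles):
    """Return the spread (max minus min) of a sequence of joint angles."""
    return max(angles) - min(angles)


# Hypothetical per-frame keypoints: (shoulder, elbow, wrist).
frames = [
    ((0, 0), (1, 0), (1, 1)),  # elbow bent at a right angle
    ((0, 0), (1, 0), (2, 0)),  # arm fully extended
]
elbow_angles = [joint_angle(s, e, w) for s, e, w in frames]
elbow_rom = range_of_motion(elbow_angles)
```

Under these assumptions, per-frame angles like `elbow_angles` could be compared against ergonomic thresholds, while `elbow_rom` summarizes how far the joint moved during a task step.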