Hierarchical Semi-Supervised Learning Framework for surgical gesture segmentation Based on Multi-Modality Data