a) bounding box containing each food item in a particular thali.
b) labels for each bounding box that classifies the surrounded food item.
c) Calorie estimation for the whole thali (Optional)
Dataset - The input dataset can be found in the dataset page.
Basic Instructions - Candidates are free to use any tools and technologies they want to. Some of them are mentioned below.
Pytorch, TensorFlow, Keras, etc.
YOLO, ViT, SAM etc.
Pretrained or fine-tuned, CLIP based models for zero-shot learning, etc.
Evaluation Metrics - The model performance will be measured using the following evaluation metrics.
mAP (mean average precision) - for bounding box prediction and classification