In this section, we evaluate and compare the proposed models on all subtasks using the full set of classification-report metrics. We report precision, recall, F1-score, and accuracy, both per class and overall (macro, weighted, and micro averages where applicable).
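To make the averaging schemes concrete, the following minimal sketch computes the per-class and overall metrics named above from a list of gold and predicted labels. It is an illustration under stated assumptions, not the evaluation code used for the reported results: the function name `classification_summary`, the toy labels, and the convention of scoring undefined ratios as 0.0 are all hypothetical choices.

```python
from collections import Counter


def classification_summary(y_true, y_pred):
    """Per-class precision/recall/F1 plus macro, weighted, micro averages and accuracy.

    Assumes single-label classification: one gold and one predicted label per item.
    """
    labels = sorted(set(y_true) | set(y_pred))
    tp, fp, fn = Counter(), Counter(), Counter()
    for gold, pred in zip(y_true, y_pred):
        if gold == pred:
            tp[gold] += 1
        else:
            fp[pred] += 1  # predicted class gains a false positive
            fn[gold] += 1  # gold class gains a false negative

    def prf(tp_, fp_, fn_):
        # Assumption: undefined ratios (zero denominator) are scored as 0.0.
        precision = tp_ / (tp_ + fp_) if tp_ + fp_ else 0.0
        recall = tp_ / (tp_ + fn_) if tp_ + fn_ else 0.0
        denom = precision + recall
        f1 = 2 * precision * recall / denom if denom else 0.0
        return precision, recall, f1

    per_class = {c: prf(tp[c], fp[c], fn[c]) for c in labels}
    support = Counter(y_true)  # number of gold items per class
    n = len(y_true)

    # Macro: unweighted mean over classes.
    macro = tuple(sum(per_class[c][i] for c in labels) / len(labels) for i in range(3))
    # Weighted: mean over classes, weighted by class support.
    weighted = tuple(sum(per_class[c][i] * support[c] for c in labels) / n for i in range(3))
    # Micro: pool the counts over all classes, then compute the ratios once.
    micro = prf(sum(tp.values()), sum(fp.values()), sum(fn.values()))

    return {
        "per_class": per_class,
        "macro": macro,
        "weighted": weighted,
        "micro": micro,
        "accuracy": sum(tp.values()) / n,
    }


# Hypothetical toy labels, for illustration only.
y_true = ["a", "a", "a", "b", "b", "c"]
y_pred = ["a", "a", "b", "b", "c", "c"]
report = classification_summary(y_true, y_pred)
```

In the single-label setting assumed here, the micro-averaged precision, recall, and F1-score all coincide with accuracy, which is why micro averages are only reported where they add information.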