Fig. 2

Confusion Matrices showing the model’s performance in classifying surgical instruments. Values represent the proportion of true labels assigned to each predicted class. Correct predictions align along the diagonal, while misclassifications are represented by non-zero values off the diagonal
Model performance is shown on A. the whole test dataset and B. the subset of overlapping surgical tool images only