In total, 106 scenes from 28 different series were thus identified and extracted as video files. They were viewed and manually annotated based on several criteria: The episode, season number, and year of the first broadcast; Did the excerpt involve speaker identification? What methods were used?