For our experiments, we used strongly annotated synthetic data covering ten sound event classes: “Speech”, “Dog”, “Cat”, “Alarm_bell_ringing”, “Dishes”, “Frying”, “Blender”, “Running_water”, “Vacuum_cleaner”, and “Electric_shaver_toothbrush”. The dataset comprised 2045...