To overcome such challenging issues, we propose a novel knowledge distillation method,GLAMD, distilling both global and local knowledge from the teacher. We divide the feature maps into several patches and apply an attention mechanism for both the entire feature area and each patch to extract the ...