(R-2) and Rouge-L (R-L) (Lin, 2004; https://github.com/pltrdy/files2rouge). We follow the generation settings in Lewis et al. (2019). We omit the word embedding lookup table and the softmax layer from both the model parameter count and the #Mult-Adds calculation. #Mult-Adds is calculated ...
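
The parameter-counting convention described above (leaving out the embedding lookup table and the softmax layer) can be sketched as follows. This is only an illustration, not the authors' script: the tensor names, the shapes, and the prefixes `embed_tokens` and `output_projection` are hypothetical, and the #Mult-Adds computation is not covered here.

```python
from math import prod


def count_params(shapes, excluded_prefixes=("embed_tokens", "output_projection")):
    """Sum parameter counts, skipping tensors whose name starts with an excluded prefix.

    `shapes` maps a parameter name to its shape tuple. The default prefixes are
    assumptions standing in for the embedding table and the softmax layer.
    """
    return sum(
        prod(shape)
        for name, shape in shapes.items()
        if not name.startswith(excluded_prefixes)
    )


# Hypothetical toy model: names and shapes are illustrative only.
toy_shapes = {
    "embed_tokens.weight": (1000, 16),       # word embedding lookup table (excluded)
    "encoder.layer0.ffn.weight": (16, 64),   # counted
    "encoder.layer0.ffn.bias": (64,),        # counted
    "output_projection.weight": (1000, 16),  # softmax layer (excluded)
}

total = count_params(toy_shapes, excluded_prefixes=())  # every tensor counted
reported = count_params(toy_shapes)                     # embedding and softmax left out
```

With a real model, the same filter could be applied to its named parameters. The prefixes to exclude depend on how that particular implementation names its embedding and output layers.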