This paper evaluates the translation quality of machine translation systems for 8 language pairs: translating French, German, Spanish, and Czech to English and back. We carried out an extensive human evaluation which allowed us not only to rank the different MT systems, ...
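Scientific Credibility of Machine Translation Research: A Meta-Evaluation of 769 Papers. Published: ACL'21 (one of six Outstanding Papers). Authors: Benjamin Marie, Atsushi Fujita, Raphael Rubino. Affiliation: National Institute of Information and Communications Technology (NICT), Japan. Research goal: In recent years, academic papers on machine translation have been pouring in, but are the newly proposed algorithms and models really as...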
2009. Meta-evaluation of automatic evaluation methods for machine translation using patent translation data in NTCIR-7. In Proceedings of the 3rd Workshop on Patent Translation, pages 9-16. Echizen-ya, Hiroshi., Ehara, Terumasa., Shimohata, Sayori., Fujii, Atsushi., Utiyama,...
translation and which raised the importance of taking into account the purpose of the translation, rather than criteria, as well as the need for translation criticism to become a specialised sector of literary criticism. 1.2. Three Areas Of Evaluation 1.2.1. The Evaluation of Published Translations Translation evaluation is relevant in three areas of translation (Martínez Melis 199...
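XSTS evaluation protocol (XSTS for short): In brief, for low-resource language translation, the authors argue that accuracy deserves more attention than fluency. They therefore define a 5-point scoring scheme in which scores from 1 to 5 focus on whether the hypothesis and the reference are semantically close, rather than on the fluency of the hypothesis. Calibration Set: the calibration set consists of machine-translated English paired with the original English. Annotators working in different languages score the calibration set, which makes it possible to...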
Papineni, K., Roukos, S., Ward, T., and Zhu, W.-J., BLEU: A method for automatic evaluation of machine translation, Proc. 40th Ann. Meeting on Association for Computational Linguistics, Philadelphia, 2002, Stroudsburg, Pa.: Association for Computational Linguistics, 2002, pp. 311–318....
ccMatrix, and LASER. As part of this effort, we created a new LASER 2.0 and improved fastText language identification, which improves the quality of mining and includes open sourced training and evaluation scripts. All of our data mining resources leverage publicly available data and are open ...
Development and evaluation of a deep learning model for protein–ligand binding affinity prediction. Bioinformatics 34, 3666–3674 (2018). Jiménez, J., Skalic, M., Martinez-Rosell, G. & De Fabritiis, G. Kdeep: protein–ligand absolute binding affinity prediction via ...
Details on selection of cases, division of model development and validation data and raw performance data were frequently ambiguous or missing. AI is reported as having high diagnostic accuracy in the reported areas but requires more rigorous evaluation of its performance....
evaluation May 21, 2018 | install.sh: start running 18 language model (May 18, 2018) | meta_eval.py: small changes (May 14, 2018) | meta_eval2.py: add changes (May 16, 2018) | meta_eval5.py: change name for the outputs (May 21, 2018) | meta_nmt.py ...