然后计算它们在目标文本中的出现频率,并归一化。接着,需要计算原文本与目标文本之间的预测分数(score),该分数表示原文本与目标文本之间的差异程度。最后,将预测分数进行加权平均,得到最终的chrf评分。
CHRF(Character n-gram F-score)是一种基于字符级n-gram相似度的评价指标,主要用于衡量生成文本与参考文本之
CHRF指标是根据n-gram的精确匹配和错误匹配来评估翻译结果的准确性和流畅性的,其计算方法相对简单而有效。 CHRF指标的全称是Character n-gram F-score,其中的F-score是指综合考虑了精确度和召回率的评估指标。CHRF指标通过计算参考翻译和机器翻译之间的n-gram匹配数量,来评估机器翻译的质量。n-gram是指连续的n个...
Further experiments show that adding word unigrams and bi-grams to the standard CHRF score improves the correlations with direct assessments, though it is still not clear which option is better, unigrams only (CHRF+) or unigrams and bigrams (CHRF++). This should be investigated in future ...
chrF++ is a tool for automatic evaluation of machine translation output based on character n-gram precision and recall enhanced with word n-grams. The tool calculates the F-score averaged on all character and word n-grams, where the default character n-gram order is 6 and word n-gram order...
为综合评价不同技术和算法背景下的机器翻译在涉海领域的应用效能,构建涵盖100个代表性涉海例句的中英双语方向的测试集,基于涉海文本的语言结构特点选取BLEU、chrF++和BERTScore 3种自动评估指标,对人工智能助手ChatGPT(4.0)和文心一言(4.0),及Google Translate、Microsoft Translator、DeepL Translate、Tencent TranSmart、...
Add support for whitespace and return_sentence_level_score 2606bfa Merge remote-tracking branch 'upstream/master' into chrf++ 6ad34d8 Update CHANGELOG.md 286a7c9 codecov bot commented Nov 25, 2021 • edited Codecov Report Merging #641 (728b8e3) into master (1fef5ae) will decrease cov...
深度学习技术和生成式人工智能技术已在机器翻译领域引发质的变革,为该领域的进步开辟了新径.为综合评价不同技术和算法背景下的机器翻译在涉海领域的应用效能,构建涵盖100个代表性涉海例句的中英双语方向的测试集,基于涉海文本的语言结构特点选取BLEU,chrF++和BERTScore 3种自动评估指标,对人工智能助手ChatGPT(4.0)和...
Search or jump to... Search code, repositories, users, issues, pull requests... Provide feedback We read every piece of feedback, and take your input very seriously. Include my email address so I can be contacted Cancel Submit feedback Saved searches Use saved searches to filter your...