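Dataset overview: Ape210K is a new large-scale, template-rich math word problem dataset containing 210K Chinese elementary school-level math problems. Each problem includes the gold answer and the equations needed to derive it. Ape210K is also more diverse, with 56K templates, 25 times as many as Math23K. Analysis shows that solving Ape210K requires not only natural language understanding but also commonsense knowledge.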
In this paper, we release a new large-scale and template-rich math word problem dataset named Ape210K. It consists of 210K Chinese elementary school-level math problems, which is 9 times the size of the largest public dataset Math23K. Each problem contains both the gold answer and the equations needed to derive the answer. Ape210K is also of greater diversity with 56K templates, which is 25 times more ...
Ape210K is a large-scale and template-rich math word problem (MWP) dataset. Ape210K contains 210,488 problems and 56,532 templates. We split the whole dataset into train/valid/test. An Example of the Math Word Problems Here is an example of the math word problems. ...
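To make the notion of a "template" concrete, here is a minimal sketch that maps an equation to a template by replacing each numeric literal with a positional placeholder, then counts distinct templates. This is only one plausible reading of the term. The function names `equation_template` and `count_templates` and the sample equations are hypothetical, and the procedure used to obtain the 56,532 templates may differ.

```python
import itertools
import re

# Matches integer or decimal literals such as 11 or 3.5.
NUMBER = re.compile(r"\d+(?:\.\d+)?")

def equation_template(equation: str) -> str:
    """Replace each numeric literal with n0, n1, ... in order of appearance.

    Assumption: a template is the equation with its concrete numbers
    abstracted away. This is an illustrative definition only.
    """
    counter = itertools.count()
    compact = equation.replace(" ", "")
    return NUMBER.sub(lambda m: f"n{next(counter)}", compact)

def count_templates(equations) -> int:
    """Count the distinct templates among a collection of equations."""
    return len({equation_template(eq) for eq in equations})

# Hypothetical equations, not taken from the dataset.
samples = ["x=(11-1)*2", "x = (5 - 3) * 4", "x=3+4"]
print([equation_template(eq) for eq in samples])
print(count_templates(samples))
```

Under this definition, the first two sample equations share one template because they differ only in their numbers, so a template-rich dataset is one where many structurally different equations appear.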
Ape210K: A Large-Scale and Template-Rich Dataset of Math Word Problems. Wei Zhao, Mingyue Shang, Yang Liu, Liang Wang, Jingming Liu.
Data file: ape210k/data/train.ape.json (74.7 MB).
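A file of this size is usually easier to stream than to load in one call. The sketch below assumes, without confirmation from the text above, that the file stores one JSON object per line. If it is instead a single JSON array, `json.load` on the open file would be the appropriate call. The helper name `iter_problems` is hypothetical, and no field names are assumed.

```python
import json
from typing import Iterable, Iterator

def iter_problems(lines: Iterable[str]) -> Iterator[dict]:
    """Yield one parsed record per non-empty line.

    Assumption: the input is in JSON Lines form (one object per line).
    """
    for line in lines:
        line = line.strip()
        if not line:
            continue  # skip blank lines
        yield json.loads(line)

# Usage sketch; the path is the one shown in the repository listing:
# with open("ape210k/data/train.ape.json", encoding="utf-8") as f:
#     for problem in iter_problems(f):
#         print(sorted(problem.keys()))
#         break
```

Passing any iterable of strings keeps the parser independent of where the data comes from, so it can be checked on a few in-memory lines before reading the full file.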
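Ape210K dataset: released in 2020 jointly by Yuanfudao AI Lab and Northwest University. Ape210K is a new large-scale, template-rich math word problem dataset containing 210K Chinese elementary school-level math problems, 9 times the size of Math23K. Each problem includes the gold answer and the equations needed to derive it, and the dataset has 56K templates, 25 times as many as Math23K.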