math401-llm Source codes and datasets for How well do Large Language Models perform in Arithmetic tasks? Full evaluation of all size models. Dataset MATH 401 = 1 Euler Equation + 16 group * 25 problems Euler Equation. Add & Subtract of two integers within 10. Add & Subtract of two intege...