1 VS 1 X 6 : Mistral 7B vs Mistral 7B 6v6-fast.mp4 A new kind of benchmark ? Street Fighter III assesses the ability of LLMs to understand their environment and take actions based on a specific context. As opposed to RL models, which blindly take actions based on the reward function...