AMO-Bench from Meituan

I found a new benchmark paper from Meituan:AMO-Bench: Large Language Models StillStruggle in High School Math Competitions. This paper …

2025年11月2日 · 2 min · 251 words · BubbleBrain