meituan-longcat/AMO-Bench
Question AnsweringEN
The meituan-longcat/AMO-Bench dataset is a EN question answering resource from meituan-longcat at 2025.
About meituan-longcat/AMO-Bench
š AMO-Bench: Large Language Models Still Struggle in High School Math Competitions
š Paper
š Project Page
š» Github Repo
Updates
2026.02.05: Leaderboard Update: Qwen3-Max-Thinking achieves a new SOTA with 65.1%, while GLM-4.7 sets ...
Details
- Task
- Question Answering
- Language
- EN
- Format
- Parquet
- Rows / instances
- N/A
- Creator
- meituan-longcat
- Year
- 2025