meta-agents-research-environments/gaia2
Reinforcement Learning | EN | Benchmark
meta-agents-research-environments/gaia2 is an English-language reinforcement learning benchmark dataset published by meta-agents-research-environments in Parquet format.
This dataset is used as an LLM benchmark.
About meta-agents-research-environments/gaia2
Gaia2
Dataset Summary
Gaia2 is a benchmark dataset for evaluating AI agent capabilities in simulated environments. The dataset contains 800 scenarios that test agent performance in environments where time ...
Details
- Task: Reinforcement Learning
- Language: EN
- Format: Parquet
- Rows / instances: N/A
- Creator: meta-agents-research-environments
- Year: 2025