google/deepsearchqa
Question AnsweringENBenchmarkapache-2.0
Google/deepsearchqa is a question answering-focused benchmark dataset in EN distributed in Parquet format. It is distributed under the apache-2.0 license and falls in the n<1K size category, and has been downloaded 15.6K times.
📊 This dataset is used as an LLM benchmark. See model leaderboards →
About google/deepsearchqa
DeepSearchQA
A 900-prompt factuality benchmark from Google DeepMind, designed to evaluate agents on difficult multi-step information-seeking tasks across 17 different fields.
▶ Google DeepMind Release Blog Post▶ DeepSearchQA Leaderbo...
Details
- Task
- Question Answering
- Language
- EN
- Format
- Parquet
- Rows / instances
- N/A
- Size
- n<1K
- Creator
- Year
- 2025
- License
- apache-2.0
- Downloads
- 15623
- Likes
- 123