Question 1

What is the ai-safety-institute/AgentHarm dataset?

Accepted Answer

AgentHarm: A Benchmark for Measuring Harmfulness of LLM Agents

Maksym Andriushchenko1,†,*, Alexandra Souly2,*

Mateusz Dziemian1, Derek Duenas1, Maxwell Lin1, Justin Wang1, Dan Hendrycks1,§, Andy Zou1,¶,§, Zico Kolter1,¶, Matt Fredrikson1,¶,*

...

Question 2

Is ai-safety-institute/AgentHarm a benchmark?

Accepted Answer

ai-safety-institute/AgentHarm is a dataset for training or evaluation; it isn't tracked as a standard LLM benchmark in our catalog.

Question 3

Where can I download ai-safety-institute/AgentHarm?

Accepted Answer

ai-safety-institute/AgentHarm is available at its source: https://huggingface.co/datasets/ai-safety-institute/AgentHarm.

Question 4

What license is ai-safety-institute/AgentHarm released under?

Accepted Answer

ai-safety-institute/AgentHarm is distributed under the other license.

ai-safety-institute/AgentHarm

About ai-safety-institute/AgentHarm