Question 1

What is the walledai/AdvBench dataset?

Accepted Answer

Dataset Card for AdvBench

Paper: Universal and Transferable Adversarial Attacks on Aligned Language Models
Data: AdvBench Dataset

About

AdvBench is a set of 500 harmful behaviors formulated as instructions. These behaviors
range over...

Question 2

Is walledai/AdvBench a benchmark?

Accepted Answer

walledai/AdvBench is a dataset for training or evaluation; it isn't tracked as a standard LLM benchmark in our catalog.

Question 3

Where can I download walledai/AdvBench?

Accepted Answer

walledai/AdvBench is available at its source: https://huggingface.co/datasets/walledai/AdvBench.

walledai/AdvBench

About walledai/AdvBench