walledai/HarmBench
walledai/HarmBench is an English-language general NLP dataset distributed in Parquet format.
About walledai/HarmBench
HarmBench: A Standardized Evaluation Framework for Automated Red Teaming and Robust Refusal
Paper: HarmBench: A Standardized Evaluation Framework for Automated Red Teaming and Robust Refusal
Data: Dataset
About
In this dataset card, ...
Details
- Task: General NLP
- Language: EN
- Format: Parquet
- Rows / instances: N/A
- Creator: walledai
- Year: 2024