Question 1

What is the LLM-Tuning-Safety/HEx-PHI dataset?

Accepted Answer

HEx-PHI: Human-Extended Policy-Oriented Harmful Instruction Benchmark

This dataset contains 330 harmful instructions (30 examples × 11 prohibited categories) for evaluating LLM harmfulness.
In our work "Fine-tuning Aligned Language Models Compr...

Question 2

Is LLM-Tuning-Safety/HEx-PHI a benchmark?

Accepted Answer

LLM-Tuning-Safety/HEx-PHI is a dataset for training or evaluation; it isn't tracked as a standard LLM benchmark in our catalog.

Question 3

Where can I download LLM-Tuning-Safety/HEx-PHI?

Accepted Answer

LLM-Tuning-Safety/HEx-PHI is available at its source: https://huggingface.co/datasets/LLM-Tuning-Safety/HEx-PHI.
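Once you have a local copy, a quick sanity check against the size stated in the dataset description (30 examples × 11 categories = 330) might look like the sketch below. The row layout of (category_id, instruction) pairs and the helper name check_counts are assumptions made for illustration; they are not taken from the dataset's documentation, so adapt the loading step to whatever file format you actually downloaded.

```python
from collections import Counter

# Figures taken from the dataset description: 30 examples x 11 categories.
EXAMPLES_PER_CATEGORY = 30
NUM_CATEGORIES = 11
EXPECTED_TOTAL = EXAMPLES_PER_CATEGORY * NUM_CATEGORIES  # 330


def check_counts(rows):
    """Return a list of human-readable problems; an empty list means the counts match.

    rows: iterable of (category_id, instruction) pairs.
    This layout is hypothetical, not the dataset's documented schema.
    """
    counts = Counter(category for category, _ in rows)
    problems = []
    if len(counts) != NUM_CATEGORIES:
        problems.append(
            f"expected {NUM_CATEGORIES} categories, found {len(counts)}"
        )
    for category, n in sorted(counts.items()):
        if n != EXAMPLES_PER_CATEGORY:
            problems.append(
                f"category {category}: expected {EXAMPLES_PER_CATEGORY}, found {n}"
            )
    return problems


# Placeholder rows standing in for a real local copy (no real instructions here).
placeholder = [(c, f"instruction {c}-{i}") for c in range(1, 12) for i in range(30)]
print(check_counts(placeholder))
```

Returning a list of problems rather than raising on the first mismatch makes it easier to see every category that is short or over in one pass.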
