xTRam1/safe-guard-prompt-injection
General NLPEnglish
XTRam1/safe-guard-prompt-injection is a General NLP dataset in English from xTRam1 in Parquet format.
About xTRam1/safe-guard-prompt-injection
We formulated the prompt injection detector problem as a classification problem and trained our own language model
to detect whether a given user prompt is an attack or safe. First, to train our own prompt injection detector, we
required high-qu...
Details
- Task
- General NLP
- Language
- English
- Format
- Parquet
- Rows / instances
- N/A
- Creator
- xTRam1
- Year
- 2024