textdetox/multilingual_toxicity_dataset
Text ClassificationEN, RU, UK
Textdetox/multilingual_toxicity_dataset is a text classification dataset in EN, RU, UK from textdetox in Parquet format.
About textdetox/multilingual_toxicity_dataset
Multilingual Toxicity Detection Dataset
[2025] We extend our binary toxicity classification dataset to more languages! Now also covered: Italian, French, Hebrew, Hindglish, Japanese, Tatar. The data is prepared for TextDetox 2025 shared task.
[...
Details
- Task
- Text Classification
- Language
- EN, RU, UK
- Format
- Parquet
- Rows / instances
- N/A
- Creator
- textdetox
- Year
- 2024