Skip to content

textdetox/multilingual_toxicity_dataset

Text ClassificationEN, RU, UK

Textdetox/multilingual_toxicity_dataset is a text classification dataset in EN, RU, UK from textdetox in Parquet format.

About textdetox/multilingual_toxicity_dataset

Multilingual Toxicity Detection Dataset [2025] We extend our binary toxicity classification dataset to more languages! Now also covered: Italian, French, Hebrew, Hindglish, Japanese, Tatar. The data is prepared for TextDetox 2025 shared task. [...

Details

Task
Text Classification
Language
EN, RU, UK
Format
Parquet
Rows / instances
N/A
Creator
textdetox
Year
2024
Download

Related Text Classification datasets

FAQ