google/jigsaw_toxicity_pred
Text Classification | EN | cc0-1.0
The google/jigsaw_toxicity_pred dataset is an English (EN) text classification resource from google (2022) comprising 223,549 examples. It has 727 downloads and 34 likes. It is released under the cc0-1.0 license and falls in the 100K<n<1M size category.
About google/jigsaw_toxicity_pred
This dataset consists of a large number of Wikipedia comments that human raters have labeled for toxic behavior.
Details
- Task: Text Classification
- Language: EN
- Format: Parquet
- Rows / instances: 223,549
- Size: 100K<n<1M
- Creator: google
- Year: 2022
- License: cc0-1.0
- Downloads: 727
- Likes: 34
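
The Size entry is a bucket label derived from the row count. As a minimal sketch of that relationship, the hypothetical helper below maps a row count to a label in the same "100K<n<1M" style. The function name, the list of buckets, and the treatment of exact boundary values are assumptions made for illustration, not something this page specifies.

```python
# Hypothetical helper: maps a row count to a size-bucket label written in the
# same style as the "100K<n<1M" label on this page. The bucket list and the
# boundary handling (upper bound exclusive) are assumptions.
SIZE_BUCKETS = [
    (1_000, "n<1K"),
    (10_000, "1K<n<10K"),
    (100_000, "10K<n<100K"),
    (1_000_000, "100K<n<1M"),
    (10_000_000, "1M<n<10M"),
]


def size_category(num_rows: int) -> str:
    """Return the size-bucket label for a non-negative row count."""
    if num_rows < 0:
        raise ValueError("num_rows must be non-negative")
    # Pick the first bucket whose upper bound exceeds the row count.
    for upper_bound, label in SIZE_BUCKETS:
        if num_rows < upper_bound:
            return label
    # Anything at or above the largest listed bound.
    return "n>10M"


print(size_category(223_549))  # 100K<n<1M
```

With the 223,549 rows listed above, the sketch returns the same 100K<n<1M label shown in the Size entry.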