google/jigsaw_toxicity_pred

Text Classification | EN | cc0-1.0

The google/jigsaw_toxicity_pred dataset is an English text classification resource published by google in 2022, comprising 223,549 examples. It has 727 downloads and 34 likes. It is released under the cc0-1.0 license and falls in the 100K<n<1M size category.

About google/jigsaw_toxicity_pred

This dataset consists of a large number of Wikipedia comments that have been labeled by human raters for toxic behavior.

Details

Task: Text Classification
Language: EN
Format: Parquet
Rows / instances: 223,549
Size: 100K<n<1M
Creator: google
Year: 2022
License: cc0-1.0
Downloads: 727
Likes: 34
