Skip to content

HuggingFaceFW/fineweb-edu-llama3-annotations

General NLPENodc-by

The HuggingFaceFW/fineweb-edu-llama3-annotations dataset is a EN General NLP resource from HuggingFaceFW at 2024 comprising 467,424 examples. With 265 downloads and 49 likes, it is actively used by the community. It is released under the odc-by license and is a 100K<n<1M-scale dataset.

About HuggingFaceFW/fineweb-edu-llama3-annotations

Annotations for 📚 FineWeb-Edu classifier This dataset contains the annotations used for training 📚 FineWeb-Edu educational quality classifier. We prompt Llama-3-70B-Instruct to score web pages from 🍷 FineWeb based on their educational value. No...

Details

Task
General NLP
Language
EN
Format
Parquet
Rows / instances
467424
Size
100K<n<1M
Creator
HuggingFaceFW
Year
2024
License
odc-by
Downloads
265
Likes
49
Download Homepage

Related General NLP datasets

FAQ