HuggingFaceFW/fineweb-edu-llama3-annotations
General NLP, EN, odc-by
The HuggingFaceFW/fineweb-edu-llama3-annotations dataset is an English-language (EN) General NLP resource published by HuggingFaceFW in 2024, comprising 467,424 examples. It has 265 downloads and 49 likes. It is released under the odc-by license and falls in the 100K<n<1M size category.
About HuggingFaceFW/fineweb-edu-llama3-annotations
Annotations for 📚 FineWeb-Edu classifier
This dataset contains the annotations used for training the 📚 FineWeb-Edu educational quality classifier. We prompt Llama-3-70B-Instruct to score web pages from 🍷 FineWeb based on their educational value.
No...
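As a rough illustration of how such annotations might be inspected, the sketch below loads the dataset with the Hugging Face `datasets` library and summarizes the scores. It is a minimal sketch under stated assumptions, not the authors' pipeline: the `"train"` split name, the `"score"` column name, and the threshold of 3 are assumptions not stated on this page, and the helper names are hypothetical. The sample rows are made up for illustration.

```python
from collections import Counter
from typing import Iterable, List, Mapping

DATASET_ID = "HuggingFaceFW/fineweb-edu-llama3-annotations"


def load_annotations(split: str = "train"):
    """Load the annotations from the Hugging Face Hub (needs network access).

    The split name "train" is an assumption; check the dataset page.
    """
    # Imported lazily so the helpers below work without `datasets` installed.
    from datasets import load_dataset

    return load_dataset(DATASET_ID, split=split)


def score_histogram(rows: Iterable[Mapping], score_key: str = "score") -> Counter:
    """Count how many rows received each score.

    The column name "score" is an assumption about the schema.
    """
    return Counter(int(row[score_key]) for row in rows)


def keep_educational(
    rows: Iterable[Mapping], threshold: int = 3, score_key: str = "score"
) -> List[Mapping]:
    """Keep rows whose score is at least `threshold` (an illustrative cutoff)."""
    return [row for row in rows if row[score_key] >= threshold]


# Made-up rows standing in for real annotations, so the helpers can be
# tried without downloading anything.
sample = [
    {"text": "page a", "score": 0},
    {"text": "page b", "score": 2},
    {"text": "page c", "score": 3},
    {"text": "page d", "score": 4},
    {"text": "page e", "score": 3},
]

histogram = score_histogram(sample)
educational = keep_educational(sample)
```

With network access, `score_histogram(load_annotations())` would apply the same summary to the real data, provided the assumed column name matches the actual schema.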
Details
- Task: General NLP
- Language: EN
- Format: Parquet
- Rows / instances: 467,424
- Size: 100K<n<1M
- Creator: HuggingFaceFW
- Year: 2024
- License: odc-by
- Downloads: 265
- Likes: 49