mlabonne/FineTome-100k
General NLPEnglish
Created by mlabonne at 2024, the mlabonne/FineTome-100k is a General NLP dataset in English in Parquet format.
About mlabonne/FineTome-100k
FineTome-100k
The FineTome dataset is a subset of arcee-ai/The-Tome (without arcee-ai/qwen2-72b-magpie-en), re-filtered using HuggingFaceFW/fineweb-edu-classifier.
It was made for my article "Fine-tune Llama 3.1 Ultra-Efficiently with Unsloth".
Details
- Task
- General NLP
- Language
- English
- Format
- Parquet
- Rows / instances
- N/A
- Creator
- mlabonne
- Year
- 2024