Skip to content

mlabonne/FineTome-100k

General NLPEnglish

Created by mlabonne at 2024, the mlabonne/FineTome-100k is a General NLP dataset in English in Parquet format.

About mlabonne/FineTome-100k

FineTome-100k The FineTome dataset is a subset of arcee-ai/The-Tome (without arcee-ai/qwen2-72b-magpie-en), re-filtered using HuggingFaceFW/fineweb-edu-classifier. It was made for my article "Fine-tune Llama 3.1 Ultra-Efficiently with Unsloth".

Details

Task
General NLP
Language
English
Format
Parquet
Rows / instances
N/A
Creator
mlabonne
Year
2024
Download

Related General NLP datasets

FAQ