HuggingFaceFW/finephrase
Text GenerationENodc-by
Created by HuggingFaceFW at 2026, the HuggingFaceFW/finephrase is a text generation dataset in EN in Parquet format. With 418.6K downloads and 130 likes, it is actively used by the community. It is released under the odc-by license and is a 1B<n<10B-scale dataset.
About HuggingFaceFW/finephrase
Dataset Card for HuggingFaceFW/finephrase
Dataset Summary
Synthetic data generated by DataTrove:
Model: HuggingFaceTB/SmolLM2-1.7B-Instruct (main)
Source dataset: HuggingFaceFW/fineweb-edu, config sample-350BT, split train
G...
Details
- Task
- Text Generation
- Language
- EN
- Format
- Parquet
- Rows / instances
- N/A
- Size
- 1B<n<10B
- Creator
- HuggingFaceFW
- Year
- 2026
- License
- odc-by
- Downloads
- 418613
- Likes
- 130