Skip to content

HuggingFaceFW/finephrase

Text GenerationENodc-by

Created by HuggingFaceFW at 2026, the HuggingFaceFW/finephrase is a text generation dataset in EN in Parquet format. With 418.6K downloads and 130 likes, it is actively used by the community. It is released under the odc-by license and is a 1B<n<10B-scale dataset.

About HuggingFaceFW/finephrase

Dataset Card for HuggingFaceFW/finephrase Dataset Summary Synthetic data generated by DataTrove: Model: HuggingFaceTB/SmolLM2-1.7B-Instruct (main) Source dataset: HuggingFaceFW/fineweb-edu, config sample-350BT, split train G...

Details

Task
Text Generation
Language
EN
Format
Parquet
Rows / instances
N/A
Size
1B<n<10B
Creator
HuggingFaceFW
Year
2026
License
odc-by
Downloads
418613
Likes
130
Download Homepage

Related Text Generation datasets

FAQ