OpenLLM-France/Lucie-Training-Dataset
Text GenerationEN, FR, DE
OpenLLM-France/Lucie-Training-Dataset is a text generation dataset in EN, FR, DE from OpenLLM-France in Parquet format.
About OpenLLM-France/Lucie-Training-Dataset
Lucie Training Dataset Card
The Lucie Training Dataset is a curated collection of text data
in English, French, German, Spanish and Italian culled from a variety of sources including: web data, video subtitles, academic papers,
digital books, n...
Details
- Task
- Text Generation
- Language
- EN, FR, DE
- Format
- Parquet
- Rows / instances
- N/A
- Creator
- OpenLLM-France
- Year
- 2024