LLM360/AmberDatasets
General NLPEN
Created by LLM360 at 2023, the LLM360/AmberDatasets is a General NLP dataset in EN in Parquet format.
About LLM360/AmberDatasets
Amber-Data
This dataset contains the fully prepared data sequence used to train Amber, an
LLM360 model.
About LLM360
LLM360 is an initiative for comprehensive and fully open-sourced LLMs,
where all training details, model checkpo...
Details
- Task
- General NLP
- Language
- EN
- Format
- Parquet
- Rows / instances
- N/A
- Creator
- LLM360
- Year
- 2023