open-phi/programming_books_llama
General NLPEnglish
The open-phi/programming_books_llama dataset is a English General NLP resource from open-phi at 2023.
About open-phi/programming_books_llama
Dataset Card for "programming_books_llama"
400M tokens of programming books generated by gpt-3.5 (70M tokens) and a finetuned codellama 34b. The gpt-3.5 data is extremely high quality. The llama data has lower quality and shorter length, but ...
Details
- Task
- General NLP
- Language
- English
- Format
- Parquet
- Rows / instances
- N/A
- Creator
- open-phi
- Year
- 2023