Skip to content

open-phi/programming_books_llama

General NLPEnglish

The open-phi/programming_books_llama dataset is a English General NLP resource from open-phi at 2023.

About open-phi/programming_books_llama

Dataset Card for "programming_books_llama" 400M tokens of programming books generated by gpt-3.5 (70M tokens) and a finetuned codellama 34b. The gpt-3.5 data is extremely high quality. The llama data has lower quality and shorter length, but ...

Details

Task
General NLP
Language
English
Format
Parquet
Rows / instances
N/A
Creator
open-phi
Year
2023
Download

Related General NLP datasets

FAQ