philschmid/sharegpt-raw
General NLPEnglish
The philschmid/sharegpt-raw dataset is a English General NLP resource from philschmid at 2023.
About philschmid/sharegpt-raw
Prepraration
pip3 install -r requirements.txt
Data Cleaning
merge two raw json files and json beautify the merged file
python merge.py sharegpt_90k_raw_dataset/sg_90k_part1.json sharegpt_90k_raw_dataset/sg_90k_part2.json sharegpt...
Details
- Task
- General NLP
- Language
- English
- Format
- Parquet
- Rows / instances
- N/A
- Creator
- philschmid
- Year
- 2023