OpenLeecher/lmsys_chat_1m_clean
General NLPEN
Created by OpenLeecher at 2024, the OpenLeecher/lmsys_chat_1m_clean is a General NLP dataset in EN in Parquet format.
About OpenLeecher/lmsys_chat_1m_clean
Cleaning and Categorizing
A few weeks ago, I had the itch to do some data crunching, so I began this project - to clean and classify lmsys-chat-1m. The process was somewhat long and tedious, but here is the quick overview:
1. Removing ...
Details
- Task
- General NLP
- Language
- EN
- Format
- Parquet
- Rows / instances
- N/A
- Creator
- OpenLeecher
- Year
- 2024