Skip to content

OpenLeecher/lmsys_chat_1m_clean

General NLPEN

Created by OpenLeecher at 2024, the OpenLeecher/lmsys_chat_1m_clean is a General NLP dataset in EN in Parquet format.

About OpenLeecher/lmsys_chat_1m_clean

Cleaning and Categorizing A few weeks ago, I had the itch to do some data crunching, so I began this project - to clean and classify lmsys-chat-1m. The process was somewhat long and tedious, but here is the quick overview: 1. Removing ...

Details

Task
General NLP
Language
EN
Format
Parquet
Rows / instances
N/A
Creator
OpenLeecher
Year
2024
Download

Related General NLP datasets

FAQ