Skip to content

bavard/personachat_truecased

General NLPEnglish

Bavard/personachat_truecased is a General NLP dataset in English from bavard in Parquet format. And falls in the 100K<n<1M size category, and has been downloaded 3K times.

About bavard/personachat_truecased

A version of the PersonaChat dataset that has been true-cased, and also has been given more normalized punctuation. The original PersonaChat dataset is in all lower case, and has extra space around each clause/sentence separating punctuation mark....

Details

Task
General NLP
Language
English
Format
Parquet
Rows / instances
N/A
Size
100K<n<1M
Creator
bavard
Year
2022
Downloads
2960
Likes
45
Download Homepage

Related General NLP datasets

FAQ