derekxkwan/syntheory_plus

General NLP | English | mit

The derekxkwan/syntheory_plus dataset is an English General NLP resource published by derekxkwan in 2026. It has 14,612 downloads and 0 likes. It is released under the MIT license and falls in the 100K<n<1M size category.

About derekxkwan/syntheory_plus

Notes: Dataset Viewer is disabled as we wanted to keep everything in WAV format with CSV metadata instead of Parquet files, and the dataset is too large for Dataset Viewer to index properly.

Dataset Authors: Derek Kwan and Patrick Don...
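Because the files are kept as WAV audio with CSV metadata rather than Parquet, they can be read with the Python standard library once downloaded. The following is a minimal sketch, not the authors' loader. It assumes the dataset is already in a local directory, and the names `metadata.csv` and `file_name` are hypothetical placeholders; the real CSV file name and column names should be checked against the dataset itself.

```python
import csv
import wave
from pathlib import Path


def read_metadata(csv_path):
    """Return the rows of a CSV metadata file as a list of dicts."""
    with open(csv_path, newline="", encoding="utf-8") as f:
        return list(csv.DictReader(f))


def wav_summary(wav_path):
    """Return channel count, sample rate, and duration in seconds of a WAV file."""
    with wave.open(str(wav_path), "rb") as w:
        frames = w.getnframes()
        rate = w.getframerate()
        return {
            "channels": w.getnchannels(),
            "sample_rate": rate,
            "seconds": frames / rate,
        }


def pair_rows_with_audio(root, csv_name="metadata.csv", path_column="file_name"):
    """Yield (row, summary) for each metadata row whose WAV file exists under root.

    csv_name and path_column are hypothetical defaults, not the dataset's
    documented names; pass the real ones when calling.
    """
    root = Path(root)
    for row in read_metadata(root / csv_name):
        wav_path = root / row[path_column]
        if wav_path.is_file():  # skip rows that point at missing audio
            yield row, wav_summary(wav_path)
```

Calling `list(pair_rows_with_audio("path/to/local/copy"))` would give one `(row, summary)` pair per metadata row that has a matching WAV file; rows whose audio is missing are skipped rather than raising an error.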

Details

Task: General NLP
Language: English
Format: WAV audio with CSV metadata (per the dataset notes; not Parquet)
Rows / instances: N/A
Size: 100K<n<1M
Creator: derekxkwan
Year: 2026
License: MIT
Downloads: 14,612
Likes: 0