
jondurbin/cinematika-v0.1

General NLP, English, cc-by-nc-4.0

jondurbin/cinematika-v0.1 is an English-language General NLP dataset distributed in Parquet format under the cc-by-nc-4.0 license. It falls in the 10K<n<100K size category and has been downloaded 309 times.

About jondurbin/cinematika-v0.1

Cinematika is a collection of 211 movie scripts converted to novel-style, multi-character RP data. The conversions were performed using a mix of manual regexp parsing and LLM augmentation via in-context learning with a custom mistr...

Details

Task: General NLP
Language: English
Format: Parquet
Rows / instances: N/A
Size: 10K<n<100K
Creator: jondurbin
Year: 2023
License: cc-by-nc-4.0
Downloads: 309
Likes: 60
