Skip to content

dominguesm/Canarim-Instruct-PTBR-Dataset

General NLPPTcc-by-nc-4.0

Created by dominguesm at 2023, the dominguesm/Canarim-Instruct-PTBR-Dataset is a General NLP dataset in PT containing 317,932 records in Parquet format. With 351 downloads and 43 likes, it is actively used by the community. It is released under the cc-by-nc-4.0 license and is a 100K<n<1M-scale dataset.

About dominguesm/Canarim-Instruct-PTBR-Dataset

🐥 🇧🇷 Canarim Instruct Dataset [🐱 Github] What's Canarim? Canarim is a dataset with over 300,000 instructions in Portuguese, ranging from simple instructions like "Descreva os efeitos do aquecimento global" to more comple...

Details

Task
General NLP
Language
PT
Format
Parquet
Rows / instances
317932
Size
100K<n<1M
Creator
dominguesm
Year
2023
License
cc-by-nc-4.0
Downloads
351
Likes
43
Download Homepage

Related General NLP datasets

FAQ