dominguesm/Canarim-Instruct-PTBR-Dataset
General NLP · PT · cc-by-nc-4.0
Created by dominguesm in 2023, dominguesm/Canarim-Instruct-PTBR-Dataset is a general NLP dataset in Portuguese (PT) containing 317,932 records in Parquet format. It has 351 downloads and 43 likes. It is released under the cc-by-nc-4.0 license and falls in the 100K<n<1M size category.
About dominguesm/Canarim-Instruct-PTBR-Dataset
🐥 🇧🇷 Canarim Instruct Dataset
[🐱 GitHub]
What's Canarim?
Canarim is a dataset with over 300,000 instructions in Portuguese, ranging from simple instructions like "Descreva os efeitos do aquecimento global" to more comple...
Details
- Task: General NLP
- Language: PT
- Format: Parquet
- Rows / instances: 317,932
- Size: 100K<n<1M
- Creator: dominguesm
- Year: 2023
- License: cc-by-nc-4.0
- Downloads: 351
- Likes: 43
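
The details above can be tied together with a short Python sketch. It is a minimal illustration, not an official loader: it assumes the Hugging Face `datasets` library is installed and that the dataset exposes a `train` split (the split name is an assumption, not stated on this page). The helper names `size_category` and `load_canarim` are hypothetical and exist only for this example.

```python
DATASET_ID = "dominguesm/Canarim-Instruct-PTBR-Dataset"

# Size buckets in the style shown on the card: (exclusive upper bound, label).
_SIZE_BUCKETS = [
    (1_000, "n<1K"),
    (10_000, "1K<n<10K"),
    (100_000, "10K<n<100K"),
    (1_000_000, "100K<n<1M"),
    (10_000_000, "1M<n<10M"),
]


def size_category(n_rows: int) -> str:
    """Map a row count to a card-style size bucket (hypothetical helper)."""
    for upper, label in _SIZE_BUCKETS:
        if n_rows < upper:
            return label
    # Catch-all label invented for this sketch; larger buckets are not modeled.
    return "n>10M"


def load_canarim(split: str = "train"):
    """Download the dataset from the Hugging Face Hub.

    The default split name "train" is an assumption. The import is kept
    inside the function so the rest of the module works without `datasets`.
    """
    from datasets import load_dataset

    return load_dataset(DATASET_ID, split=split)


# The row count listed on this page falls in the bucket the card reports.
print(size_category(317_932))
```

Calling `load_canarim()` needs network access and downloads the Parquet files. Because the license is cc-by-nc-4.0, the data is limited to non-commercial use.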