andersonbcdefg/synthetic_retrieval_tasks
General NLPEnglishmit
Created by andersonbcdefg at 2024, the andersonbcdefg/synthetic_retrieval_tasks is a General NLP dataset in English in Parquet format. With 26 downloads and 79 likes, it is actively used by the community. It is released under the mit license and is a 100K<n<1M-scale dataset.
About andersonbcdefg/synthetic_retrieval_tasks
Synthetic data designed as prompts for generating embeddings training data for retrieval.
The "iteration" column refers to how the data was generated.
Iteration 1: Use the following pool of seed tasks, prompt GPT-3.5-Turbo to generate additional ...
Details
- Task
- General NLP
- Language
- English
- Format
- Parquet
- Rows / instances
- N/A
- Size
- 100K<n<1M
- Creator
- andersonbcdefg
- Year
- 2024
- License
- mit
- Downloads
- 26
- Likes
- 79