Skip to content

andersonbcdefg/synthetic_retrieval_tasks

General NLPEnglishmit

Created by andersonbcdefg at 2024, the andersonbcdefg/synthetic_retrieval_tasks is a General NLP dataset in English in Parquet format. With 26 downloads and 79 likes, it is actively used by the community. It is released under the mit license and is a 100K<n<1M-scale dataset.

About andersonbcdefg/synthetic_retrieval_tasks

Synthetic data designed as prompts for generating embeddings training data for retrieval. The "iteration" column refers to how the data was generated. Iteration 1: Use the following pool of seed tasks, prompt GPT-3.5-Turbo to generate additional ...

Details

Task
General NLP
Language
English
Format
Parquet
Rows / instances
N/A
Size
100K<n<1M
Creator
andersonbcdefg
Year
2024
License
mit
Downloads
26
Likes
79
Download Homepage

Related General NLP datasets

FAQ