Skip to content

sentence-transformers/embedding-training-data

Feature ExtractionEN

Sentence-transformers/embedding-training-data is a feature extraction-focused dataset in EN distributed in Parquet format.

About sentence-transformers/embedding-training-data

Training Data for Text Embedding Models [!NOTE] This repository contains raw datasets, all of which have also been formatted for easy training in the Embedding Model Datasets collection. We recommend looking there first. This repository conta...

Details

Task
Feature Extraction
Language
EN
Format
Parquet
Rows / instances
N/A
Creator
sentence-transformers
Year
2022
Download

Related Feature Extraction datasets

FAQ