microsoft/mediflow
Text GenerationEN
Microsoft/mediflow is a text generation-focused dataset in EN distributed in Parquet format.
About microsoft/mediflow
MediFlow
A large-scale synthetic instruction dataset of 2.5M rows (~700k unique instructions) for clinical natural language processing covering 14 task types and 98 fine-grained input clinical documents.
t-SNE 2D Plot of MediFlow Embedd...
Details
- Task
- Text Generation
- Language
- EN
- Format
- Parquet
- Rows / instances
- N/A
- Creator
- microsoft
- Year
- 2025