microsoft/mediflow

Text Generation | EN

microsoft/mediflow is an English-language text generation dataset distributed in Parquet format.

About microsoft/mediflow

MediFlow is a large-scale synthetic instruction dataset of 2.5M rows (~700k unique instructions) for clinical natural language processing, covering 14 task types and 98 fine-grained input clinical documents.

[Figure: t-SNE 2D Plot of MediFlow Embedd...]
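The description above distinguishes total rows from unique instructions (2.5M versus roughly 700k). The following is a minimal sketch of how that ratio could be checked on any set of rows. The column name "instruction", the split name "train", and the helper names instruction_stats and load_mediflow_rows are assumptions made for illustration and are not confirmed by this page; the sample rows are made up. Loading from the Hub would require the Hugging Face datasets package and network access, and may also need a configuration name.

```python
from collections import Counter
from typing import Iterable, Mapping


def instruction_stats(rows: Iterable[Mapping[str, str]], key: str = "instruction") -> dict:
    """Return total rows, unique instruction count, and rows per unique instruction.

    `key` is a hypothetical column name; adjust it to the dataset's real schema.
    """
    counts = Counter(row[key] for row in rows)
    total = sum(counts.values())
    unique = len(counts)
    return {
        "rows": total,
        "unique_instructions": unique,
        "rows_per_instruction": total / unique if unique else 0.0,
    }


def load_mediflow_rows(split: str = "train"):
    """Stream rows from the Hub (assumed split name; needs `pip install datasets`)."""
    # Imported lazily so instruction_stats has no third-party dependency.
    from datasets import load_dataset
    return load_dataset("microsoft/mediflow", split=split, streaming=True)


# Tiny in-memory stand-in for real rows (made-up values, not taken from the dataset).
sample = [
    {"instruction": "Summarize the discharge note."},
    {"instruction": "Summarize the discharge note."},
    {"instruction": "Extract medications from the note."},
]
stats = instruction_stats(sample)
print(stats)
```

With the figures quoted in the description, the same calculation would give roughly 2.5M / 700k, or about 3.6 rows per unique instruction.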

Details

Task: Text Generation
Language: EN
Format: Parquet
Rows / instances: 2.5M (~700k unique instructions)
Creator: microsoft
Year: 2025