Skip to content

tokyotech-llm/swallow-code

Text GenerationEN, JA

Tokyotech-llm/swallow-code is a text generation dataset in EN, JA from tokyotech-llm in Parquet format.

About tokyotech-llm/swallow-code

SwallowCode Notice May 21, 2025: We have deleted ablation/exp1-the-stack-v2-train-smol-ids-python because it was flagged as potentially containing unsafe data collected from the Python subset of https://huggingface.co/datasets/big...

Details

Task
Text Generation
Language
EN, JA
Format
Parquet
Rows / instances
N/A
Creator
tokyotech-llm
Year
2025
Download

Related Text Generation datasets

FAQ