tokyotech-llm/swallow-code
Text GenerationEN, JA
Tokyotech-llm/swallow-code is a text generation dataset in EN, JA from tokyotech-llm in Parquet format.
About tokyotech-llm/swallow-code
SwallowCode
Notice
May 21, 2025: We have deleted ablation/exp1-the-stack-v2-train-smol-ids-python because it was flagged as potentially containing unsafe data collected from the Python subset of https://huggingface.co/datasets/big...
Details
- Task
- Text Generation
- Language
- EN, JA
- Format
- Parquet
- Rows / instances
- N/A
- Creator
- tokyotech-llm
- Year
- 2025