Skip to content

CODE Datasets

We catalog 10 CODE datasets for NLP and machine learning, including 1 benchmarks. Browse the list below or narrow down by task.

This page covers CODE-language data. Our directory includes 10 datasets in CODE.

Updated June 2026

What tasks do CODE datasets cover?

Datasets in other languages

Frequently asked questions