nyuuzyou/google-code-archive
Text Generation · CODE, EN · Benchmark
Created by nyuuzyou in 2026, nyuuzyou/google-code-archive is a text generation benchmark dataset covering code and English text (CODE, EN), distributed in Parquet format.
This dataset is used as an LLM benchmark.
About nyuuzyou/google-code-archive
Google Code Archive Dataset
Dataset Description
This dataset was compiled from the Google Code Archive, a preserved snapshot of projects hosted on Google Code, Google's open-source project hosting service that operated from 2006 to 2...
Details
- Task: Text Generation
- Language: CODE, EN
- Format: Parquet
- Rows / instances: N/A
- Creator: nyuuzyou
- Year: 2026