IlyaGusev/habr

Text Generation | RU, EN

The IlyaGusev/habr dataset is a Russian- and English-language text generation resource published by IlyaGusev in 2023. It comprises 302,049 examples, which places it in the 100K<n<1M size category, and it has 508 downloads and 33 likes.

About IlyaGusev/habr

Habr dataset

Description

Summary: Dataset of posts and comments from habr.com, a Russian collaborative blog about IT, computer science and anything related to the Internet.
Script: create_habr.py
Point of Contact: Ilya Gusev
Language...

Details

Task: Text Generation
Language: RU, EN
Format: Parquet
Rows / instances: 302,049
Size: 100K<n<1M
Creator: IlyaGusev
Year: 2023
Downloads: 508
Likes: 33
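
The relationship between the row count and the size label can be sketched in a few lines of Python. This is only an illustration: the bucket labels are modeled on the "100K<n<1M" label shown in the details, while the full list of buckets, the exclusive upper bounds, the catch-all label, and the function name size_category are assumptions, not something this page specifies.

```python
# Assumed buckets as (exclusive upper bound, label) pairs.
# Only "100K<n<1M" appears on this page; the other labels are assumptions
# that follow the same naming pattern.
BUCKETS = [
    (1_000, "n<1K"),
    (10_000, "1K<n<10K"),
    (100_000, "10K<n<100K"),
    (1_000_000, "100K<n<1M"),
    (10_000_000, "1M<n<10M"),
]


def size_category(num_rows: int) -> str:
    """Return the size label for a dataset with num_rows rows (hypothetical helper)."""
    if num_rows < 0:
        raise ValueError("num_rows must be non-negative")
    for upper_bound, label in BUCKETS:
        if num_rows < upper_bound:
            return label
    # Hypothetical catch-all for anything at or above the last bound.
    return "n>10M"


# The row count listed for IlyaGusev/habr.
print(size_category(302_049))
```

With 302,049 rows, the count is at least 100,000 and below 1,000,000, so it falls in the same bucket as the size listed in the details.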