Dr3dre/Genius-song-lyrics-cleaned
Text Classification, Sentence Similarity | EN, IT, IA | cc-by-4.0
Dr3dre/Genius-song-lyrics-cleaned is a text classification and sentence similarity dataset in EN, IT, and IA that provides 5,134,856 labeled examples in Parquet format. It is distributed under the cc-by-4.0 license, falls in the 1M<n<10M size category, and has been downloaded about 20K times.
About Dr3dre/Genius-song-lyrics-cleaned
🎵 Genius Song Lyrics cleaned Dataset
Dataset Description
This dataset is derived from Genius Song Lyrics and contains cleaned and normalized lyrics for more than 5 million songs, designed for large-scale topic modeli...
Details
- Task: Text Classification, Sentence Similarity
- Language: EN, IT, IA
- Format: Parquet
- Rows / instances: 5,134,856
- Size: 1M<n<10M
- Creator: Dr3dre
- Year: 2026
- License: cc-by-4.0
- Downloads: 20,020
- Likes: 1
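With more than 5 million rows, it can be convenient to inspect a few records before downloading everything. The following is a minimal sketch, not an official loading recipe. It assumes the dataset is hosted on the Hugging Face Hub under the id shown above, that the `datasets` package is installed, and that a split named `train` exists. The helper name `stream_first_rows` is hypothetical, and no column names are assumed because this page does not list them.

```python
from itertools import islice

# Repository id as given on this page.
DATASET_ID = "Dr3dre/Genius-song-lyrics-cleaned"


def stream_first_rows(n=3, split="train"):
    """Return the first n rows as dicts without downloading the full dataset.

    Assumptions (not confirmed by this page): the dataset is reachable on the
    Hugging Face Hub under DATASET_ID and has a split called "train".
    """
    # Imported lazily so that defining this helper needs no third-party package.
    from datasets import load_dataset

    # streaming=True iterates over the remote Parquet files lazily.
    ds = load_dataset(DATASET_ID, split=split, streaming=True)
    return list(islice(ds, n))
```

Calling `stream_first_rows(3)` requires network access. Printing `sorted(row.keys())` for each returned row is a simple way to discover the actual column names.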