Skip to content

Dr3dre/Genius-song-lyrics-cleaned

Text ClassificationSentence SimilarityEN, IT, IAcc-by-4.0

Dr3dre/Genius-song-lyrics-cleaned is a text classification-focused dataset in EN, IT, IA that provides 5,134,856 labeled examples distributed in Parquet format. It is distributed under the cc-by-4.0 license and falls in the 1M<n<10M size category, and has been downloaded 20K times.

About Dr3dre/Genius-song-lyrics-cleaned

🎵 Genius Song Lyrics cleaned Dataset Dataset Description This dataset is originally taken from Genius Song Lyrics and it contains cleaned and normalized song lyrics for more than 5 million songs, designed for large-scale topic modeli...

Details

Task
Text Classification, Sentence Similarity
Language
EN, IT, IA
Format
Parquet
Rows / instances
5134856
Size
1M<n<10M
Creator
Dr3dre
Year
2026
License
cc-by-4.0
Downloads
20020
Likes
1
Download Homepage

FAQ