Repository for the final project of the 2023 Data Mining course, taught by Aris Anagnostopoulos at Sapienza, Rome. ๐ฎ๐น
The dataset for full lyrics generation task is composed of 10000 song lyrics divided into 5 genres: indie, metal, pop, rap and rock. Each genre has 20 artist with 100 songs each. All songs are in English ๐ฌ๐ง Check it in this repo or on Kaggle
The dataset for the lyrics completion task, containing 243 Kanye West songs is public on Kaggle here
Simone Teglia, [email protected]