Code Monkey home page Code Monkey logo

shafiahmed / datadata Goto Github PK

View Code? Open in Web Editor NEW

This project forked from alexing/datadata

0.0 1.0 0.0 84.42 MB

Pulling data using both the Genius API and the Spotify API I've been able to analyze Jorge Drexler's music and get some insights and visualizations on his creative process and his songs in general; both from the lyrics side and the musical theory side. Wordcount, lexical and lyrical density, sentiment analysis and analysis of musical components like tempo, time signature and key are all taken into account. Also, in the end, gloom_index is used combining both lyrics and music.

Jupyter Notebook 99.99% Python 0.01%

datadata's Introduction

Data, data

A statistical analysis and exploration on Jorge Drexler's music and lyrics.

by Alex Ingberg

Data, data is a homage to the great Uruguayan musician and songwriter Jorge Drexler.

Pulling data using both the Genius API and the Spotify API I've been able to analyze Jorge's music and get some insights and visualizations on his creative process and his songs in general; both from the lyrics side and the musical theory side.

Wordcount, lexical and lyrical density, sentiment analysis and analysis of musical components like tempo, time signature and key are all taken into account. Also, in the end, gloom_index is used combining both lyrics and music.

To check the analysis, go here.

To check how i built the database, go here.

Some cool samples from the visualizations:

NRC emotions through the years Tempo by albums Usage of keys Top 10 songs with more words Wordcloud Lyrical density vs lexical density Correlation in negative NRC emotions

This whole project has been created using Python 3, Jupyter Notebook and a little bit of PyCharm.

I created the databases with pandas, BeautifulSoup, Spotipy (an amazing Python wrapper for the Web Spotify API), and the Genius API and Web Spotify API.

To work on the analysis the tools I used were pandas, NumPy, Matplotlib, Seaborn, scikit-learn, SciPy, Natural Language Toolkit, wordcloud and py-lex.

To check the analysis, go here.

To check how i built the database, go here.

References:

[1] Text analysis in Pandas with some TF-IDF (again) by Jakub Nowacki

[2] Everything in Its Right Place: Visualization and Content Analysis of Radiohead Lyrics by Myles Harrison

[3] Data Visualization and Analysis of Taylor Swift’s Song Lyrics by Preetish Panda

[4] fitteR happieR by Rcharlie

[5] Quantifying Sufjan Stevens with the Genius API and NLTK by Jonathan Evans

datadata's People

Contributors

alexing avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.