Code Monkey home page Code Monkey logo

spotify-top-hits-2000-2019's Introduction


Spotify Top Hits (2000-2019) | Exploratory Data Analysis

--

Python Jupyter Notebook Pandas NumPy Plotly

--

About

This notebook presents an Exploratory Data Analysis of the dataset Top Hits Spotify from 2000-2019 posted on Kaggle by user Mark Koverha.
The dataset is formed by 2000 rows and 18 columns, whose content description is as follows:

  • artist: Name of the artist.

  • song: Name of the track.

  • duration_ms: Duration of the track in milliseconds.

  • explicit: Whether or not the content of a song is offensive or inappropriate for children.

  • year: Year of release of the track.

  • popularity: The higher the value the more popular the song is.

  • danceability: Describes how suitable the track is for dancing based on tempo, rythm stability, beat strength, and others, where a value closer to 0.0 indicates lower danceability and a value closer to 1.0 indicates higher danceability.

  • energy: It represents the perceptual measure of intensity and activity going from 0.0 to 1.0

  • key: What key the track is in. Uses integer to represent keys where 0 = C, 1 = C♯/D♭, 2 = D, and so on. In case there's no key detected, the value is -1.

  • loudness: Represents the overall loudness of a track in decibels (dB) and it's the quality of a sound that is the primary psychological correlate of physical strength (amplitude). Loudness values are averaged across the entire track, with values typically between the range of -60 and 0 dB.

  • mode: Indicates if the track is in a major or in a minor scale, where major = 1 and minor = 0.

  • speechiness: It detects the presence of spoken words in a track. The more exclusively speech-like the recording, for instance a talk show or an audiobook, the closer to 1.0 will be the attributed value. Values above 0.66 describe tracks that are mostly made entirely of spoken words, while values between 0.33 and 0.66 describe tracks that may contain both music and speech. Values below 0.33 most likely represent music and other non-speech tracks.

  • acousticness: A confidence measure from 0.0 to 1.0 that indicates whether a track is acoustic or not, where 1.0 represents high confidence the track is acoustic.

  • instumentalness: Predicts if the track contains vocals or not. The closer its value is to 1.0, the greater likelihood the track contains no vocals. Any value above 0.5 is intended to represent instrumental tracks, but confidence is higher as the value approaches 1.0. "Ohh"s and "aah"s are treated as instrumental in this context.

  • liveness: Detects the presence of an audience in the recording. A value above 0.8 indicates higher probability that the track was performed live.

  • valence: A measure from 0.0 to 1.0 that describes the positiveness conveyed by the track. Tracks with higher valence may trigger positive emotions, such as happiness, cheerfulness and euphoria, while tracks with lower valence may trigger negative emotions, such as sadness, depression and anger.

  • tempo: Tempo in beats per minute (BPM). In musical terminology, tempo is the speed or pace of a given piece and derives directly from the average beat duration.

  • genre: Genre of the track.

Different techniques of data transformation and data treatment were used in order to make it easier to make studies using data visualization with matplotlib, seaborn and plotly libraries.


Kaggle

In order to have a better comprehension and interaction with plotly charts used in this study, please click here to view this notebook on Kaggle.

Author:

Luís Fernando Torres

spotify-top-hits-2000-2019's People

Contributors

luuisotorres avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.