Code Monkey home page Code Monkey logo

netflixanalysis's Introduction

📊🎬 Netflix Movie Analysis (2008-2021) 🎥📈


Introduction

Netflix is an American subscription video on-demand over-the-top streaming service owned and operated by Netflix, Inc.

Problem Statement

This repository will contain projects and analysis of Data In Moion Challenges

  1. Is there any missing data? Deal with them accordingly.
  2. Using the ‘date_added’ column, a new column called ‘year_added’ only has the year the title was added.
  3. Using the ‘date_added’ column, create a new column called ‘month_added’ that only has the month the title was added.
  4. Check the data types. Does anything look odd? Adjust accordingly.
  5. What is the most popular release year for movies on Netflix?
  6. In what year did Netflix add the most content to its platform?
  7. What is the movie with the longest title in the dataset?
  8. What are the top 5 most popular movie genres?
  9. Create a pie chart visualizing the proportion of movies vs TV shows. Label each section with the percentage.
  10. Create a dashboard to summarize your insights.

Data Sourcing

The data was sourced directly from the web, click here to download

Data Transformation

The dataset imported from the web obviously needs to be cleaned and transformed. Check out a video I created where I carried out this transformation using the power query feature of power Bi.

[See screenshot below after transformation]

The Transformed data Applied Steps

I also cleaned this data using jupyter notebook,see the image below.

Data Modelling

Here I created a calendar Table and used it to establish a relationship between both Tables. Netflix_data Table

The Netflix_data Table Calendar Table

Find the modelled table below.

Data Visualization

Dashboard

Insights/Recommendations

Insight:

Based on the analysis of the Netflix dataset from 2008 to 2021, several key insights have been uncovered. Firstly, 2019 stood out as the year with the highest number of content additions, indicating a significant expansion of the platform. Additionally, 2019 was the most popular year for movies on Netflix, showcasing a diverse range of cinematic experiences. The top five movie genres on the platform were identified as dramas, comedies, action & adventure, children & family, and documentaries. Furthermore, the documentary "Jim & Andy" emerged as the movie with the longest title. Lastly, movies dominated the content on Netflix, accounting for 97.24%, while TV shows made up the remaining 2.76%.

Recommendation:

  • Balance Between Movies and TV Shows: Despite the fact that Netflix is dominated by movies, it is still important to have a balanced collection of TV shows. Continuously curating a diverse range of high-quality TV shows will attract subscribers who prefer episodic storytelling and streaming experiences.
  • Diverse Content Strategy: Given the popularity of dramas, comedies, action & adventure, kids & family, and documentaries, Netflix should continue investing in a wide variety of content in these genres. This will enable them to better serve their subscribers' diverse interests and preferences.
  • Focus on Yearly Expansion: Building on the success of 2019 as the year of content, Netflix should continue prioritizing regular and substantial additions to their library each year. This will enhance subscriber engagement and satisfaction, providing them with a vast array of options.

By leveraging data-driven insights, Netflix can continue providing captivating and relevant content to their subscribers while staying ahead in the competitive streaming industry.

click here to interact with my report on powerBi Service.

netflixanalysis's People

Contributors

omobacoder avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.