Movie Stats Data Analysis

Data analysis of movies listed on Naver Movie, to find out whether movie review comments influence a movie's performance in the South Korean market compared to other global movie markets.

Python License: AGPL v3

Websites Used

Motivation

The movie industry is huge. However, a movie can make a large profit in one country while doing poorly in others. There are many reasons for this, such as culture, timing, and population. Of these, I wanted to test whether movie review comments have any effect on a movie's profit in a specific country. Because I only wanted to measure the impact of a movie review platform, I controlled for other factors that could affect the experiment, such as population, by dividing a movie's total profit in a country by that country's population. I also compared South Korea only with G20 and OECD countries, in order to exclude data from developing countries.

Conclusion

The Naive Bayes classifier reached an accuracy of 68.59%, which I take to mean that the Naver Movie review platform did not impact a movie's profit. The same may hold for other countries as well. However, this experiment has many uncertainties, so further experiments could dig deeper and improve on this result.

How it Works

0️⃣ Dependency Installation

pip install -r requirements.txt

1️⃣ Scrape Movie Statistics

python scrape_movie_stats.py

2️⃣ Preprocess Movie Statistics

Analysis Record

  • Movie Name
  • South Korea's Ratio
  • Other Countries' Ratio
  • Result (1 or 0)

South Korean Ratio Formula

  • x = South Korea's profit
  • y = South Korea's population

Other Countries' Ratio Formula

  • x = Other countries' profit
  • y = Other countries' population
  • n = Total number of other countries

python preprocess_movie_stats.py
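
The formulas themselves are not reproduced in this text, so the following is only a minimal sketch of how an analysis record might be computed from the variables listed above. It assumes that each ratio is profit divided by population (x / y, as described in the Motivation), that the other countries' ratio is the mean of the per-country ratios over the n countries, and that the result is 1 when South Korea's ratio is the larger one. The names `AnalysisRecord`, `per_capita_ratio`, `other_countries_ratio`, and `build_record` are hypothetical and are not taken from `preprocess_movie_stats.py`.

```python
from dataclasses import dataclass
from typing import Sequence, Tuple


@dataclass
class AnalysisRecord:
    """One row of the analysis record described above (hypothetical layout)."""
    movie_name: str
    south_korea_ratio: float
    other_countries_ratio: float
    result: int  # 1 or 0


def per_capita_ratio(profit: float, population: float) -> float:
    """Assumed ratio: profit (x) divided by population (y)."""
    if population <= 0:
        raise ValueError("population must be positive")
    return profit / population


def other_countries_ratio(countries: Sequence[Tuple[float, float]]) -> float:
    """Assumed formula: mean of x / y over the n other countries.

    Each item in `countries` is a (profit, population) pair.
    """
    n = len(countries)
    if n == 0:
        raise ValueError("at least one other country is required")
    return sum(per_capita_ratio(x, y) for x, y in countries) / n


def build_record(
    movie_name: str,
    kr_profit: float,
    kr_population: float,
    others: Sequence[Tuple[float, float]],
) -> AnalysisRecord:
    """Build one record; result is 1 if South Korea's ratio is larger (assumption)."""
    kr_ratio = per_capita_ratio(kr_profit, kr_population)
    other_ratio = other_countries_ratio(others)
    result = 1 if kr_ratio > other_ratio else 0
    return AnalysisRecord(movie_name, kr_ratio, other_ratio, result)
```

If the project actually pools profit and population across countries before dividing, or labels the result differently, only `other_countries_ratio` and the comparison in `build_record` would need to change.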

3️⃣ Scrape Movie Comments

python scrape_movie_comments.py

4️⃣ Train & Test

python train_test.py
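
For readers unfamiliar with the classifier mentioned in the Conclusion, here is a minimal, standard-library sketch of a multinomial Naive Bayes text classifier with Laplace smoothing, plus an accuracy helper. It is not the code in `train_test.py`: the class `NaiveBayes`, the functions `tokenize` and `accuracy`, and the whitespace tokenization are all assumptions made for illustration (real Korean comments would likely need a proper tokenizer).

```python
import math
from collections import Counter, defaultdict
from typing import Dict, Hashable, List, Optional, Sequence, Set


def tokenize(text: str) -> List[str]:
    # Simplification: split on whitespace. This is an assumption, not the
    # project's actual preprocessing.
    return text.lower().split()


class NaiveBayes:
    """Multinomial Naive Bayes with add-alpha (Laplace) smoothing."""

    def __init__(self, alpha: float = 1.0) -> None:
        self.alpha = alpha
        self.class_counts: Counter = Counter()
        self.word_counts: Dict[Hashable, Counter] = defaultdict(Counter)
        self.vocab: Set[str] = set()

    def fit(self, texts: Sequence[str], labels: Sequence[Hashable]) -> None:
        # Count documents per class and word occurrences per class.
        for text, label in zip(texts, labels):
            self.class_counts[label] += 1
            for token in tokenize(text):
                self.word_counts[label][token] += 1
                self.vocab.add(token)

    def predict(self, text: str) -> Optional[Hashable]:
        total_docs = sum(self.class_counts.values())
        tokens = [t for t in tokenize(text) if t in self.vocab]  # skip unseen words
        best_label, best_score = None, -math.inf
        for label, doc_count in self.class_counts.items():
            # log prior + sum of smoothed log likelihoods
            score = math.log(doc_count / total_docs)
            total_words = sum(self.word_counts[label].values())
            denom = total_words + self.alpha * len(self.vocab)
            for token in tokens:
                score += math.log((self.word_counts[label][token] + self.alpha) / denom)
            if score > best_score:
                best_label, best_score = label, score
        return best_label


def accuracy(model: NaiveBayes, texts: Sequence[str], labels: Sequence[Hashable]) -> float:
    """Fraction of texts whose predicted label matches the true label."""
    if not labels:
        raise ValueError("cannot compute accuracy on an empty test set")
    correct = sum(model.predict(t) == y for t, y in zip(texts, labels))
    return correct / len(labels)
```

In this setting the texts would be the scraped comments and the labels would be the 1 or 0 result from the preprocessing step; the model is fitted on a training split and `accuracy` is evaluated on a held-out test split.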

Contributors

  • nogamsung
