Code Monkey home page Code Monkey logo

amazon_vine_analysis's Introduction

Amazon_Vine_Analysis

Overview of the analysis

Access one of fifty datasets of product reviews, in this case: video games reviews from Amazon Vine program; using PySpark to perform ETL by extracting and transform the data on Google Colaboratory, and then load the transformed data into Amazon RDS instance once connected to Postgres. Whether there is any bias toward favorable reviews from Vine members in the video game review dataset was then determined using PySpark.

  • PySpark, ETL, Amazon RDS, Postgres, pgadmin

Results

How many Vine reviews and non-Vine reviews were there?

  • There were a total of 94 Vine reviews and 40471 non-Vine reviews.

VineYT VineNT

How many Vine reviews were 5 stars? How many non-Vine reviews were 5 stars?

  • 48 Vine reviews that are 5 stars. On the other hand, 15663 non-Vine reviews that are 5 stars. Vine5 VineN5

What percentage of Vine reviews were 5 stars? What percentage of non-Vine reviews were 5 stars?

  • 51.1 percent of Vine reviews were 5 stars while only 38.7 percent of non-Vine reviews were 5 stars. p5p p5up

Summary

There could be bias among this 'star ratings' system since there are more five-star reviews amongst vine reviews than non-vine reviews. However, the number of non-Vine reviews is significantly more than Vine reviews. Additional statistic test such as t-test or ANOVA test could be use to prove bias.

amazon_vine_analysis's People

Contributors

xdarkphoenixes avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.