Code Monkey home page Code Monkey logo

turtlegames's Introduction

Project Summary:

Turtle Games: Customer Segmentation & Sales Optimisation- 98% final grade.

Project Focus: In my third 8 week assignment for LSE my objective was to investigate customer loyalty and optimise sales performance for Turtle Games,  a global gaming retailer and manufacturer.  I used a comprehensive suite of data analytics techniques with a real focus on several new skills. In particular this assignment really cemented a significant passion for Natural Language Processing.

For this assignment I distilled the context down to one business question How can Turtle Games use data to enhance customer loyalty and sales? Then the core business question is split into two parts:

  • Part 1: Who are Turtle Games' customers and what are their habits?
    • Customer Insights
    • Customer Segmentation
    • Customer Feedback
  • Part 2: What do sales data reveal about market trends and performance?
    • Platform Performance
    • Sales Trends & Predictive Modelling

Analytical Approach Highlights:

  • Customer Segmentation: Leveraging Python and R, I performed multiple linear regression (MLR), K-means clustering, and natural language processing (NLP) on Turtle Games' customer base data. This revealed factors influencing customer loyalty points and facilitated the segmentation of customers into distinct spending behaviour groups for targeted marketing strategies.

    • K means clustering of two variables that showed promising correlation

    • image

    • Group 1 - 774 observations 39% - A signifcant amount of total observations that operates around the middle of both variables suggesting arguably the strongest market segment to target. Worth considering other factors to flesh out. This is a high priority and the bread and butter of sales. Worth noting that the income range of this group seems to fit the mean of 48 well. The spending_score range fits the mean of 50 well too.

    • Group 0 - 356 observations 18% - Group 0 offers the golden goose of high income earners who also have a high spending_score. Even though they amount to jsut under half group of group 1 they are spending a relitively high amount so are still a relevant segment who are actively engaging and should be proritised. This is a high priority.

    • Group 2 - 330 observations 17% - Group 2 shows a segement that has potential to earn a higher spending score with more income available. As such, marketing needs to understand and address why this group isn't achiveing it's spending score potential. This is a high priorty to turn those Group 2 customers into Group 3.

    • Group 4 - 271 observations 14% - Group 4 show the least potential of all groups and also only contribute 14% of total observations. While there is always potential Group 4 should be seen as a low priority

    • Group 3 - 269 observations 13% - Group 3 show great potential again and have proven loyalty through their spending score in spite of low income. It is worth considering other factors in Group 3's income to see if there is any future benefit to targeting them. For instance if Group 3 doesn't earn much income because of Age being a contributing factor, working on retaining that base as customers and rewarding loyalty will only serve as positive if they grow older and their salaries grow with them.

  • A heatmap of customer information variables

  • image

  • Sales Data Exploration: A thorough analysis of sales data, visualised through R, unearthed critical insights into platform performance and sales trends. This identified which gaming platforms contributed most significantly to sales, providing a clear direction for Turtle Games to focus inventory and marketing efforts.

  • Predictive Modelling for Strategic Forecasting: MLR models were built to accurately predict future sales trends based on regional sales data. This not only showcased the strong relationship between regional and global sales but also equipped Turtle Games with the tools for strategic resource allocation and sales strategy optimisation.

    • A graphic made in R exploring national sales data compared to global sales data.
    • Rplot01

Key Insights and Recommendations:

  • I curated a robust predictive model with 97% accuracy, linking regional sales to global sales, underscoring the significant predictive power of NA and EU market performance on global outcomes, suggesting strategic focus areas for market investment and resource allocation. I also created an interactive Shiny App interface, enabling stakeholders to intuitively utilise the predictive model for strategic planning without technical expertise, enhancing decision-making efficiency.

    • Please explore the interactive Plotly plot in the files above exploring my MLR model results that is referenced in the video later on.
  • I identified and addressed data challenges including heteroscedasticity and multicollinearity, emphasising the need for precise data handling, further refined data and advanced statistical techniques to refine predictive accuracy.

  • I Highlighted the critical role of outliers in skewing sales data, necessitating sophisticated outlier management strategies to ensure model reliability and real-world applicability.

  • I discovered key insights into customer feedback using techniques like word clouds. While the report showed an overall positive sentiment, attention needs to be paid to harnessing recommendations from positive/neutral feedback. Also encouraged investing in multilingual NLP tools and proactively investigating negative customer feedback.

    • A word cloud from normalised positive reviews of products using Vader as the model.
    • WordCloud
  • I noted the evolving market dynamics for regions. For instance PlayStation, transitioning from a strong North American preference (41% market share for the original PlayStation compared to Europe's 31%) to a dominant position in Europe for the PlayStation V (45% of sales in Europe and 42% for the rest of the world, compared to North America's 13%).

    • Download the interactive stacked bar chart from the file above that is referenced in the presentation video at the end.

Professional Development and Project Impact:

This project served as a deep dive into the intersection of data analytics and strategic business decision-making. The wide variety of skills and tools used (Python, R, MLR, K-means clustering, and NLP) really allowed me to understand and consolidate several key areas of analysis.

turtlegames's People

Contributors

wburto avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.