Code Monkey home page Code Monkey logo

google-cert-capstone's Introduction

Google Advacned Data Analytics Capstone Project

What I did

The certificate program has tasked me with analyzing the data to come up with ideas for how to increase employee retention. To help with this, they would like me to design a model that predicts whether an employee will leave the company based on their department, number of projects, average monthly hours, and any other data points me deem helpful.

The steps I took were:

  • Setup reproducible and versionable python environment, something I learned from my software engineering background
  • Familiarize myself with the dataset
  • Conduct Exploratory Data Analysis
  • Gather descriptive statistics (mean, std, quantiles)
  • Cleanup missing and duplicate data
  • Check for outliers
  • Further examine various relationship between features thru visualization
  • Train 3 models using Logistic Regression, DecisionTree and RandomForest
  • Compare the performance of the models
  • Report my findings

Result and Work

see main.ipynb file.

About Dataset

sourced from https://www.kaggle.com/datasets/mfaisalqureshi/hr-analytics-and-job-prediction?select=HR_comma_sep.csv

Context

Hr Data Analytics This dataset contains information about employees who worked in a fictitious company.

Data Dictionary

Variable Description
satisfaction_level Employee-reported job satisfaction level [0โ€“1]
last_evaluation Score of employee's last performance review [0โ€“1]
number_project Number of projects employee contributes to
average_monthly_hours Average number of hours employee worked per month
time_spend_company How long the employee has been with the company (years)
Work_accident Whether or not the employee experienced an accident while at work
left Whether or not the employee left the company
promotion_last_5years Whether or not the employee was promoted in the last 5 years
Department The employee's department
salary The employee's salary (U.S. dollars)

google-cert-capstone's People

Watchers

Geraldi Sutanto avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.