Code Monkey home page Code Monkey logo

elvis.koech's Introduction

Elvis.koech Portfolio

Data Analyst skilled in SQL, Tableau, PowerBI, Machine Learning, and Python

Project 1 : Bank-churn-prediction-app

  • Customer churn (or customer attrition) is a tendency of customers to abandon a brand and stop being a paying client of a particular business.
  • I used a supervised machine learning classification approach to solve this problem and based on the number of target classes I built a binary classifier type of Machine learning model.
  • Data Source: Github Columns: 14 Rows: 10000
  • Using LogisticRegression ML Estimator, a bank can measure the likelihood of customer churn with an accuracy score of 0.813(81.3%)

Evaluating the result/metrics

  • Cross Validation had an accuracy of 80.73%
  • Comparing the logistic Regression model to:
  • Decision Tree Classifier
  • Random Forest Classifier
  • Support Vector Machine
  • K nearest Classifier
  • naive_bayes

Using Classification Report to determine F1_Score of different models:

  • LR F1-score 0.5958600508740877
  • DT F1-score 0.6895821798155766
  • RF F1-score 0.7513784461152883
  • SVM F1-score 0.5210022107590273
  • NB F1-score 0.6412824619876383
  • KNN F1-score 0.6245016923566131
  • Random Forest Classifier model performed well compared to other models
  • To improve the accuracy of the Random Forest model I used RandomSearchCv to tune the hyperparameters:
  • Hence the randomised search cv on random forest classifier gave us better accuracy which is 86.25% and a std of 0.99% and wrong predictions made by the model are 374/2000
  • Bank Churn Prediction Data App

Project 2: Text Analysis App

  • Companies deal with huge amounts of data every day, from emails, chats, and social media posts, to customer support tickets, and survey responses. However, 80% of this data is unstructured and, therefore, hard to process โ€“ not to mention time-consuming if done manually.
  • Text analysis app can be used to visualize the unstructured data
  • Development Dependencies:
    • Python
    • Streamlit
    • Natural Language ToolKit
  • Unstructured data can deliver countless insights that help you make informed, data-driven decisions that can improve the customer experience

Project 3: Covid-19 Dashboard Using Tableau

  • I created an interactive Dashboard to visualize and track the current trend of covid-19 Cases
  • Dataset: Our World in Data

Project 4 : SalesReport Using PowerBI

  • I created an interactive Sales Dashboard to visualize Sales performance over the years
  • Dataset: Github
  • I used Data analysis Expression Language(DAX) to develop Revenue and Sales Quantity Measures
  • I Used the Star Schema concept to link tables

elvis.koech's People

Contributors

elviskoech avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.