Data Analyst skilled in SQL, Tableau, PowerBI, Machine Learning, and Python
Project 1 : Bank-churn-prediction-app
- Customer churn (or customer attrition) is a tendency of customers to abandon a brand and stop being a paying client of a particular business.
- I used a supervised machine learning classification approach to solve this problem and based on the number of target classes I built a binary classifier type of Machine learning model.
- Data Source: Github Columns: 14 Rows: 10000
- Using LogisticRegression ML Estimator, a bank can measure the likelihood of customer churn with an accuracy score of 0.813(81.3%)
- Cross Validation had an accuracy of 80.73%
- Comparing the logistic Regression model to:
- Decision Tree Classifier
- Random Forest Classifier
- Support Vector Machine
- K nearest Classifier
- naive_bayes
- LR F1-score 0.5958600508740877
- DT F1-score 0.6895821798155766
- RF F1-score 0.7513784461152883
- SVM F1-score 0.5210022107590273
- NB F1-score 0.6412824619876383
- KNN F1-score 0.6245016923566131
- Random Forest Classifier model performed well compared to other models
- To improve the accuracy of the Random Forest model I used RandomSearchCv to tune the hyperparameters:
- Hence the randomised search cv on random forest classifier gave us better accuracy which is 86.25% and a std of 0.99% and wrong predictions made by the model are 374/2000
- Bank Churn Prediction Data App
- Companies deal with huge amounts of data every day, from emails, chats, and social media posts, to customer support tickets, and survey responses. However, 80% of this data is unstructured and, therefore, hard to process โ not to mention time-consuming if done manually.
- Text analysis app can be used to visualize the unstructured data
- Development Dependencies:
- Python
- Streamlit
- Natural Language ToolKit
- Unstructured data can deliver countless insights that help you make informed, data-driven decisions that can improve the customer experience
Project 3: Covid-19 Dashboard Using Tableau
- I created an interactive Dashboard to visualize and track the current trend of covid-19 Cases
- Dataset: Our World in Data
Project 4 : SalesReport Using PowerBI
- I created an interactive Sales Dashboard to visualize Sales performance over the years
- Dataset: Github
- I used Data analysis Expression Language(DAX) to develop Revenue and Sales Quantity Measures
- I Used the Star Schema concept to link tables