Code Monkey home page Code Monkey logo

hdfc_internship's Introduction

HDFC_Internship

This repository consists of all files related to internship at HDFC Bank. The bank statistics data were taken from official Reserve Bank of India website (https://rbi.org.in/Scripts/Statistics.aspx). The data consists of the Point of Sales Machine transaction and ATM transactions for the years 2018 - 2019 (month of May). The objective of this project work was to analyze the performance of the Indian banking system.

Data was segregated into government and private bank datasets Feature Engineering was implemented and certain statistics were extracted from the dataset: like per transaction value of credit cards for a particular bank

Project Details:

Version: Jupyter Notebook Packages: pandas, numpy, sklearn, matplotlib, seaborn, json

Data Cleaning

After reading the data, I needed to clean it up so that it was usable for further analysis. I made the following changes and created the following variables:

  • Segregated the bank dataset into Private and Government Banks
  • Parsed numeric data out of ATM and POS transactions
  • Made columns for transaction value for CC and DC transactions
  • Removed rows without any transaction values
  • Parsed rating out of company text
  • Used outstanding CC and DC to aggregate transaction values
  • Made columns if transaction levels were higher than the average:
    • Credit Cards: Private and Government
    • Debit Cards: Private and Government

Exploratory Data analysis

Distributions of the data and the value counts for the various categorical variables.Also, data was sorted according to various peaks. Binning Data: Data was binned according to various categories and then the top performing banks were visualized.

Data Visualization

In this step, I created a different jupyter notebook file wherein different visuals of the data are taken into cognizance. It gives an in-depth research of other factors as well which are not present in the primary file such as violin plots, join plots of various categorical features.

hdfc_internship's People

Contributors

amartyasanyal12 avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.