
Credit-Card-Fraud-Detection-

Credit card fraud detection using Logistic Regression on a credit card transaction dataset.

Because this is a binary classification problem (each transaction is either normal or fraudulent), a Logistic Regression model is used for training.

Workflow of model

  • Collection of data
  • Data Preprocessing
  • Splitting test and training data
  • Model Training
  • Model Evaluation
  • Prediction System

Dependencies used:

import numpy as np
import pandas as pd
import matplotlib.pyplot as plt
import seaborn as sns
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import train_test_split
from sklearn.metrics import accuracy_score

# importing data
transaction_dataset = pd.read_csv("/content/drive/MyDrive/google_collab/creditcard.csv")
transaction_dataset.head(10)

Data analysis

  • shape
  • info()
  • describe()
  • isnull()
  • value_counts()
  • dtypes
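
The checks listed above might look like the following. This is a minimal sketch on a small stand-in frame: `demo` and its values are made up for illustration, and with the real data the same calls would be made on `transaction_dataset`.

```python
import pandas as pd

# Hypothetical stand-in for transaction_dataset (values are made up).
demo = pd.DataFrame({
    "Time": [0.0, 1.0, 2.0, 3.0],
    "Amount": [149.62, 2.69, 378.66, 123.50],
    "Class": [0, 0, 1, 0],
})

print(demo.shape)                    # (rows, columns)
demo.info()                          # column dtypes and non-null counts
print(demo.describe())               # summary statistics per numeric column
print(demo.isnull().sum())           # missing values per column
print(demo["Class"].value_counts())  # number of transactions per class
print(demo.dtypes)                   # dtype of each column

# Keep two of the results around for later inspection.
class_counts = demo["Class"].value_counts()
missing_per_column = demo.isnull().sum()
```

On the real dataset, `value_counts()` on the `Class` column is the quickest way to see how heavily the classes are imbalanced.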

Sampling

  • 0 : Normal transaction
  • 1 : Fraudulent transaction

# separate the two classes using the Class column
legit = transaction_dataset[transaction_dataset.Class == 0]
fraud = transaction_dataset[transaction_dataset.Class == 1]

Comparing the samples

# mean of every feature for normal vs. fraudulent transactions
transaction_dataset.groupby('Class').mean()

Under-Sampling

  • Build a sample dataset with a similar number of normal and fraudulent transactions.
  • The number of fraudulent transactions is 492.
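
The later steps use a frame named `new_transaction_dataset2`, but the README does not show how it is built. One plausible way, sketched here as an assumption rather than the author's exact code, is to draw as many normal rows at random as there are fraud rows and stack the two groups. The synthetic data, the `legit_sample` name and the `random_state` value are all illustrative.

```python
import numpy as np
import pandas as pd

# Synthetic stand-in for transaction_dataset: 1000 normal rows, 20 fraud rows.
rng = np.random.default_rng(0)
transaction_dataset = pd.DataFrame({
    "Amount": rng.uniform(1, 500, size=1020),
    "Class": [0] * 1000 + [1] * 20,
})

legit = transaction_dataset[transaction_dataset.Class == 0]
fraud = transaction_dataset[transaction_dataset.Class == 1]

# Draw as many normal rows as there are fraud rows
# (that would be 492 with the real dataset).
legit_sample = legit.sample(n=len(fraud), random_state=2)

# Stack the two groups into one balanced frame.
new_transaction_dataset2 = pd.concat([legit_sample, fraud], axis=0)

print(new_transaction_dataset2["Class"].value_counts())
```

Under-sampling discards most of the normal transactions, so the balanced frame is much smaller than the original one.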

Visualization of data

# figure size in inches, close to a 16:9 ratio
plt.figure(figsize = (20,11))

# annot = True writes each correlation value inside its cell
sns.heatmap(new_transaction_dataset2.corr(), annot = True, cmap = 'coolwarm')

plt.title("Heatmap of the correlation matrix for credit card data", fontsize = 22)
plt.show()

Splitting data (features and target)

# features: every column except the target
X = new_transaction_dataset2.drop(columns = 'Class')
# target: 0 for normal, 1 for fraudulent
Y = new_transaction_dataset2['Class']

Splitting into training and test

X_train, X_test, Y_train, Y_test = train_test_split(X,Y, test_size = 0.2, stratify = Y, random_state = 2)
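
To see what `stratify = Y` does, the class share can be compared across the two splits. The sketch below uses a made-up balanced frame of 100 rows; the column names and values are assumptions for illustration only.

```python
import numpy as np
import pandas as pd
from sklearn.model_selection import train_test_split

# Made-up balanced data: 50 normal rows and 50 fraud rows.
rng = np.random.default_rng(0)
X = pd.DataFrame({
    "V1": rng.normal(size=100),
    "Amount": rng.uniform(1, 500, size=100),
})
Y = pd.Series([0] * 50 + [1] * 50, name="Class")

X_train, X_test, Y_train, Y_test = train_test_split(
    X, Y, test_size=0.2, stratify=Y, random_state=2
)

# Row counts of the full data and of each split.
print(X.shape, X_train.shape, X_test.shape)

# Share of fraud rows in each split; stratification keeps them equal.
print(Y_train.mean(), Y_test.mean())
```

Without stratification, a small test set could end up with noticeably more rows of one class than the other.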

Model Training

model = LogisticRegression()
model.fit(X_train, Y_train)

Model Evaluation

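training_data_accuracy = accuracy_score(Y_train, model.predict(X_train))
test_data_accuracy = accuracy_score(Y_test, model.predict(X_test))
print("\nAccuracy on Training data ", training_data_accuracy, "\n")
print("Accuracy on Test data ", test_data_accuracy)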
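
Prediction System

The workflow lists a prediction system, but the README does not show one. A minimal sketch of how a trained model could label a single transaction follows. The synthetic training data, the two feature names and the `predict_transaction` helper are assumptions made for illustration, not the author's code.

```python
import numpy as np
import pandas as pd
from sklearn.linear_model import LogisticRegression

# Synthetic, clearly separated training data (illustration only).
rng = np.random.default_rng(0)
X_train = pd.DataFrame({
    "V1": np.concatenate([rng.uniform(-2, -1, 100), rng.uniform(1, 2, 100)]),
    "V2": rng.normal(size=200),
})
Y_train = pd.Series([0] * 100 + [1] * 100, name="Class")

model = LogisticRegression()
model.fit(X_train, Y_train)

def predict_transaction(features):
    """Return a readable label for one transaction given as a dict of feature values."""
    # One-row frame with the same column order the model was trained on.
    row = pd.DataFrame([features], columns=X_train.columns)
    label = model.predict(row)[0]
    return "Fraudulent transaction" if label == 1 else "Normal transaction"

print(predict_transaction({"V1": 1.5, "V2": 0.0}))
print(predict_transaction({"V1": -1.5, "V2": 0.0}))
```

With the real model, the dict would need one entry per feature column in `X`, in any order, since the helper reorders them to match the training columns.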


Contributor : Ankit Nainwal

