Credit-Card-Fraud-Detection-PGDCLOUD-2022-AWS

Cloud Machine Learning (PGDCLOUD_SEP) 2022

Balazs Barcza x19190638
Christoph Kratz x21111898
Wislan Alandes De Lima Arruda x21126151

Abstract—Credit card fraud has been a problem for businesses and financial institutions for decades, and in recent years it has caused billions of dollars in losses annually. Processing the large volume of data generated by financial transactions requires substantial computing resources, and human review cannot assess that many transactions efficiently or quickly enough. Machine learning in the cloud is therefore a natural fit for these challenges. This project covers several aspects of fraudulent transactions and presents models based on supervised learning techniques, namely Decision Tree (DT) and Logistic Regression (LR). It uses the Simulated Credit Card Transactions dataset generated with Sparkov, which simulates 1000 customers transacting with a pool of 800 merchants from 1 Jan 2019 to 31 Dec 2020. The purpose of this study is to predict the likelihood that a transaction is fraudulent using machine learning models and to deploy the models to the cloud. The findings show that the Decision Tree model achieves the best recall and accuracy scores (94%).

Keywords—credit card, fraud, cloud computing, machine learning, Amazon Web Services (AWS)

This project has the following components:

a) IEEE style Paper in PDF format

b) Jupyter Notebook walking through the machine learning tests conducted. You can view and run them yourself. Comments, reasoning, and figures are included. For your convenience, we have included a copy of the original dataset [1] in this git repo; however, please refer to the original source for the most up-to-date version.

Installation

Clone the project:

$ git clone https://github.com/Balays33/Credit-Card-Fraud-Detection-PGDCLOUD-2022-AWS.git

Pip-install dependencies. For example using a virtualenv:

$ virtualenv env && source env/bin/activate && pip install -r requirements.txt

Usage

a) Read the Paper (PDF):

Cloud Machine Learning Report.pdf

b) Run the Jupyter Notebook:

Find the dataset: https://www.kaggle.com/datasets/kartik2112/fraud-detection/code

Generate a balanced dataset using ADASYN resampling (this will take several minutes): $ python app.py

Run the notebook: $ jupyter notebook
