Code Monkey home page Code Monkey logo

abrar2652 / ieee-cis-fraud-detection-project Goto Github PK

View Code? Open in Web Editor NEW
0.0 1.0 0.0 11.48 MB

This is the first project to be completed in Upskill ISA Intelligent Machines. The project was done after the end of the competition. The XGBClassifier used in this model obtained 0.950844 public scores on Kaggle.

License: Apache License 2.0

Jupyter Notebook 100.00%
machine-learning xgbclassifier matplotlib pandas numpy data-science data-visualization

ieee-cis-fraud-detection-project's Introduction

IEEE-CIS-Fraud-Detection-Project

Description

This project is a part of the Machine Learning Course provided by Upskill Income Sharing Agreement program with Intelligent Machines. An already ended competition dataset has been selected as this project where different machine learning models were benchmarked. The data contains real-world e-commerce transactions from Vesta. It contains a wide range of features from device type to product features. The competitors were to develop a machine learning model to predict if the transaction is fraud or not fraud. This project targets to improve the efficacy of fraudulent transaction alerts for millions of people around the world, helping hundreds of thousands of businesses reduce their fraud loss and increase their revenue

Important Links

Dataset

Getting Started

The main challenge of this project is the gigantic amount of features and it's difficult to remove the unnecessary features where we don't know which factors to consider while choosing features. Training the machine learning models on these all features will waste a lot of time and obviously won't obtain better score. The main starting point should be data exploration, data cleaning, dealing with the null values, feature engineering.

Dependencies

Programming language: Python

Libraries: NumPy, Pandas, Matplotlib, Seaborn, scikit-learn, XGBClassifier

Environment: Kaggle Notebook

Executing program

Help

If you face difficulties running the model on your local machine or Google Colab Notebook, then check if you are running the Kernel on CPU or GPU. If you're running on CPU, change the runtime to GPU. I ran this notebook with 4 GB RAM, 2.4 GHz Intel(R) Core(TM) i3 CPU. I faced a lot of difficulties including sudden shutdown due to overheating, running out of my resources, etc. Kaggle environment worked well for me.

Authors

Md. Abrar Jahin

LinkedIn

License

This project is licensed under the [Apache License 2.0] License - see the LICENSE.md file for details

Acknowledgments

StackOverflow, Towards Data Science articles, Data Exploration and Feature Engineering Techniques of Kaggle Grandmasters, DataCamp

ieee-cis-fraud-detection-project's People

Contributors

abrar2652 avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.