Code Monkey home page Code Monkey logo

anomaly-detection-pipeline-kedro's Introduction

Building and Managing an Isolation Forest Anomaly Detection Pipeline with Kedro

Overview

Anomaly (fraud) detection pipeline on credit card transaction data using Isolation Forest machine learning model and Kedro framework

Link to article: https://neptune.ai/blog/data-science-pipelines-with-kedro

Objective

Develop a data science pipeline to detect anomalous (fradulent) credit card transactions with the use of:

  • Isolation Forest machine learning model - For unsupervised anomaly detection
  • Kedro - An open-source Python framework for creating reproducible, maintainable, and modular data science code. This framework helps to accelerate data pipelining, enhance data science prototyping, and promote pipeline reproducibility.)

Motivation

  • Explore how unsupervised anomaly detection works, and better understand the concept and implementation of isolation forest
  • Leverage Kedro framework to optimally structure data science pipeline projects

Data

The credit card transaction data is obtained from the collaboration between Worldline and Machine Learning Group. It is a realistic simulation of real-world credit card transactions and has been designed to include complicated fraud detection issues.

General Pipeline Structure

Alt text

Anomaly Detection Pipeline Structure

Alt text

Steps

  1. Change path to project directory in command line - cd C:/Anomaly-Detection-Pipeline-Kedro
  2. Initialize Conda virtual environment (create one if not done so) - conda activate env_kedro
  3. Execute a pipeline run with kedro run

Please see the walkthrough article for details

anomaly-detection-pipeline-kedro's People

Contributors

kennethleungty avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.