Code Monkey home page Code Monkey logo

project_2's Introduction

Project_2

Multi-labels classification

Python notebook created on October 2018.

The business problem for this example is about predicting problems caused by component failures such that the question "What is the probability that a machine will fail in the near future due to a failure of a certain component?" can be answered. The problem is formatted as a multi-class classification problem and a machine learning algorithm is used to create the predictive model that learns from historical data collected from machines. In the following sections, we go through the steps of implementing such a model which are feature engineering, label construction, training and evaluation.

Common data sources for predictive maintenance problems are :

  • Failure history: The failure history of a machine or component within the machine.
  • Maintenance history: The repair history of a machine, e.g. error codes, previous maintenance activities or component replacements.
  • Machine conditions and usage: The operating conditions of a machine e.g. data collected from sensors.
  • Machine features: The features of a machine, e.g. engine size, make and model, location.
  • Operator features: The features of the operator, e.g. gender, past experience

The data for this example comes from 4 different sources which are real-time telemetry data collected from machines, error messages, historical maintenance records that include failures and machine information such as type and age. Datasets are grouped as following:

  • Fault history: The fault history of a machine or a component inside the machine.
  • Maintenance history: The repair history of a machine, for example, error codes, previous maintenance activities or replacement of components.
  • Conditions and use of the machines: The operating conditions of a machine, for example, data collected from the sensors.
  • Characteristics of the machines: The characteristics of a machine, for example, engine size, make and model, age.

Required Python libraries

  • pandas
  • numpy
  • statsmodels
  • scikit-learn
  • scikitplot
  • matplotlib
  • seaborn
  • xgboost
  • itertools

project_2's People

Contributors

icarus-kch avatar

Forkers

gmnandan

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.