Code Monkey home page Code Monkey logo

predicting-credit-risk-supervised-ml's Introduction

Predicting Credit Risk with Supervised Machine Learning

             _._._                       _._._
            _|   |_                     _|   |_
            | ... |_._._._._._._._._._._| ... |
            | ||| |  o NATIONAL BANK o  | ||| |
            | """ |  """    """    """  | """ |
       ())  |[-|-]| [-|-]  [-|-]  [-|-] |[-|-]|  ())
      (())) |     |---------------------|     | (()))
     (())())| """ |  """    """    """  | """ |(())())
     (()))()|[-|-]|  :::   .-"-.   :::  |[-|-]|(()))()
     ()))(()|     | |~|~|  |_|_|  |~|~| |     |()))(()
        ||  |_____|_|_|_|__|_|_|__|_|_|_|_____|  ||
    ~ ~^^ @@@@@@@@@@@@@@/=======\@@@@@@@@@@@@@@ ^^~ ~
        ^~^~                                ~^~^

Tools, Languages, & Libraries Utilized
  • Python
  • Pandas
  • Pathlib
  • Sklearn - LogisticRegression, RandomForestClassifier
  • VS Code
  • Jupyter Notebook

  • A machine learning model that attempts to predict whether a loan from LendingClub will become high risk or not.

    Credit Risk Evaluator

    Which model will be better? LogisticRegression or RandomForestClassifier?

    Predictions:

    I predict that the RandomForestClassifier will be a better model due to the data set containing several columns of categorical data.

    Thoughts before scaling data

    Comparisons from unscaled data:

    The logistic regression model seems to be performing better on the unscaled data. The random forest classifier is performing poorly, possibly due to overfitting.

    Predictions for scaled data:

    I think the performance of the unscaled data is too poor in the random forest classifier to overcome the deficit of the logistical regression model. I predict both models will perform better after scaling.

    Results

    Comparisons from scaled data:

    It seems the prediction was incorrect due to random forest classifier actually performing worse after scaling. The logistic regression model performed significantly better after scaling.

    Results for scaled data:

    The logistic regression model seems to be the way to go meaning my prediction was incorrect. The data set could possibly use some more pre-processing to achieve better results.

    predicting-credit-risk-supervised-ml's People

    Contributors

    rb25s13 avatar

    Watchers

     avatar

    Recommend Projects

    • React photo React

      A declarative, efficient, and flexible JavaScript library for building user interfaces.

    • Vue.js photo Vue.js

      ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

    • Typescript photo Typescript

      TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

    • TensorFlow photo TensorFlow

      An Open Source Machine Learning Framework for Everyone

    • Django photo Django

      The Web framework for perfectionists with deadlines.

    • D3 photo D3

      Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

    Recommend Topics

    • javascript

      JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

    • web

      Some thing interesting about web. New door for the world.

    • server

      A server is a program made to process requests and deliver data to clients.

    • Machine learning

      Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

    • Game

      Some thing interesting about game, make everyone happy.

    Recommend Org

    • Facebook photo Facebook

      We are working to build community through open source technology. NB: members must have two-factor auth.

    • Microsoft photo Microsoft

      Open source projects and samples from Microsoft.

    • Google photo Google

      Google โค๏ธ Open Source for everyone.

    • D3 photo D3

      Data-Driven Documents codes.