Code Monkey home page Code Monkey logo

house-price-prediction's Introduction

House-Price-Prediction

I have tried to make the model as small as possible, at the same time have a score in the acceptable interval.

Data Tidying:

• Removing outliers did not have a positive effect, so I deleted that part to make the model as simple as possible

• Merged both train and test data to be able to find more reliable values for missing values.

• After finding the not-numerical columns, analyzing the reasoning behind being missing, decided to fill them with “None”

• For numerical columns, filled them with mean of the column.

Feature Engineering:

• Created a variable for some features that had correlation (had multi-collinearity)

• Applied log transform on the features having skewness over 0.2 and also on SalePrice (y).

• Using dummy variables to get rid of the categorical data.

• Split data again to test and train set accordingly.

Model Selection:

• Using LASSO model, with alpha 0.0005, based on trial and error, with default 3 fold cross validation.

Kaggle Score = 0.11939

house-price-prediction's People

Contributors

mhmdkhod avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.