Code Monkey home page Code Monkey logo

housing-price-prediction-in-melbourne's Introduction

Exploratory-Data-Analysis-For-Housing-Price-Prediction-in-Melbourne

Description: This repository presents an exploratory data analysis (EDA) conducted on the Melbourne housing dataset sourced from Kaggle. The analysis aims to determine the optimal parameters for predicting housing unit prices in Melbourne, leveraging techniques in regression analysis and statistical testing.

Key Highlights:

  • Dataset Overview: The dataset consists of 13,580 rows and 21 columns, with four main variables under consideration: Price (in Australian dollars), Distance (from central business district), Propertycount (number of properties in the suburb), and Landsize (in meters).

  • Data Cleaning and Preparation: The data underwent preprocessing, including outlier removal using the Interquartile Range (IQR) method and subsequent transformations for better fit, such as log transformation on Price and Landsize, and square root transformation on Propertycount and Distance.

  • Data Analysis: A linear regression model was employed to analyze the relationship between housing prices and key predictor variables such as distance, property count, and land size. Additionally, variable selection techniques were utilized to identify the most influential parameters.

  • Model Evaluation: Assumptions including multicollinearity, independence of residuals, and normal distribution of residuals were validated. The all-subsets method was utilized for variable selection, leading to the identification of the best-fit model. Additionally, comparison among different models was conducted using ANOVA and AIC, with Model2 (incorporating all three predictor variables) identified as the superior model.

  • Conclusion: The analysis culminates in the identification of a robust linear model for predicting housing prices in Melbourne. While the model meets assumptions and provides valuable insights, there remains potential for improvement through the incorporation of additional variables.

Dataset Citation:

  • Kaggle. "Melbourne Housing Snapshot." Available at: Dataset Link

housing-price-prediction-in-melbourne's People

Contributors

rohanarora03 avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.