air-quality-prediction

Air Quality Prediction of Relative Humidity - Regression

The dataset contains 9358 instances of hourly averaged responses from an array of 5 metal oxide chemical sensors embedded in an Air Quality Chemical Multisensor Device. The device was deployed in the field, at road level, in a significantly polluted area of an Italian city. Data were recorded over one year (March 2004 - February 2005).

Features

| Feature | Description |
|---|---|
| Date | Date (DD/MM/YYYY) |
| Time | Time (HH.MM.SS) |
| CO(GT) | True hourly averaged CO concentration in mg/m^3 (reference analyzer) |
| PT08.S1(CO) | PT08.S1 (tin oxide) hourly averaged sensor response (nominally CO targeted) |
| NMHC(GT) | True hourly averaged overall Non-Methane Hydrocarbons concentration in µg/m^3 (reference analyzer) |
| C6H6(GT) | True hourly averaged benzene concentration in µg/m^3 (reference analyzer) |
| PT08.S2(NMHC) | PT08.S2 (titania) hourly averaged sensor response (nominally NMHC targeted) |
| NOx(GT) | True hourly averaged NOx concentration in ppb (reference analyzer) |
| PT08.S3(NOx) | PT08.S3 (tungsten oxide) hourly averaged sensor response (nominally NOx targeted) |
| NO2(GT) | True hourly averaged NO2 concentration in µg/m^3 (reference analyzer) |
| PT08.S4(NO2) | PT08.S4 (tungsten oxide) hourly averaged sensor response (nominally NO2 targeted) |
| PT08.S5(O3) | PT08.S5 (indium oxide) hourly averaged sensor response (nominally O3 targeted) |
| T | Temperature in °C |
| RH | Relative humidity (%) |
| AH | Absolute humidity |
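
The Date and Time columns use a day-first date and a dot-separated time, which generic parsers can misread. A minimal sketch of combining them into one timestamp with the standard library follows; the helper name `parse_timestamp` and the sample values are illustrative assumptions, not part of the original project.

```python
from datetime import datetime


def parse_timestamp(date_str: str, time_str: str) -> datetime:
    """Combine a DD/MM/YYYY date and an HH.MM.SS time into one datetime.

    Hypothetical helper: the formats come from the feature descriptions above.
    """
    return datetime.strptime(f"{date_str} {time_str}", "%d/%m/%Y %H.%M.%S")


# Illustrative values in the documented formats.
ts = parse_timestamp("10/03/2004", "18.00.00")
print(ts.isoformat())  # 2004-03-10T18:00:00
```

An explicit format string avoids silently swapping day and month on dates such as 04/05/2004.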

Model and hyper-parameters comparison

| Model | Score (Default Parameters) | Score (Grid Search) | Best Parameters |
|---|---|---|---|
| Linear Regression | 88.24 | 88.19 | fit_intercept: True, normalize: True |
| Random Forest | 99.76 | 99.82 | max_features: 'auto', n_estimators: 30 |
| SVM | 96.26 | 94.86 | degree: 3, kernel: rbf |
| Polynomial Regression | 99.24 | 99.20 | fit_intercept: True, normalize: True |
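
A comparison like the one in the table could be set up roughly as follows. This is a sketch under stated assumptions, not the project's actual code: it uses synthetic data in place of the air quality dataset, scikit-learn's default regressor score (R^2) as the metric, and an invented parameter grid. Note that recent scikit-learn releases no longer accept `max_features='auto'` for random forests (removed in 1.3) or `normalize` for `LinearRegression` (removed in 1.2); for a random forest regressor, `max_features=1.0` corresponds to the old `'auto'` behavior.

```python
import numpy as np
from sklearn.ensemble import RandomForestRegressor
from sklearn.model_selection import GridSearchCV, train_test_split

# Synthetic stand-in data (assumption): 4 features, nonlinear target.
rng = np.random.default_rng(0)
X = rng.normal(size=(300, 4))
y = 2.0 * X[:, 0] - X[:, 1] ** 2 + rng.normal(scale=0.1, size=300)

X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.25, random_state=0
)

# Baseline: default hyper-parameters, scored with R^2 on held-out data.
default_model = RandomForestRegressor(random_state=0).fit(X_train, y_train)
default_score = default_model.score(X_test, y_test)

# Grid search over an illustrative grid (values are assumptions).
param_grid = {"n_estimators": [10, 30, 100], "max_features": [1.0, "sqrt"]}
search = GridSearchCV(RandomForestRegressor(random_state=0), param_grid, cv=3)
search.fit(X_train, y_train)
tuned_score = search.score(X_test, y_test)

print(f"default: {100 * default_score:.2f}")
print(f"grid search: {100 * tuned_score:.2f}, best: {search.best_params_}")
```

Scoring both models on the same held-out split keeps the two numbers comparable; the grid search itself only sees the training portion.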

Conclusions

  • Random Forest and Polynomial Regression predicted relative humidity best, both scoring above 99 with default parameters.
  • Grid search brought little or no improvement over the defaults: scores changed by less than 0.1 for Linear Regression, Random Forest, and Polynomial Regression, and the SVM score dropped from 96.26 to 94.86.
  • Removing the variables least correlated with the target did not improve prediction accuracy, so all variables were used.
  • Linear and Polynomial Regression expose few tunable parameters, so grid search contributed very little to their scores.
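
The correlation check behind the third conclusion could look roughly like this. It is a sketch on synthetic data, not the project's analysis: the `noise` column, the coefficients, and the sample size are assumptions made for illustration, and only the column names `T`, `AH`, and `RH` come from the feature list.

```python
import numpy as np
import pandas as pd

# Synthetic stand-in data (assumption): RH depends on T and AH, not on "noise".
rng = np.random.default_rng(42)
n = 500
df = pd.DataFrame(
    {
        "T": rng.normal(18.0, 8.0, n),
        "AH": rng.normal(1.0, 0.4, n),
        "noise": rng.normal(0.0, 1.0, n),  # hypothetical uninformative column
    }
)
df["RH"] = 50.0 - 1.5 * df["T"] + 20.0 * df["AH"] + rng.normal(0.0, 2.0, n)

# Absolute Pearson correlation of each feature with the target, strongest first.
ranking = df.corr()["RH"].drop("RH").abs().sort_values(ascending=False)
print(ranking)
```

Features at the bottom of such a ranking are candidates for removal; whether dropping them helps still has to be confirmed by refitting and rescoring the model, which is the comparison the conclusion refers to.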

Contributors

ishanag9
