Code Monkey home page Code Monkey logo

anujm09 / seoul-bike-sharing-demand-prediction Goto Github PK

View Code? Open in Web Editor NEW
3.0 1.0 0.0 8.04 MB

This study demonstrates how numerous factors have an impact on bike rentals. Due to our understanding that many Koreans hire bikes throughout the week, we assumed that most of their use is for commuting to work or school. The number of rentals varies depending on a number of factors, including the day of the week, the hour of the day.

Jupyter Notebook 100.00%
descision-tree hyperparameter-tuning linear-regression machine-learning random-forest regression-models

seoul-bike-sharing-demand-prediction's Introduction

Seoul-Bike-Sharing-Demand-Prediction

Problem Description:

Currently Rental bikes are introduced in many urban cities for the enhancement of mobility comfort. It is important to make the rental bike available and accessible to the public at the right time as it lessens the waiting time. Eventually, providing the city with a stable supply of rental bikes becomes a major concern. The crucial part is the prediction of bike count required at each hour for the stable supply of rental bikes.

Data Description

Date : year-month-day

Rented Bike count - Count of bikes rented at each hour

Hour - Hour of the day

Temperature-Temperature in Celsius

Humidity - %

Windspeed - m/s

Visibility - 10m

Dew point temperature - Celsius

Solar radiation - MJ/m2

Rainfall - mm

Snowfall - cm

Seasons - Winter, Spring, Summer, Autumn

Holiday - Holiday/No holiday

Functional Day - NoFunc(Non Functional Hours), Fun(Functional hours)

Steps to solve the problem statement:

  • Understanding the dataset, performing some basic data inspection to verify the number of columns, comprehending data distribution, and examining the statistics for each variable. Performing a missing value check, To deal with outliers, visualise the distributions and boxplots for each variable. tidying up the info.

  • By feature engineering, new features were added, old features were removed, and the data was encoded into numerical form. Tried using certain adjustments to make the dependant variable regularly distributed.

  • Bi-variate analysis is used to determine whether the independent and dependent variables have any linear relationships. Using correlation analysis, you can see how severe the multicollinearity.

  • The data is finally scaled, and various techniques are tested. To improve accuracy, we first explored some straightforward models like the linear regressor and decision tree before moving on to more complicated algorithms like tree ensemble.

  • The linear regressor model did not perform well since there was little linear relationship between the independent and dependent variables. As a result, the performance significantly improved. Utilizing boosting and ensemble techniques as well as fine-tuning the hyperparameters, we continued to enhance the model's performance. The XG Boost Regressor model displayed the best performance.

  • To avoid underfitting or overfitting, we observed various evaluation metrics together with the optimal collection of hyperparameters for the tested models. We also had a general understanding of the significance of each feature for each model.

Summary

This study demonstrates how numerous factors have an impact on bike rentals. Due to our understanding that many Koreans hire bikes throughout the week, we assumed that most of their use is for commuting to work or school. The number of rentals varies depending on a number of factors, including the day of the week, the hour of the day, and the weather. Because there are more rentals in the spring and summer, weather conditions are particularly crucial. And as we predicted, when the weather is good, more customers plan to rent bikes.

seoul-bike-sharing-demand-prediction's People

Contributors

anujm09 avatar

Stargazers

 avatar  avatar  avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.