Code Monkey home page Code Monkey logo

sheetalbongale / scooters-in-austin-data-analysis Goto Github PK

View Code? Open in Web Editor NEW
7.0 3.0 3.0 67.63 MB

Project analyzing the City of Austin's Open Data for 8.4M shared scooter rides and also compare the city's 311 complaints about the same. Find insights about the popular spots for scooter rides and the correlation if these areas also have more scooter complaints and more | Python Project 1 | UT Data Analysis and Visualization Nov 2019 - May 2020

Jupyter Notebook 99.60% Python 0.40%
austin-data-analysis scooters 311-data api data-visualization census-tract plotly geopandas

scooters-in-austin-data-analysis's Introduction

Scooters (Shared Mobility) in Austin - Data Analysis

Austin Scooter

"In 2018, people took 84 million trips on Shared Micromobility in the United States, more than double the number of trips taken in 2017" - NACTO

Our Question:

  • Do the most popular zip codes to start or end rides also have the most complaints?
  • Does frequency of ending scooter rides in zipcode impact complaints when adjusted on per capita basis?
  • Number of rides in certain months of the year.
  • Does the distance of the trip impact the method of transportation (scooter vs. bicycle)
  • What are the 10 most frequent routes for all methods?

Hypothesis:

While we thought that obviously the neighborhoods with the most rides would likely have the most complaints, we also thought that some neighborhoods complain about a higher percent of the scooters that enter their area than other neighborhoods do. We figured these would be neighborhoods that are not as popular start points, who see the scooters longer and thus are more likely to complain.

Data Set:

Technologies Used:

  • Pandas
  • NumPy
  • Matplotlib
  • GeoPandas
  • Plotly.js
  • SQLAlchemy

Actions and Tasks:

  • Data Extraction
  • Data Cleaning and Exploration
  • Data Story and Visualization
  • Writing Analysis and Conclusions

Notebooks:

  • shared_mobility_data_wrangling.ipynb: Data munging and generates clean data CSV file for the Shared Mobility API Data. Also contains script to merge zipcodes to census tracts. The final merged CSV file is what we use for further analysis and visualization.
  • shared_mobility_data_story.ipynb: Data Visualization for shared mobility and some incomplete further work.
  • 311_data_wrangling.ipynb: Data munging and generates clean data CSV file for the 311 Complaints API Data.
  • 311_data_story.ipynb: Data Visualization for 311 data.
  • write_to_sql.py: Convert the 311 dataframe to SQL Database.

Analysis & Conclusions:

  1. 78701 Zipcode has the most number of starting and ending rides.
  2. 78701 also had the most 311 complaints about shared mobility and dockless vehicles.
  3. Saturday was the most popular day to ride a bicycle or a scooter. Also, Weekend spike indicates people like to ride scooters recreationally, or while they are enjoying activities that are outside of their normal weekly commute.
  4. We can see there is a rush hour after work. We saw a peak in the number of rides at 5pm and also before & after 5pm.
  5. October, September and March have the most rides. This is when the weather is nicest in Austin, but also when the biggest Music festivals are.

Which hours of the day are most popular?

trips_per_hour.png

What are the most popular days of the week?

trips_per_week.png

Are some months more popular than others?

trips_per_month2.png

Total daily rides per hour for each day of the week?

dailyrides_per_hour.png

Which census tracts have the most traffic?

trips_per_censustract.png

What are the most popular Census Tracts to start a ride?

choropeth_tractstart.png

What are the most popular Census Tracts to end a ride?

choropeth_tractsend.png

Which Zipcodes have the most traffic? (Without per capita factor)

trips_per_zipcode.png

What are the top 200 popular routes to start and end rides?

trips_popularroutes.png

What locations in Austin get the most complaints about shared mobility??

complains.png

Major limitations in our data:

  1. The data set for the shared mobility data is limited by geography, we can’t get more specific than census tract. Hence we used Census tract shapefiles and geopandas.
  2. 311 data was not clear about what the complaints were specifically about — you would have to manually find them on the 311. We also couldn’t see if it was business, resident, or passerby.
  3. Joyriding, some rides are really short— some people just turn scooters on and don’t seem to take it anywhere. Our overall hypothesis and the census tract choropeth plots will change if we discard all the rows for rides with '0'meter distance.
  4. We’d like to isolate the music festival dates out and see how much those specific dates impact the data.
  5. 78701: Neighborhoods that are mostly commercial have a lot of people who work there, own businesses there, walk around there, but do not show up in the population. It is possible that business owners are more likely to complain because they are made less available to customers due to scooter issues. Nearly 100k people work in downtown Austin which is drastically different from the number of people who are living there. We saw rush hour right after work.

Resources Used:

Documentation about Dataset:

Team Members: Erin Bentley, Jason Jones, Sana Khan, Sheetal Bongale

UT Data Analysis & Visualization Bootcamp | January 2020

Scooter meme

scooters-in-austin-data-analysis's People

Contributors

erinbentley avatar jljones78 avatar sheetalbongale avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.