Code Monkey home page Code Monkey logo

divvy-munging's Introduction

#Enhanced Divvy data

Divvy is hosting its second annual Data Challenge in 2015, with submissings due March 18, 2015 (prizes include an Xbox One and a Windows phone). Divvy Munging is a small project to provide modified & enhanced data so entrants can create a project faster.

2014

Data

Notes

  • Divvy occasionally moves stations (but station IDs persist). There isn't yet a list of station moves.

Stats

Contribute your stats by opening an issue or submitting a pull request.

2013

This enhances the Divvy dataset by adding Google Maps-calculated distances between every station pair.

Data

This "master file" contains a table, ready for import into a MySQL database, that combines Divvy’s year 2013 trip data (released February 11, 2014) with corrected station names, station IDs that match Divvy’s stations availability API, launch times, geographic coordinates, precalculated member age, and estimated bicycling distance between stations (see notes).

Notes

  • Distances are in meters. The distance represents the Google Maps Bicycle directions distance for that station pair and no route was inspected before the algorithm’s route was accepted into the dataset. More about this distance data.
  • Launch dates were corrected (there used to be a message here that said don't trust them)
  • From Divvy's readme: "Trips that did not include a start or end date were removed from original table" is standard language Alta Bicycle Share uses across all data releases. In the case of Divvy's 2013 data, there were no trips without a start or end date removed from the original table. In general, these kinds of trips would occur from missing/stolen bikes.

Basic stats

Looking for more stats? Check here, here, and here.

Sex/gender distribution stats

69% of Divvy members are men, 31% are women, as of the end of 2013, and February 14, 2014. As of this writing, survey responses were not available. From Divvy.

1.8% of male workers 16 years and older primarily used a bicycle to get to work, while 0.7% of female workers 16 years and older did. Both have a margin of error of ±0.1%. Considering there were 625,289 male workers 16+ (MOE of ±4,264) and 588,612 female workers 16+ (MOE of ±4,113), that's a ratio of 73.20% to 26.80%. American Community Survey, 2008-2012, table S08010, via American FactFinder.

In 2013, 79.05% of member trips were by male members and 20.95% by female members. From my analysis of the trip data.

Credits

The pairwise distance was created by Nick Bennet (with assistance from Smart Chicago Collaborative), and the SQL query to merge them by Eric Van Zanten.

Need support?

divvy-munging's People

Contributors

stevevance avatar stringertheory avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

divvy-munging's Issues

Issues with 2014 data

I was looking at the 2014 data, and was surprised to see that while Q1/Q2 gender is "Male", "Female", or "", in the 3 Q3/Q4 .csv files gender is "0", "1", or "2".
Any idea what do "0", "1", and "2" correspond to?

Also, I noticed that a couple of rides (<200) where usertype = "Customer" (rather than "Subscriber") and gender = "Male" or "Female".
What does it mean for customer (i.e. not subscriber) gender to be known?

Finally,
What does usertype = "Dependent" mean?

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.