Code Monkey home page Code Monkey logo

bikeshare's People

Contributors

breannam avatar darkzeroman avatar ejette avatar hunterowens avatar jpvelez avatar wdempsey avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

bikeshare's Issues

Do the validation

Train from Feb 2012 to Feb 2013, then predict out 15-60 min with a day + hour offset.

Write model validation script

This involves figuring out the best way to use some data as a test set. For example, do we randomly leave out data points and subsequent data points (based on the "degree" of the AR model we're using), train the model, and then see how we did on our left out points?

Estimate model parameters

Once we have the model we want, we need to estimate its parameters by crunching historical data of number of bikes and docks at each station.

So we need a parameter_estimation.py script that:

  1. Pulls all the historical data we need to do that estimation from the database using SQL query and puts it in a dataframe.
  2. Estimates the parameters using this dataframe
  3. Spits out the parameters as a vector so our model can use them in our prediction script.

Repo Cleanup

Directory Structure

  • Clean, Well named set of directories. Examples include webapp, database, and models.
  • No random files in the root.
  • Explanation of each directory in README.
  • Sub-README's in appropriate folders
  • No directories named after DSSG specific info (ie, person names)
  • Should your team have more than one project, each should have it its own repo.
  • In your data or database folder, provide a way to re-create your database from scratch. .sql files are often appropriate for this.

README

  • Links to appropriate sections in wiki. See wiki issue for more info.
  • Answers: What have you built? In a few sentences.
  • Answers: How do you install it?
  • Answers: What needs to be done/How can I help?
  • Has some sort of Contact Info
  • Open source license

Config

  • No public facing config info - Make sure never to hardcode in database url, password, etc.
  • Description of how you hide config info, ie yaml, environment variables, etc
  • config.example files
  • Requirements.txt or similar file.
  • Relative links in any html.

Use model to predict number of bikes at station

Once we've picked a statistical model and estimated its parameters, we need to use this model to actually predict how many bikes are going to be at every station in a bikeshare system in 60 minutes.

So we need a ARMA_prediction.py script that

  1. Defines the ARMA model we're using.
  2. Takes the parameter estimates output by parameter_estimation.py and uses them for our model
  3. Query the database to get the current number of bikes at the station, time, day of week, month, and whatever other inputs the model needs.
  4. Throw these inputs into the model and spit out a prediction of the number of bikes that will be present at each station in 60 minutes. This predicted output will eventually be shown on a simple webpage.

Make stupid simple web app to show model results

Here are the fields we want to show in a table on a page:

  • station name
  • percent full right now
  • how long station has been full/empty as of now
  • % full one hour from now
  • how long will it have been full/empty in 60 minutes

Wiki

REQUIRED WIKI SECTIONS

  • Homepage

    Intro: have a sentence or two about the project: the problems its solving, the partner, and how you're solving it. Also say that "this wiki is the central place to learn about the social problem we worked on, the data we used, the methods we used to solve it, and our findings" so people know what they're looking at.
    List of pages in the wiki

  • Problem

    An in-depth description of the problem your organization, the problem you're trying to solve, and any relevant domain knowledge. Feel free to copy from blog posts and posters, if relevant.

  • Data

    Describe the dataset(s) you used in the project as well as your database. Walk people through the data model (tables are handy for this), and include a (fake) sample of each dataset.
    If you scraped data, this is the place to document that.

  • Methodology

    An in-depth, technical write up of the method(s) you used on your projects. Use latex equation, walk people through algorithms and models, link out to relevant documentation when possible.

  • Results

    Discuss what metrics you're using to evaluate performance (if applicable), and what your final findings where.

  • Future work

    Discuss what you would like to do / what is in progress.

OPTIONAL WIKI SECTIONS - if its fits your project

  • Analysis

    If you did exploratory data analysis, this is the place to put it and explain your findings. Explain each finding and what your learned from it / how it motivated the methods you used. Put this between the "Data" and "Methodology" sections. Feel free to lift content from relevant blog posts, if any.

  • Resources

    Resources for domain knowledge, methods, and tech. Whatever pieces of paper you used to learn what you know.

  • Tool

    • API Documentation
    • Web app Documenation

Figure out the terms of the ARMA model we're using

We're using an ARMA statistical model to predict the number of bikes and docks available at a station at a certain time.

They can be more or less complicated, so let's figure out what terms we want in the first, simple version. We can also do something more complicated, but let's start simple.

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.