Code Monkey home page Code Monkey logo

cloud-dataflow-nyc-taxi-tycoon's Introduction

NYC Taxi Tycoon - Dataflow Codelab

This is the support code and solutions for the NYC Taxi Tycoon Dataflow Codelab

TL;DR

In this codelab you learn how to process streaming data with Dataflow. The public emulated data stream is based on the NYC Taxi & Limousine Commission’s open dataset expanded with additional routing information using the Google Maps Direction API and interpolated timestamps to simulate a real time scenario.

Public Pubsub Data Stream

The public Google Cloud Pubsub topic used in the codelab is available at: projects/pubsub-public-data/topics/taxirides-realtime

To test the public data stream you can create a subscription by using the gcloud cli. Make sure you have selected a project with billing and the PubSub API enabled.

gcloud alpha pubsub subscriptions create taxi-test-sub --topic projects/pubsub-public-data/topics/taxirides-realtime

Wait for a couple seconds before pulling a message from your subscription

gcloud alpha pubsub subscriptions pull projects/<your-project-id>/subscriptions/taxi-test-sub

Finally, delete the test subscription using

gcloud alpha pubsub subscriptions delete projects/<your-project-id>/subscriptions/taxi-test-sub

Contents of this repository

In the dataflow folder you find the solutions for all dataflow pipelines covered in the NYC Taxi Tycoon Dataflow Codelab

The visualizer code to display the output of your codelab dataflow pipelines can be found in the gh-pages branch. The visualizer is available hosted here.

Use: The NYC Taxi & Limousine Commission’s dataset is publicly available for anyone to use under the following terms provided by the Dataset Source —https://data.cityofnewyork.us/— and is provided "AS IS" without any warranty, express or implied, from Google. Google disclaims all liability for any damages, direct or indirect, resulting from the use of the dataset.

This is not an official Google product

cloud-dataflow-nyc-taxi-tycoon's People

Contributors

hostirosti avatar idofl avatar martin-gorner avatar

Watchers

James Cloos avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.