Code Monkey home page Code Monkey logo

etl-project-group-9's Introduction

ETL-Project-Group-9

Team Members:

Chad Dubiel, David Martinez, Katy Fuentes

Scope of Research:

Correlation between cryptocurrency pricing and Covid case counts.

Github Repo:

https://github.com/cdubiel08/ETL-Project-Group-9

Data Sources:

Source:

Other:

  • What useful investigation could be done with the final database? Use the output and compare to markets, commodities, or US dollar.
  • Whether final database will be relational or non-relational. Why? Relational because the information will be interconnected based on a timeframe.

Considerations:

Dates not a good join method, need a unique ID for primary key

Data Analysis

  • Pandas - for data formatting, date cleaning, reduce columns
  • Mongo - better for skipping null values which would skip data column, any covid/crypto overlaps captured โ€ƒ

Steps

Data Sources:

  • At least 2 (or more) sources
  • If possible, try to incorporate a web API as one of your data sources.

ETL Process:

  • Within Jupyter, build out the ETL process to extract your data from their sources, apply some level of transformation, and load the resulting data to a database (relational or non-relational)

Flask API:

  • Build a Flask application that has a route that will execute a query to your database and return the results in JSON format.

Final Report:

  • Write up a short report that details your 3 ETL steps.
  • More details on a later slide.

Github Repo:

  • Store all of your project files in a well-organized project repository
  • Each member of your team will submit a link to your project repo to BCS by the end of class Tuesday

Write Up Process Summary:

  • What data sources you chose and why?
  • Detailing the process of the extraction, transformation, and loading steps
  • Explain why you have performed the types of transformation you did
  • Why you chose the type of final database
  • Schema of the tables/collections in the final database
  • Hypothetical use case(s) for your database

etl-project-group-9's People

Contributors

cdubiel08 avatar davidmrtnz20 avatar katytx avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.