dend_postgres_data_modeling's Introduction

Data Modeling with Postgres for Udacity Data Engineering Nanodegree

Project overview

This is a data modeling project with Postgres for the Udacity Data Engineering Nanodegree. In this project I create a database schema in a Postgres database and an ETL pipeline that loads JSON files into the database using Python and SQL, in order to facilitate analysis of the data. These JSON files represent user activity logs collected by the music streaming app of an imaginary startup, Sparkify.
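
As a rough illustration of the extract step described above, the sketch below walks a directory tree, collects every .json file, and parses each non-empty line as one JSON record. The one-record-per-line layout and the function names find_json_files and read_records are assumptions made for illustration; they are not taken from the project's code.

```python
import json
import os


def find_json_files(root):
    """Return a sorted list of all .json file paths under root."""
    paths = []
    for dirpath, _dirnames, filenames in os.walk(root):
        for name in filenames:
            if name.endswith(".json"):
                paths.append(os.path.join(dirpath, name))
    return sorted(paths)


def read_records(path):
    """Parse a file holding one JSON object per line (assumed layout)."""
    records = []
    with open(path, encoding="utf-8") as handle:
        for line in handle:
            line = line.strip()
            if line:  # skip blank lines
                records.append(json.loads(line))
    return records
```

Each record comes back as a plain dict, so a later step can pick out the fields it needs before inserting rows into the database.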

Python scripts

  • create_tables.py: Drops the previous schema and creates empty tables
  • sql_queries.py: Defines all queries used in the ETL pipeline
  • etl.py: Loads data from the JSON files into the tables
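
The split between the query definitions and the table setup described above might look roughly like the sketch below: query strings are kept in lists, and a reset function runs the drop statements followed by the create statements. The table columns, the variable names, and the reset_tables helper are hypothetical. An in-memory SQLite database stands in for Postgres only so that the example runs without a server.

```python
import sqlite3

# Illustrative query strings in the style of a sql_queries.py module.
# The columns are assumptions, not the project's actual schema.
users_table_drop = "DROP TABLE IF EXISTS users"
songs_table_drop = "DROP TABLE IF EXISTS songs"

users_table_create = """
CREATE TABLE IF NOT EXISTS users (
    user_id INTEGER PRIMARY KEY,
    level   TEXT
)
"""
songs_table_create = """
CREATE TABLE IF NOT EXISTS songs (
    song_id TEXT PRIMARY KEY,
    title   TEXT
)
"""

drop_table_queries = [users_table_drop, songs_table_drop]
create_table_queries = [users_table_create, songs_table_create]


def reset_tables(cur):
    """Drop any existing tables, then recreate them empty."""
    for query in drop_table_queries:
        cur.execute(query)
    for query in create_table_queries:
        cur.execute(query)


def list_tables(cur):
    """Return the table names in the SQLite stand-in database."""
    cur.execute(
        "SELECT name FROM sqlite_master WHERE type = 'table' ORDER BY name"
    )
    return [row[0] for row in cur.fetchall()]


# SQLite in-memory database, used here only as a stand-in for Postgres.
conn = sqlite3.connect(":memory:")
cur = conn.cursor()
reset_tables(cur)
```

Because reset_tables only calls cur.execute, the same loop should work with a cursor from a Postgres driver. Keeping the SQL in one module means the setup and load scripts share a single set of definitions.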

Database schema

  • artists: Artists in the music database
  • songs: Songs in the music database
  • users: Users of the app
  • songplays: Records of song plays in log files
  • time: Timestamps of records
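
To make the time table more concrete, here is a minimal sketch of how one log timestamp could be expanded into a row of time units. It assumes the logs store timestamps as milliseconds since the Unix epoch and that the table holds the columns start_time, hour, day, week, month, year, and weekday. Both are assumptions, and the helper name time_row is hypothetical.

```python
from datetime import datetime, timezone


def time_row(ts_ms):
    """Expand an epoch timestamp in milliseconds into a time-table row.

    Returns (start_time, hour, day, week, month, year, weekday) in UTC.
    """
    start = datetime.fromtimestamp(ts_ms / 1000, tz=timezone.utc)
    return (
        start,
        start.hour,
        start.day,
        start.isocalendar()[1],  # ISO week number
        start.month,
        start.year,
        start.weekday(),  # Monday is 0
    )
```

Storing these units alongside the timestamp lets analytical queries group plays by hour or weekday without recomputing them each time.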

Prerequisites

The code is written in Python, as scripts and a Jupyter Notebook, and it runs against a Postgres database.

Run the Code

jupyter notebook etl.ipynb

python create_tables.py

python etl.py

Contributors

rauf-mifteev
