Code Monkey home page Code Monkey logo

udacity-data-engineering-nanodgree's Introduction

Udatcity - Data Engineering Nanodgree Program

Learn to design data models, build data warehouses and data lakes, automate data pipelines, and work with massive datasets.

  • Create user-friendly relational and NoSQL data models
  • Create scalable and efficient data warehouses
  • Work efficiently with massive datasets
  • Build and interact with a cloud-based data lake
  • Automate and monitor data pipelines
  • Develop proficiency in Spark, Airflow, and AWS tools

Course 1 - Data Modeling

Learn to create relational and NoSQL data models to fit the diverse needs of data consumers. Use ETL to build databases in PostgreSQL and Apache Cassandra.

Contents

  • Introduction to Data Modeling
  • Relational Data Models
  • NoSQL Data Models

Projects

  • Data Modeling with Postgres
  • Data Modeling with Apache Cassandra

Course 2 - Cloud Data Warehouses

Learn to create cloud-based data warehouses. Sharpen your data warehousing skills, deepen your understanding of data infrastructure, and be introduced to data engineering on the cloud using Amazon Web Services (AWS).

Contents

  • Introduction to the Data Warehouses
  • Introduction to the Cloud with AWS
  • Implementing Data Warehouses on AWS

Project

  • Build a Cloud Data Warehouse

Course 3 - Data Lake with Spark

Learn more about the big data ecosystem and how to use Spark to work with massive datasets. Learn about how to store big data in a data lake and query it with Spark.

Contents

  • The Power of Spark
  • Data Wrangling with Spark
  • Debugging and Optimization
  • Introduction to Data Lake

Project

  • Build a Data Lake

Course 4 - Data Pipelines with Airflow

Learn to schedule, automate, and monitor data pipelines using Apache Airflow. Learn to run data quality checks, track data lineage, and work with data pipelines in production.

Contents

  • Data Pipelines
  • Data Quality
  • Production Data Pipelines

Project

  • Data Pipelines with Airflow

Final Project - DEND Capstone Project

Combine all the skills throughout the program to build your own data engineering portfolio project.

Project

  • Data Engineer Capstone

udacity-data-engineering-nanodgree's People

Contributors

kenthsu avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.