End-to-End Data Engineering Project

This is the repository for the LinkedIn Learning course End-to-End Data Engineering Project. The full course is available from LinkedIn Learning.

The world of data engineering is ever-changing, with new tools and technologies emerging on a regular basis. Building an effective analytics platform can be a daunting task, especially if you’re not familiar with all the tools available. How do you turn scattered, complex data into a model that drives insights and decision-making? In this course, Thalia Barrera teaches data professionals how to implement an end-to-end data engineering project using open tools from the modern data stack. She touches on best practices such as data modeling, testing, documentation, and version control, and shows you how to efficiently extract, load, and transform data into a unified, analytics-ready format. Through practical examples, Thalia takes you through the construction of a robust data pipeline for a fictional ecommerce company, showing you how to confidently select and use tools and how to implement best practices in data engineering.

Instructions

This repository has two branches: main holds the initial state of the project, and finished holds the final state. You can use the branch pop-up menu in GitHub to switch to a specific branch and look at the project at that stage, or you can add /tree/BRANCH_NAME to the repository URL to go directly to the branch you want.
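
As a small illustration of the /tree/BRANCH_NAME pattern described above, the following Python sketch builds the web URL for each branch. The helper name `branch_url` is hypothetical and not part of the course code, and YOUR_USERNAME is a placeholder for the account that owns your fork.

```python
def branch_url(repo_url: str, branch: str) -> str:
    """Build the GitHub web URL for a branch by appending /tree/BRANCH_NAME."""
    base = repo_url.rstrip("/")
    # A clone URL ends in ".git"; the web page URL does not, so drop the suffix.
    if base.endswith(".git"):
        base = base[: -len(".git")]
    return f"{base}/tree/{branch}"


if __name__ == "__main__":
    # YOUR_USERNAME is a placeholder; replace it with your GitHub username.
    repo = "https://github.com/YOUR_USERNAME/end-to-end-data-engineering-project-4413618"
    for name in ("main", "finished"):
        print(branch_url(repo, name))
```

Opening either printed URL in a browser should show the repository at that branch, assuming the fork exists under that account.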

Branches

You will be working in the main branch throughout the course. At any time, you can check out the finished branch to see what the finished project looks like.

Prerequisites

Ensure you have Python 3 installed. If not, you can download and install it from Python's official website.

Installing

  1. Fork the Repository:
    • Click the "Fork" button on the top right corner of this repository.
  2. Clone the Repository:
    • git clone https://github.com/YOUR_USERNAME/end-to-end-data-engineering-project-4413618.git
    • Note: Replace YOUR_USERNAME with your GitHub username.
  3. Navigate to the Directory:
    • cd end-to-end-data-engineering-project-4413618
  4. Set Up a Virtual Environment:
    • For macOS or Linux:
      • python3 -m venv venv
      • source venv/bin/activate
    • For Windows:
      • python -m venv venv
      • .\venv\Scripts\activate
  5. Install Dependencies:
    • pip install -e ".[dev]"
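
After completing the steps above, a quick check like the following can confirm that you are running Python 3 from inside the virtual environment. This is a minimal sketch using only the standard library; the function names `python_major_ok` and `in_virtualenv` are illustrative and not part of the course code.

```python
import sys


def python_major_ok(version_info=sys.version_info) -> bool:
    """Return True if the given interpreter version is Python 3 or newer."""
    return version_info[0] >= 3


def in_virtualenv() -> bool:
    """Return True when running inside an environment created with `python -m venv`."""
    # Inside a venv, sys.prefix points at the environment directory, while
    # sys.base_prefix still points at the base Python installation.
    return sys.prefix != sys.base_prefix


if __name__ == "__main__":
    print(f"Python version: {sys.version.split()[0]}")
    print(f"Python 3 or newer: {python_major_ok()}")
    print(f"Inside a virtual environment: {in_virtualenv()}")
```

Save the snippet to a file and run it with the same interpreter you used for `pip install`. If the last line reports False, the virtual environment is probably not activated in the current shell.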

Instructor

Thalia Barrera

Check out my other courses on LinkedIn Learning.
