tko-data-engineering's Introduction

Data Engineering with Snowpark

This repository contains a hands-on lab for data engineering in Snowflake with Snowpark!

[Figure: overview of what we'll build in this lab]

Preview Features

Note: The following features/tools used in this lab are still in preview

Setup

You will need the following things before beginning:

  • Snowflake
    • A Snowflake Account
    • A Snowflake user created with ACCOUNTADMIN permissions. This user will be used to get things set up in Snowflake.
  • Anaconda
  • SnowSQL
    • SnowSQL installed on your computer. Go to the SnowSQL Download page and see the Installing SnowSQL page for more details.

    • Create a SnowSQL configuration for this lab by adding the following section to your ~/.snowsql/config file (replacing the account, username, and password with your values):

      [connections.dev]
      accountname = myaccount
      username = myusername
      password = mypassword
      rolename = HOL_ROLE
      warehousename = HOL_WH
      dbname = HOL_DB
      
  • Visual Studio Code with required extensions
    • Visual Studio Code installed on your computer. Check out the Visual Studio Code homepage for a link to the download page.
    • Python extension installed. Search for and install the "Python" extension in the Extensions pane in VS Code.
    • Snowflake extension installed. Search for and install the "Snowflake" extension in the Extensions pane in VS Code.
  • GitHub account with lab repository forked and cloned locally
    • A GitHub account. If you don't already have a GitHub account you can create one for free. Visit the Join GitHub page to get started.
    • A forked lab repository. You'll need to create a fork of this lab repository in your GitHub account. Visit the tko-data-engineering GitHub Repository and click on the "Fork" button near the top right. Complete any required fields and click "Create Fork".
    • A local clone of the forked lab repository. Open your forked repository on GitHub, click the green "Code" button near the top of the page, and copy the "HTTPS" link. Use that link in VS Code to clone the repo to your computer. See Clone and use a GitHub repository in Visual Studio Code for more details.
  • Anaconda environment
    • Create and activate a conda environment for this lab using the supplied conda_env.yml file. Run these commands from a terminal in the root of your local repository:

      conda env create -f conda_env.yml
      conda activate pysnowpark
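
Once the environment is active, it can help to confirm that the SnowSQL configuration described above contains every setting the lab expects. The following is a minimal sketch using only the Python standard library. The helper names read_snowsql_connection and missing_keys are hypothetical and not part of the lab repository, and accepting either "account" or "accountname" for the account setting is an assumption made for illustration.

```python
import configparser
from pathlib import Path

# Hypothetical helpers for illustration; they are not part of the lab repository.

REQUIRED_KEYS = ("username", "password", "rolename", "warehousename", "dbname")


def read_snowsql_connection(text, name="dev"):
    """Return the [connections.<name>] section of SnowSQL-style config text as a dict."""
    # interpolation=None keeps '%' characters in passwords from being treated as
    # configparser interpolation syntax; strict=False tolerates repeated keys.
    parser = configparser.ConfigParser(interpolation=None, strict=False)
    parser.read_string(text)
    section = f"connections.{name}"
    if not parser.has_section(section):
        raise KeyError(f"section [{section}] not found")
    return dict(parser[section])


def missing_keys(conn):
    """List the expected settings that are absent or empty."""
    missing = []
    # Assumption: accept either spelling of the account setting.
    if not (conn.get("account") or conn.get("accountname")):
        missing.append("account")
    missing.extend(key for key in REQUIRED_KEYS if not conn.get(key))
    return missing


if __name__ == "__main__":
    config_path = Path.home() / ".snowsql" / "config"
    if not config_path.exists():
        print(f"{config_path} not found")
    else:
        try:
            conn = read_snowsql_connection(config_path.read_text())
        except (KeyError, configparser.Error) as exc:
            print(f"could not read connection: {exc}")
        else:
            # Only key names are printed, never the password itself.
            print("missing settings:", missing_keys(conn) or "none")
```

Saved under any file name and run with python from the activated environment, the script reports which of the expected settings, if any, are missing from the [connections.dev] section. It only inspects the file and does not connect to Snowflake.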

tko-data-engineering's People

Contributors

sfc-gh-jhansen, sfc-gh-cbaechtold
