Code Monkey home page Code Monkey logo

tahhnik / designing-large-scale-data-warehouse-with-azure-synapse-analytics Goto Github PK

View Code? Open in Web Editor NEW
0.0 1.0 0.0 43.38 MB

The repo simulates query load, optimizes performance, and offers practical guidance for building data warehouses with Azure Synapse Analytics. ๐Ÿš€

PowerShell 38.33% TSQL 61.67%
azure azurefunctions azuremachinelearning azuresynapse cloud cloud-computing data-analytics data-engineering datascience datawarehousing machinelearning streamprocessing

designing-large-scale-data-warehouse-with-azure-synapse-analytics's Introduction

Static Badge Static Badge Static Badge Static Badge Static Badge Static Badge Static Badge Static Badge Static Badge

From Design to Deployment: Data Warehousing with Azure Synapse Analytics

The repo simulates query load, optimizes performance, and offers practical guidance for building data warehouses with Azure Synapse Analytics. ๐Ÿš€

The project follows the following architecture,

Infrastructure Diagram

Directories ๐Ÿ“‚

  1. data: The datasets, fact, and dimension tables for the data warehouse
  2. Queries_Part4.sql: SQL code for querying and analyzing the created data warehouse
  3. setup.json: ARM (Azure Resource Manager) template. It is a block of code that defines the infrastructure and configuration for the project
  4. setup.ps1: PowerShell script to provision the Azure Synapse Workspace along with the tables of the data warehouse (configured in setup.sql)
  5. setup.sql: SQL script for creating the tables of the data warehouse
  6. table_creation_codes.sql: SQL script to create the fact and dimension tables with dedicated SQL pool

Prerequisites:

  1. An active Azure subscription
  2. Knowledge of Azure Data Fundamentals, Good to begin from here

Getting Started ๐Ÿš€

  1. Clone the Repository:
    git clone https://github.com/tahhnik/Designing-Large-Scale-Data-Warehouse-with-Azure-Synapse-Analytics.git
    

To Set Up on the Azure Portal (Detailed walkthrough is in the blog posts)

Clone the repository inside Azure workspace through PowerShell on Azure Portal

git clone https://github.com/tahhnik/Designing-Large-Scale-Data-Warehouse-with-Azure-Synapse-Analytics.git

synapase-ui-shell-gitclone

  1. Explore the Directories: Navigate into each directory to find detailed automation scripts, SQL codes for queries, and configurations.

  2. Follow the Blog: Implementation details and insights are documented in the associated series of blog posts in Medium.

    โšกIf you want to skip the initial processes like data modeling, and schema design and directly jump onto building the warehouse on Azure Synapse Analytics, follow the Blog Three,

    โšกIf you want to skip the building warehouse processes on Azure Synapse Analytics and directly jump onto querying the tables with SQL (T-SQL in this context), follow the Blog Four

Tools Explored ๐Ÿ› ๏ธ

  1. Azure Synapse Analytics: Enterprise analytics service for data warehouses and big data systems.
  2. Azure Portal: Unified console to manage Azure resources.
  3. Azure Stream Analytics: Real-time stream processing engine.
  4. Azure Machine Learning: Cloud service for ML project lifecycle.
  5. Azure Data Lake Storage Gen2: Scalable storage for data lakes.
  6. Power BI: Business analytics service for data visualization.
  7. Azure Function Apps: Serverless applications for event-driven scenarios.
  8. Azure Cosmos DB: Globally distributed, multi-model database.

Blog Implementation ๐Ÿ“

To implement this project, follow the step-by-step guide in our detailed blog post. Learn how each tool plays a crucial role in creating and scaling a data warehouse on Azure.

Final Draft

Components:

  • The architecture of the data warehouse
  • The details of the data pipeline
  • Brief Discussions of the tools and processes used
Final Draft (3)

Components:

  • Exploring the attributes of each logical entity in the context of retail companies

  • The details of the schema and developing the snowflake schema

  • schema-design-smol

  • Brief Discussions of the data model

Final Draft (3)

Components:

  • Provisioning the Azure Synapse Analytics Workspace with UI and ARM templates synapase-ui-shell-deploy1

  • Provisioning dedicated SQL pool within Azure Synapse Analytics Workspace

  • Creating the SQL database and tables (facts and dimensions)

  • Loading data into the tables

  • azure-shell-sqlfiles

Blog Four: From Design to Deployment: Data Warehousing with Azure Synapse Analytics (Part 3: Querying the Data Warehouse) (is being written at this moment)

Final Draft (3)

Components:

  • Querying the data warehouse
  • Showcasing the analytical capabilities of Azure Synapse Analytics

Acknowledgments ๐Ÿ™Œ

My humble gratitude to my friends and family who are the constant support of my works and endeavors

Connect with Me โฌ‡๏ธ

LinkedIn Twitter

designing-large-scale-data-warehouse-with-azure-synapse-analytics's People

Contributors

tahhnik avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.