Himanshu Agrawal's Projects
This repository provides a command-line interface (CLI) utility that replicates an Amazon Managed Workflows for Apache Airflow (MWAA) environment locally.
Repository for Programming Assignment 2 for R Programming on Coursera
Tools for creating Dataproc custom images
Data lake project that extracts and loads data into Amazon S3, then processes and transforms it using Apache Spark
This repository consists of a database project built on a star schema approach from user activity data available in JSON format.
This project builds high-grade data pipelines with Apache Airflow. The pipeline is built from dynamic, reusable tasks, implements data quality checks to catch data discrepancies, can be monitored, and allows easy backfills.
Data warehousing project on Amazon Redshift
Repo with code for the DBT Fundamentals
k-means clustering using Spark and HDFS
This repository contains solutions to LeetCode database problems.
Models built with TensorFlow
Salesforce API Postman Collection
SMART on FHIR developer tutorial
Apache Spark - A unified analytics engine for large-scale data processing