Topic: emr-cluster
Something interesting about emr-cluster
emr-cluster,Amazon EMR for Data Science
User: achilleaskn
emr-cluster,This project demonstrates the use of Amazon Elastic MapReduce (EMR) for processing large datasets using Apache Spark. It includes a Spark script for ETL (Extract, Transform, Load) operations, AWS command line instructions for setting up and managing the EMR cluster, and a dataset for testing and demonstration purposes.
User: airscholar
Home Page: https://youtu.be/ZFns7fvBCH4
emr-cluster,Alibaba Cloud EMR Create Example for Python
User: alikemalocalan
emr-cluster,Capstone Project for Udacity's Data Engineering Nanodegree : End-to-end data pipeline to analyze covid-19 effect on airbnb
User: amine-akrout
emr-cluster,Data Engineering Expert Nanodegree - Data Lake on AWS using Spark and S3
User: amrelauoty
emr-cluster,Cloud-AccountReceivableReportSystem
User: anjijava16
emr-cluster,Create a data pipeline on AWS to execute batch processing in a Spark cluster provisioned by Amazon EMR. ETL using managed Airflow: extract data from S3, transform it with Spark, and load the transformed data back to S3.
User: anthonywong611
emr-cluster,Database Schema & ETL pipeline for Song Play Analysis | Bosch AI Talent Accelerator Scholarship Program
User: arfatmateen
emr-cluster,Reference Architectures for Datalakes on AWS
Organization: aws-samples
emr-cluster,
User: bdoepf
emr-cluster,Built a distributed system which completes several objectives with given data to generate loan reports using Amazon Web Services, Apache Spark, Java and Python.
User: berksudan
emr-cluster,This is an ETL application on AWS that uses open sales and customer data, available here: https://github.com/camposvinicius/data/blob/main/AdventureWorks.zip. It is a zipped file containing several .csv files to which we apply transformations.
User: camposvinicius
emr-cluster,Terraform module to provision an Elastic MapReduce (EMR) cluster on AWS
Organization: cloudposse
Home Page: https://cloudposse.com/accelerate
emr-cluster,Bits of code I use during live demos
User: dacort
emr-cluster,Apache Spark TPC-DS benchmark setup with EMR launch setup
Organization: dhiraa
emr-cluster,Performed business operations using big data technologies: AWS EMR, AWS RDS (MySQL), Hadoop, Apache Sqoop, Apache HBase, MapReduce
User: eddieamaitum
emr-cluster,ETL data pipeline using AWS services
User: fermat01
emr-cluster,Generic Python library for provisioning EMR clusters with YAML config files (Configuration as Code)
User: harshadranganathan
emr-cluster,Classwork projects and homework done through the Udacity Data Engineering Nanodegree
User: immu0001
emr-cluster,This repository contains a definition of a standard structure for Machine Learning and Data Pipeline projects
User: johnnylvp
Home Page: https://johnnylvp.github.io/Project-Standar-Documentation/
emr-cluster,Guide: Executing a python script on AWS EMR for big data analysis.
User: jpb111
emr-cluster,Full code for UDACITY's Data Engineer Nano Degree project. Implementing a Data Lake in Amazon's cloud with AWS S3, AWS EMR and Spark.
User: jpsalado92
emr-cluster,This project aims to demonstrate the efficiency of distributed computing and distributed databases. Multiple machine learning classification algorithms in Spark were used to predict the Air Quality Index in California.
User: liyinging
emr-cluster,Example for provisioning AWS EMR service with Terraform
User: m1theus
emr-cluster,A Cassandra Architecture for GDELT Database 🌍
User: maelfabien
emr-cluster,Used a public clickstream dataset from a cosmetics store to extract data and gather insights. Launched an EMR 5.29.0 cluster running Hive and used optimized Hive queries to identify customer behavior, with the aim of improving the store's sales.
User: manaswikamila05
emr-cluster,Orchestrating Cloud ETL Workloads
User: mikeacosta
emr-cluster,An end-to-end data pipeline for building Data Lake and supporting report using Apache Spark.
User: minhky2185
emr-cluster,Coalesced and transformed various data sources to create a comprehensive data lake for the USA tourism sector.
User: morgan-sell
emr-cluster,Repo for playing around with an AWS Elastic MapReduce (EMR) cluster
User: mwilchek
emr-cluster,Spark, Python, AWS EMR, MLlib, Spark Streaming, Spark SQL
User: nahidalam
emr-cluster,PlayerUnknown's Battlegrounds (PUBG) is a first-person shooter game where the goal is to be the last player standing. You are placed on a giant circular map that shrinks as the game goes on, and you must find weapons, armor, and other supplies in order to kill other players and teams and survive.
User: nileshsingal
Home Page: https://github.com/nileshsingal/PUBG-DATA-ANALYSIS
emr-cluster,AWS Lambda function to send EMR events to Slack via SNS
User: rkr2017
emr-cluster,BERT, AWS RDS, AWS Forecast, EMR Spark Cluster, Hive, Serverless, Google Assistant + Raspberry Pi, Infrared, Google Cloud Platform Natural Language, Anomaly detection, TensorFlow, Mathematics
User: rubenszimbres
emr-cluster,Spark Job on Amazon EMR cluster
User: rupeshtr78
emr-cluster,An end-to-end GoodReads Data Pipeline for Building Data Lake, Data Warehouse and Analytics Platform.
User: san089
emr-cluster,Projects related to Udacity Data Engineering Nanodegree including Data Modeling, Infrastructure setup on AWS cloud, Data Warehousing and Data Lake development on Amazon EMR and Redshift, developing Data Pipelines using Apache Airflow.
User: saurabhsoni5893
emr-cluster,Built a recommender system using the Apache Mahout machine learning library and carried out data analysis using Hadoop, Apache Hive, and Pig on the Amazon Customer Reviews dataset (130M+ reviews)
User: sayaliwalke30
emr-cluster,Event driven EMR via Serverless
User: sepulworld
emr-cluster,Used Amazon AWS and PySpark to solve this EDA assignment
User: shantamgarg24
emr-cluster,Uses EMR clusters to export DynamoDB tables to S3 and generates import steps
Organization: signiant
emr-cluster,A large-scale data framework that will enable us to store and analyze financial market data and drive future predictions for investment.
User: sjmiller8182
Home Page: https://sjmiller8182.github.io/Warehousing-Stock-Tweet-Data/
emr-cluster,Load data from S3, process the data into analytics tables using Spark and load them back into S3. Deployed this Spark process on a cluster using AWS EMR
User: tanay0510
emr-cluster,Amazon EMR Automatic Scaling using Custom Metrics
User: tmusabbir
emr-cluster,Hosting data lake with bid-ask data in S3 using Spark and Airflow
User: ucaiado
emr-cluster,This big data study aims to identify the most revenue-generating taxi zones in New York City for 2019. Three MapReduce algorithms were developed, and their performance was analyzed on input datasets of different sizes and on EMR clusters of different sizes.
User: udeshikadissa
emr-cluster,The goal of this project is to offer an AWS EMR template using Spot Fleet and On-Demand Instances that you can put to use quickly, so you can focus on writing PySpark code.
User: wittline
Home Page: https://wittline.github.io/pyspark-on-aws-emr/
emr-cluster,A boilerplate for spark projects with docker support for local development and scripts for emr support.
User: xianwill
emr-cluster,Collection of code for submitting Spark/Hadoop/Hive/Pig tasks to EMR (AWS Elastic MapReduce) | #DE
User: yennanliu