dataalchemisttech Goto Github PK
Name: Data Alchemist
Type: User
Location: Tallinn Estonia
Name: Data Alchemist
Type: User
Location: Tallinn Estonia
Projects done in the Data Engineering Nanodegree by Udacity.com
Resources and projects from Udacity Data Engineering with AWS nano degree programme
Free Data Engineering course!
Source code accompanying book: Data Science on the Google Cloud Platform, Valliappa Lakshmanan, O'Reilly 2017
A repo for data science related questions and answers
This is my porfolio website.
Data Engineering Project on COVID-19 DataLake by AWS
Example end to end data engineering project.
Dataform is a framework for managing SQL based data operations in BigQuery, Snowflake, and Redshift
Streaming Anomaly Detection Solution by using Pub/Sub, Dataflow, BQML & Cloud DLP
Distance metrics are one of the most important parts of some machine learning algorithms, supervised and unsupervised learning, it will help us to calculate and measure similarities between numerical values expressed as data points
data load tool (dlt) is an open source Python library that makes data loading easy 🛠️
Dockerizing and Consuming an Apache Livy environment
Apache Spark docker image
This is an example project to demonstrate how one can easily scale simulation runs with docker containers
The goal of this project is to identify students at risk of dropping out the school
Databricks ETL (Extract Transform Load) Pipeline
Airflow DAGs for exporting, loading, and parsing the Ethereum blockchain data. How to get any Ethereum smart contract into BigQuery https://towardsdatascience.com/how-to-get-any-ethereum-smart-contract-into-bigquery-in-8-mins-bab5db1fdeee
[READ-ONLY] PHP - ETL (Extract Transform Load) data processing library
Simple ETL pipeline using Python
Stream processing pipeline from Finnhub websocket using Spark, Kafka, Kubernetes and more
:snake: Python wrapper for Goodreads API :books:
An end-to-end GoodReads Data Pipeline for Building Data Lake, Data Warehouse and Analytics Platform.
A package to easily open an instance of a Google spreadsheet and interact with worksheets through Pandas DataFrames.
This is my personal collection of free Hadoop books, please feel free to share and learn.
A declarative, efficient, and flexible JavaScript library for building user interfaces.
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google ❤️ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.