mubaraka Goto Github PK
Name: Mubaraka [Muby] Arif
Type: User
Name: Mubaraka [Muby] Arif
Type: User
Metadata driven hive table generation and joins implemented in python
Runs a MySQL query to parse metadata and column data.
A manually programmed Logistic Regression implementation of a Chronic Kidney Disease predictor.
It consists of 60,000 32x32 color images containing one of 10 object classes, with 6000 images per class.
Comet Data Pipeline is a Spark Ingestion Framework fo Batch (Hadoop) & Streaming (Coming) Systems
A platform for analysis & development of machine learning models using large de-identified healthcare datasets.
Corgi 柯基:MySQL & Oracle 到 Hive 批量数据集成服务
a set of scripts to pull meta data and data profiling metrics from relational database systems
Spark数据加载 To (HDFS/Hive)
This scala pet-project attempts to create a data replication application that will help replicate base and incremental data from source to target data stores
This is a repository of my data science projects showcasing my abilities with cloud computing deployment, machine learning, and advanced interactive data visualization techniques to produce value in real-world problems.
Generate Scala case class based on database table metadata
Deep Learning with Apache Spark and Deep Cognition
A simple program to put files from a directory into HDFS with the added functionality and defining how that action will happen
Diagrams describing Apache Hadoop internals (2.3.0 or later).
Python tools for healthcare machine learning
Predicting Heart Failure using Ensemble Learning with Spark
Transfer the hive metadata from one cluster to another
Find Hive Tables by Table or Column Names
Sync hive database/table information from its metastore to a mysql database
These projects all focus on the various algorithms under the umbrella of machine learning. Each project focuses on a specific algorithm and involves a hypothetical scenario where I need to implement the algorithm to arrive at a solution to a "real world" problem. Since the focus is on the actual algorithm and analysis of its performance the datasets are fairly clean and simple. In reality a huge chunk of the time spent working with datasets will be getting it into a form so these algorithms can perform appropriately.
Churn Prediction with PySpark using MLlib and ML Packages
Real-World Machine Learning Projects with Scikit-Learn [Video], Published by Packt
Project to load data from SFTP source into HDFS and Spark
This repository contains Spark, MLlib, PySpark and Dataframes projects
Spark application for ingest data into data lake
A declarative, efficient, and flexible JavaScript library for building user interfaces.
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google ❤️ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.