niloferbano / flow-based-spark-ml

Java 68.02% JavaScript 1.05% TypeScript 21.41% CSS 1.37% HTML 8.15%

flow-based-spark-ml's Introduction

Flow-based programming for automatic ML application creation

Getting Started

Prerequisites

JDK 8 (set the JAVA_HOME environment variable)
Maven
Node.js
Angular CLI (version 8.3.18)
MongoDB

Clone

To get started, clone this repository using git:

git clone https://github.com/niloferbano/Flow-based-Spark-ML.git
cd Flow-based-Spark-ML

Start the backend

Set MAVEN_PATH in src/main/java/de/tum/spark/ml/codegenerator/MavenBuild.java
to match your Maven installation path.
Install the frontend dependencies, then build from the repository root:
cd Flow-based-Spark-ML/src/main/resources/frontend/sparkML-ui
npm install
cd Flow-based-Spark-ML
mvn clean install
Run the application with mvn spring-boot:run or java -jar target/your-file.jar,
or from the IDE of your choice.
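
The reason MAVEN_PATH has to match the local installation can be illustrated with a minimal sketch of a Java program that shells out to Maven. This is not the code in MavenBuild.java: the class name, the placeholder path, and the `clean package` goals are assumptions made for illustration only.

```java
import java.io.File;
import java.io.IOException;
import java.util.Arrays;
import java.util.List;

// Hypothetical sketch, not the repository's MavenBuild class.
final class MavenInvocationSketch {

    // Assumption: placeholder path; replace it with the location of your own mvn executable.
    static final String MAVEN_PATH = "/usr/local/bin/mvn";

    // Builds the command line. The goals "clean package" are an assumption.
    static List<String> buildCommand(String mavenPath) {
        return Arrays.asList(mavenPath, "clean", "package");
    }

    // Runs Maven inside projectDir and returns its exit code.
    // If mavenPath does not point at a real executable, start() throws an IOException.
    static int run(String mavenPath, File projectDir) throws IOException, InterruptedException {
        Process process = new ProcessBuilder(buildCommand(mavenPath))
                .directory(projectDir)
                .inheritIO() // forward Maven's output to this process's console
                .start();
        return process.waitFor();
    }

    public static void main(String[] args) {
        // Only prints the command; call run(...) to actually execute it.
        System.out.println(String.join(" ", buildCommand(MAVEN_PATH)));
    }
}
```

If the configured path is wrong, a call like `run(...)` fails before Maven ever starts, which is why the path should be checked before generating a jar.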

Start the frontend

Navigate to Flow-based-Spark-ML/src/main/resources/frontend/sparkML-ui
npm start
Browse to http://localhost:4200

Creating an ML application JAR file from the frontend

Go to http://localhost:4200
Creating a flow for application generation involves four steps:
1. Start
2. FeatureExtraction/FeatureExtractionFromTextFile
3. ML model of your choice (Decision Tree / KMeans / Collaborative Filtering)
4. Save Model
After creating the flow by dragging and dropping the steps onto the pipeline panel, click the Generate Jar button to submit the job.
You will get either a success message with the path to the package/code created by the application, or a failure message if something goes wrong.
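
The four-step ordering described above can be sketched as a small check. This is a hypothetical helper, not part of the repository: the class name and the identifiers used for the steps (for example "DecisionTree" and "SaveModel") are assumptions based on the step names listed in this README, and the application may represent flows differently.

```java
import java.util.Arrays;
import java.util.HashSet;
import java.util.List;
import java.util.Set;

// Hypothetical helper: checks that a flow lists the four steps in the documented order.
final class FlowOrderCheck {

    // Step identifiers are assumptions derived from the names in this README.
    private static final Set<String> FEATURE_STEPS = new HashSet<>(Arrays.asList(
            "FeatureExtraction", "FeatureExtractionFromTextFile"));
    private static final Set<String> MODEL_STEPS = new HashSet<>(Arrays.asList(
            "DecisionTree", "KMeans", "CollaborativeFiltering"));

    // A flow is valid when it has exactly four steps:
    // Start, one feature extraction step, one ML model, Save Model.
    static boolean isValidFlow(List<String> steps) {
        return steps.size() == 4
                && "Start".equals(steps.get(0))
                && FEATURE_STEPS.contains(steps.get(1))
                && MODEL_STEPS.contains(steps.get(2))
                && "SaveModel".equals(steps.get(3));
    }

    public static void main(String[] args) {
        List<String> ordered = Arrays.asList("Start", "FeatureExtraction", "KMeans", "SaveModel");
        List<String> misordered = Arrays.asList("Start", "KMeans", "FeatureExtraction", "SaveModel");
        System.out.println(isValidFlow(ordered));    // prints true
        System.out.println(isValidFlow(misordered)); // prints false
    }
}
```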

Links to the data used for the use cases

Decision Tree use case: https://archive.ics.uci.edu/ml/machine-learning-databases/covtype/
KMeans Clustering use case: https://kdd.ics.uci.edu/databases/kddcup99/kddcup99.html
Collaborative Filtering use case: https://storage.googleapis.com/aas-data-sets/profiledata_06-May-2005.tar.gz
