View Code? Open in Web Editor NEW

This project forked from apache/tez

Mirror of Apache Tez

License: Apache License 2.0

Shell 0.33% Java 90.54% Roff 0.08% Python 0.22% JavaScript 7.48% HTML 0.97% CSS 0.39%

tez's Introduction

Apache Tez

apt-get install libprotobuf-dev 898 apt-get install protobuf-compiler 899 protoc 900 apt-get install libprotobuf-java 901 apt-get install python-protobuf 915 adduser 916 adduser fleandr 917 su - fleandr 918 cd ~/apache-ambari-2.6.0-src/phantomjs/ 919 ls 920 cd .. 921 cp -R phantomjs/ /home/fleandr/ 923 chown -R fleandr:fleandr /home/fleandr/phantomjs 924 su - fleandr

wget http://mirror.linux-ia64.org/apache/tez/0.9.0/apache-tez-0.9.0-src.tar.gz 2 tar -xvf apache-tez-0.9.0-src.tar.gz 3 cd apache-tez-0.9.0-src/ 4 vim pom.xml 5 9 export PATH=$PATH:/home/fleandr/phantomjs/bin

12 mvn clean package -DskipTests=true -Dmaven.javadoc.skip=true

Apache Tez is a generic data-processing pipeline engine envisioned as a low-level engine for higher abstractions such as Apache Hadoop Map-Reduce, Apache Pig, Apache Hive etc.

At its heart, tez is very simple and has just two components:

The data-processing pipeline engine where-in one can plug-in input, processing and output implementations to perform arbitrary data-processing. Every 'task' in tez has the following:

Input to consume key/value pairs from.
Processor to process them.
Output to collect the processed key/value pairs.

A master for the data-processing application, where-by one can put together arbitrary data-processing 'tasks' described above into a task-DAG to process data as desired. The generic master is implemented as a Apache Hadoop YARN ApplicationMaster.

Recommend Projects

fleandr / tez Goto Github PK

tez's Introduction

Apache Tez

tez's People

Contributors

Watchers

Recommend Projects

React

Vue.js

Typescript

TensorFlow

Django

Laravel

D3

Recommend Topics

javascript

web

server

Machine learning

Visualization

Game

Recommend Org

Facebook

Microsoft

Google

Alibaba

D3

Tencent