Skip to content
Mirror of Apache Tez
Branch: master
Clone or download
Chyler and Jonathan Eagles TEZ-4045. Task should be accessible from TaskAttempt
Signed-off-by: Jonathan Eagles <jeagles@apache.org>
Latest commit d5675c3 Mar 22, 2019
Permalink
Type Name Latest commit message Commit time
Failed to load latest commit information.
build-tools
docs TEZ-4031. Support tez gitbox migration (Jonathan Eagles via kshukla) Mar 19, 2019
hadoop-shim-impls TEZ-3988. Update snapshot version in master to 0.10.1-SNAPSHOT Sep 14, 2018
hadoop-shim
tez-api
tez-build-tools
tez-common TEZ-3988. Update snapshot version in master to 0.10.1-SNAPSHOT Sep 14, 2018
tez-dag
tez-dist TEZ-4044. Zookeeper: exclude jline from Zookeeper client from tez dist Mar 12, 2019
tez-examples TEZ-3988. Update snapshot version in master to 0.10.1-SNAPSHOT Sep 14, 2018
tez-ext-service-tests
tez-mapreduce TEZ-4049. Fix findbugs issues in NotRunningJob (Jonathan Eagles via k… Feb 28, 2019
tez-plugins
tez-runtime-internals TEZ-3957: Report TASK_DURATION_MILLIS as a Counter for completed task… Dec 11, 2018
tez-runtime-library TEZ-3998. Allow CONCURRENT edge property in DAG construction and intr… Nov 20, 2018
tez-tests TEZ-4052. Fit dot files ASF License issues - part 2 (Jonathan Eagles … Mar 14, 2019
tez-tools TEZ-3988. Update snapshot version in master to 0.10.1-SNAPSHOT Sep 14, 2018
tez-ui
.gitignore
.travis.yml
BUILDING.txt
INSTALL.md
KEYS TEZ-4003. Add gopalv@apache.org to KEYS file (Gopal V via jeagles) Oct 9, 2018
LICENSE.txt TEZ-595. Add Notice, Licence, Changes files. (hitesh) Nov 11, 2013
NOTICE.txt
README.md TEZ-2170. Incorrect its in README.md. (Jakob Homan via hitesh) Mar 4, 2015
Tez_DOAP.rdf TEZ-4031. Support tez gitbox migration (Jonathan Eagles via kshukla) Mar 19, 2019
pom.xml

README.md

Apache Tez

Apache Tez is a generic data-processing pipeline engine envisioned as a low-level engine for higher abstractions such as Apache Hadoop Map-Reduce, Apache Pig, Apache Hive etc.

At its heart, tez is very simple and has just two components:

  • The data-processing pipeline engine where-in one can plug-in input, processing and output implementations to perform arbitrary data-processing. Every 'task' in tez has the following:
  • Input to consume key/value pairs from.
  • Processor to process them.
  • Output to collect the processed key/value pairs.
  • A master for the data-processing application, where-by one can put together arbitrary data-processing 'tasks' described above into a task-DAG to process data as desired. The generic master is implemented as a Apache Hadoop YARN ApplicationMaster.
You can’t perform that action at this time.