zaratsian/hive_udf

Apache Hive - UDF example using Python scripts

This repo shows how to call Python scripts from Hive as UDFs. More specifically, it shows how to distribute custom Python packages to the cluster nodes where the data is processed, since those nodes might not always have the required packages installed.

This repo contains two examples:

1.)  hive_udf_addfile (shows how to call a simple Python script as a Hive UDF, using "ADD FILE")

To test, run this command from a shell on a machine with the Hive CLI: hive -f hive_udf_addfile.hql

2.)  hive_udf_addarchive (shows how to use a Python environment, with custom packages, in a Hive UDF, using "ADD ARCHIVE")

To test, run this command from a shell on a machine with the Hive CLI: hive -f hive_udf_addarchive.hql
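
For orientation, here is a minimal sketch of the kind of Python script that Hive can stream rows through. It is not the script from this repo: the function names and the per-row logic (upper-casing the first column and appending its length) are hypothetical. It assumes Hive's default streaming format, where each row arrives on stdin as one tab-separated line, NULL is encoded as \N, and each output line is read back as one tab-separated row.

```python
import sys

HIVE_NULL = r"\N"  # assumed: Hive's default text encoding of NULL in streamed rows


def transform_row(fields):
    """Hypothetical per-row logic: upper-case the first column and append its length."""
    first = fields[0]
    if first == HIVE_NULL:
        # Pass NULL through so Hive still sees NULL in both output columns.
        return [HIVE_NULL, HIVE_NULL]
    return [first.upper(), str(len(first))]


def run(stream_in, stream_out):
    """Read tab-separated rows from stream_in, write transformed rows to stream_out."""
    for line in stream_in:
        fields = line.rstrip("\n").split("\t")
        stream_out.write("\t".join(transform_row(fields)) + "\n")


if __name__ == "__main__":
    # Hive pipes rows in on stdin and reads the results from stdout.
    run(sys.stdin, sys.stdout)
```

A script like this would typically be shipped with ADD FILE and invoked with a SELECT TRANSFORM (...) USING '...' AS (...) query; the .hql files in this repo contain the actual statements.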
