This repo shows examples of how to use Hive UDFs to call python scripts. More specifically, this shows how to push/distribute custom python packages to nodes of the cluster, where data is being processed (but might not always have the required packages).
1.) hive_udf_addfile (example showing how to call a simple python script as a Hive UDF, using "ADD FILE")
To test, run this cmd on Hive:
hive -f hive_udf_addfile.hql
2.) hive_udf_addarchive (example showing how to call a python environment (with custom packages) as a Hive UDF, using "ADD ARCHIVE")
To test, run this cmd on Hive:
hive -f hive_udf_addarchive.hql