Python interface to Hive and Presto.
Python Shell
Latest commit a37bf7a Oct 21, 2016 @fbertsch fbertsch committed with jingw Add presto query cancellation
Closes #83



PyHive is a collection of Python DB-API and SQLAlchemy interfaces for Presto and Hive.



from pyhive import presto
cursor = presto.connect('localhost').cursor()
cursor.execute('SELECT * FROM my_awesome_data LIMIT 10')
print cursor.fetchone()
print cursor.fetchall()

DB-API (asynchronous)

from pyhive import hive
from TCLIService.ttypes import TOperationState
cursor = hive.connect('localhost').cursor()
cursor.execute('SELECT * FROM my_awesome_data LIMIT 10', async=True)

status = cursor.poll().operationState
while status in (TOperationState.INITIALIZED_STATE, TOperationState.RUNNING_STATE):
    logs = cursor.fetch_logs()
    for message in logs:
        print message

    # If needed, an asynchronous query can be cancelled at any time with:
    # cursor.cancel()

    status = cursor.poll().operationState

print cursor.fetchall()


First install this package to register it with SQLAlchemy (see

from sqlalchemy import *
from sqlalchemy.engine import create_engine
from sqlalchemy.schema import *
engine = create_engine('presto://localhost:8080/hive/default')
logs = Table('my_awesome_data', MetaData(bind=engine), autoload=True)
print select([func.count('*')], from_obj=logs).scalar()

Note: query generation functionality is not exhaustive or fully tested, but there should be no problem with raw SQL.

Passing session configuration

hive.connect('localhost', configuration={'hive.exec.reducers.max': '123'})
presto.connect('localhost', session_props={'query_max_run_time': '1234m'})
# SQLAlchemy
    connect_args={'configuration': {'hive.exec.reducers.max': '123'}},


Install using

  • pip install pyhive[hive] for the Hive interface and
  • pip install pyhive[presto] for the Presto interface.

PyHive works with

  • Python 2.7
  • For Presto: Presto install
  • For Hive: HiveServer2 daemon

There's also a third party Conda package.




  • Please fill out the Dropbox Contributor License Agreement at and note this in your pull request.
  • Changes must come with tests, with the exception of trivial things like fixing comments. See .travis.yml for the test environment setup.


Run the following in an environment with Hive/Presto:

virtualenv --no-site-packages env
source env/bin/activate
pip install -e .
pip install -r dev_requirements.txt

WARNING: This drops/creates tables named one_row, one_row_complex, and many_rows, plus a database called pyhive_test_database.