Python Client for WebHDFS REST API

WebHDFS Python Client Implementation

WebHDFS is a REST API for HDFS. webhdfs-py was developed to make WebHDFS accessible from Python. The library can be installed via easy_install or pip:

easy_install webhdfs

webhdfs-py has no further dependencies and relies solely on the Python standard library. Similar to the Python os module, it provides basic capabilities such as creating, listing, and deleting directories and files.
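To illustrate what these operations look like at the REST level, here is a minimal sketch that builds WebHDFS request URLs using only the standard library. It is not webhdfs-py's own API: the helper names `webhdfs_url` and `send`, the host `namenode.example.com`, the user `hdfs`, and port 50070 (the usual Hadoop 2.x NameNode HTTP port) are assumptions made for illustration. The sketch is written for Python 3.

```python
import json
import urllib.request
from urllib.parse import quote, urlencode


def webhdfs_url(host, port, path, op, user=None, **params):
    """Build a URL of the form http://host:port/webhdfs/v1/<path>?op=<OP>.

    Hypothetical helper, not part of webhdfs-py.
    """
    query = {"op": op}
    if user is not None:
        # Simple (non-Kerberos) authentication passes the user as a parameter.
        query["user.name"] = user
    query.update(params)
    # quote() keeps "/" intact, so an absolute HDFS path stays a path.
    return "http://%s:%d/webhdfs/v1%s?%s" % (host, port, quote(path), urlencode(query))


def send(method, url):
    """Send one request and decode the JSON reply (requires a reachable NameNode)."""
    request = urllib.request.Request(url, method=method)
    with urllib.request.urlopen(request) as response:
        return json.load(response)


# HTTP method and URL for each basic directory operation (host and user are assumed).
HOST, PORT = "namenode.example.com", 50070
list_dir = ("GET", webhdfs_url(HOST, PORT, "/tmp", "LISTSTATUS"))
make_dir = ("PUT", webhdfs_url(HOST, PORT, "/tmp/demo", "MKDIRS", user="hdfs"))
remove_dir = ("DELETE", webhdfs_url(HOST, PORT, "/tmp/demo", "DELETE",
                                    user="hdfs", recursive="true"))
```

Against a running cluster, `send(*list_dir)` would issue the listing request. Nothing is sent when the sketch is merely imported.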

Hadoop configuration

Supported Hadoop version: 2.x (including 2.4). Tested with HDP 2.1.

Ensure that WebHDFS is enabled in hdfs-site.xml. The relevant property is:

<property>
    <name>dfs.webhdfs.enabled</name>
    <value>true</value>
</property>

See http://hadoop.apache.org/docs/stable/hadoop-project-dist/hadoop-hdfs/WebHDFS.html for details.
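A quick way to verify this setting is to parse hdfs-site.xml and look up the property. The following is a minimal sketch using the standard library. The function name `webhdfs_enabled` and the sample document are assumptions for illustration, and the sketch assumes the usual layout of `<property>` elements inside a `<configuration>` root.

```python
import xml.etree.ElementTree as ET


def webhdfs_enabled(hdfs_site_xml):
    """Return True/False for dfs.webhdfs.enabled, or None if it is not set.

    Hypothetical helper; expects the XML content as a string.
    """
    root = ET.fromstring(hdfs_site_xml)
    for prop in root.iter("property"):
        name = (prop.findtext("name") or "").strip()
        if name == "dfs.webhdfs.enabled":
            value = (prop.findtext("value") or "").strip()
            return value.lower() == "true"
    # Property absent: the effective default depends on the Hadoop release.
    return None


# Sample configuration (assumed content, mirroring the property above).
SAMPLE = """
<configuration>
  <property>
    <name>dfs.webhdfs.enabled</name>
    <value>true</value>
  </property>
</configuration>
"""

enabled = webhdfs_enabled(SAMPLE)
```

For a real cluster, read the file first, for example `webhdfs_enabled(open(path).read())`, where `path` points to the hdfs-site.xml of your installation.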

Limitations

* Kerberos security is not supported.