Thrift based Client library for Hadoop Distributed FileSystem (HDFS) <http://hadoop.apache.org/hdfs>
-
Updated
Apr 26, 2011 - Python
Thrift based Client library for Hadoop Distributed FileSystem (HDFS) <http://hadoop.apache.org/hdfs>
Python wrapper to access Hadoop HDFS REST API
A python library to submit spark job in yarn cluster at different distributions (Currently CDH, HDP)
Generate a script to setup system for practices: creating users, www user folder, mysql account, firewall access, opennebula users...
Yelp-Academic_Dataset_Analysis_in HDFS on the Fladoop cluster
An implementation of KMeans text clustering made in Spark.
MapReduce, Spark, Hadoop, PostgreSQL, Cluster Management
State of the Union dataset
Mini projet realisé au sein de la Faculté de Sciences de Kenitra pour le cours de Technologies du Big Data(Master Big Data et Cloud Computing)
Big Data Analysis
Sentiment Analysis and Data Visualization
Add a description, image, and links to the hdfs topic page so that developers can more easily learn about it.
To associate your repository with the hdfs topic, visit your repo's landing page and select "manage topics."