This repository provides examples on how to use Greenplum with GPHDFS to access external data such as HDFS on Cloudera or any Hadoop distributions
- How to read simple text file and write text file on HDFS (Cloudera distribution)
- How to read parquet file on HDFS (Cloudera distribution) - TBD
The Greenplum Database (GPDB) is an advanced, fully featured, open source data warehouse. It provides powerful and rapid analytics on petabyte scale data volumes. Uniquely geared toward big data analytics, Greenplum Database is powered by the world’s most advanced cost-based query optimizer delivering high analytical query performance on large data volumes. Greenplum Pivotal Greenplum