Skip to content

kongyew/greenplum-gphdfs-examples

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

22 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Readme

This repository provides examples on how to use Greenplum with GPHDFS to access external data such as HDFS on Cloudera or any Hadoop distributions

Build Status

Use Case

  1. How to read simple text file and write text file on HDFS (Cloudera distribution)
  2. How to read parquet file on HDFS (Cloudera distribution) - TBD

Reference:

Greenplum

The Greenplum Database (GPDB) is an advanced, fully featured, open source data warehouse. It provides powerful and rapid analytics on petabyte scale data volumes. Uniquely geared toward big data analytics, Greenplum Database is powered by the world’s most advanced cost-based query optimizer delivering high analytical query performance on large data volumes. Greenplum Pivotal Greenplum