Skip to content

Converts binary parquet from a pipe into JSON outputted to STDOUT

License

Notifications You must be signed in to change notification settings

rushton/parquet-dump

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

47 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Build Status

Parquet Dump

Converts binary parquet data from a pipe into JSON outputted to STDOUT

#> cat /tmp/foobar/*.parquet | java -jar target/scala-2.11/Parquet-Dump-assembly-1.0.0.jar
{"a":1,"b":null}
{"a":1,"b":null}
{"a":1,"b":null}
{"a":1,"b":null}
{"a":1,"b":null}
...

counting records per block

cat /tmp/foobar/*.parquet | java -jar target/scala-2.11/Parquet-Dump-assembly-1.0.0.jar --counts
5
10
15

Install

Download the latest jar from the releases page

Build

sbt assembly

Run

cat /tmp/myparquet/*.parquet | java -jar Parquet-Dump-assembly-1.0.0.jar

About

Converts binary parquet from a pipe into JSON outputted to STDOUT

Resources

License

Stars

Watchers

Forks

Packages

No packages published