Overview

Spade format is an experimental Column-oriented data format (somewhat similar to Column-oriented DBMS concept), aimed at adequate encoding and decoding speeds to allow use for "in-flight" data, and not just for "at rest" data like other existing formats such as ORC and Parquet.

Aside from focus on more CPU-efficient encoding/decoding, differences include:

Standard interfaces to expose data to processing as row-oriented
- Processing systems need not be aware of underlying column format (can operate on streams of records/rows/Objects)
- Processing systems can still do efficient projection at column level
Both textual and binary encodings for trouble-shooting, interoperability
- Since super-structure is the same, efficient transcoding possible

Differences to ORC, Parquet

Some of the differences were already listed; other differences stemming from different goals, approach include:

Smaller batch size: instead of targeting storage in tens-of-megs range (per encoded block), goal is to keep chunks in kilobyte range (less than a meg)
- limits amount of memory needed per reader/writer process
- lower latency between start of a chunk, production of a chunk
With textual format, possible to operate (both decode AND encode) from platforms that do not handle binary well, such as Javascript.

More

For more information (such as, say, format description...) check out Wiki.

Name		Name	Last commit message	Last commit date
Latest commit History 20 Commits
samples		samples
src		src
.gitignore		.gitignore
README.md		README.md
pom.xml		pom.xml
run-comparison.sh		run-comparison.sh
run-speed.sh		run-speed.sh

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Overview

Differences to ORC, Parquet

More

About

Uh oh!

Releases

Packages

Languages

cowtowncoder/jackson-dataformat-spade

Folders and files

Latest commit

History

Repository files navigation

Overview

Differences to ORC, Parquet

More

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages