GitHub - djhworld/bigdan-table: daniel's bigtable implementation

I enjoyed reading the BigTable paper so much that I decided to try implementing some of its ideas.

Concepts worked on:

Tablet
- ❌ Scan/filter entire tablet
- ❌ Scan/filter entire row
- ✅ Timestamped values
- ✅ Read/Write to memtable
- 💡 Commit log
  - Still need to figure out how this is stored and how to checkpoint it.
- ✅ Read/Flush to SSTable
  - ✅ Amazon S3 supported
  - ✅ Local filesystem
- ✅ Tablet compaction
- ❌ Tablet split

SSTable
- ✅ Blocks compressed with GZIP
- ✅ Footer compressed with GZIP
- ✅ Configurable block size
- ✅ Configurable compression (GZIP, SNAPPY, Uncompressed supported)
- ✅ Storage agnostic

TabletServer
- ✅ Each tablet responsible for a row range
- ❌ Column family locality
- in progress
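
Two of the Tablet items above, timestamped values and memtable reads/writes, can be illustrated with a small in-memory structure. This is only a sketch under assumptions: the class `MemtableSketch`, its method names, and the `row + '\0' + column` key encoding are hypothetical and are not taken from this repository's source. It keeps every version of a cell sorted newest-first, so a read can ask for either the latest value or the value as of a given timestamp.

```java
import java.util.Comparator;
import java.util.Map;
import java.util.Optional;
import java.util.TreeMap;

// Hypothetical sketch: none of these names come from the bigdan-table source.
final class MemtableSketch {
    // (row, column) key -> versions of the cell, ordered newest timestamp first.
    private final TreeMap<String, TreeMap<Long, byte[]>> cells = new TreeMap<>();

    // Assumed key encoding: row and column joined by a NUL separator.
    private static String key(String row, String column) {
        return row + '\0' + column;
    }

    // Store one timestamped version of a cell.
    void put(String row, String column, long timestamp, byte[] value) {
        cells.computeIfAbsent(key(row, column),
                k -> new TreeMap<Long, byte[]>(Comparator.<Long>reverseOrder()))
             .put(timestamp, value);
    }

    // Return the most recent version of a cell, if any.
    Optional<byte[]> get(String row, String column) {
        TreeMap<Long, byte[]> versions = cells.get(key(row, column));
        if (versions == null || versions.isEmpty()) {
            return Optional.empty();
        }
        return Optional.of(versions.firstEntry().getValue());
    }

    // Return the newest version whose timestamp is <= asOf, if any.
    Optional<byte[]> getAsOf(String row, String column, long asOf) {
        TreeMap<Long, byte[]> versions = cells.get(key(row, column));
        if (versions == null) {
            return Optional.empty();
        }
        // With a reversed comparator, ceilingEntry yields the largest timestamp <= asOf.
        Map.Entry<Long, byte[]> entry = versions.ceilingEntry(asOf);
        return entry == null ? Optional.empty() : Optional.of(entry.getValue());
    }
}
```

Because the outer map is sorted by key, flushing such a structure to an SSTable would amount to iterating it in order, which is one reason a sorted map is a natural fit here.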

SSTable

Blocks have a fixed length, and the block size is stored in the header. Every block, as well as the footer, is compressed with the configured compression algorithm (GZIP, Snappy, or uncompressed).
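
To make the layout concrete, here is a minimal sketch of writing such a file with GZIP. The exact byte layout is an assumption, not the format this project uses: a 4-byte block size as the header, each block prefixed by its compressed length, a footer holding the block offsets, and a trailing 4-byte footer length. The class `SSTableLayoutSketch` and its methods are hypothetical names. Note that in this sketch "fixed length" refers to the uncompressed block size, and the final block may be shorter.

```java
import java.io.ByteArrayInputStream;
import java.io.ByteArrayOutputStream;
import java.io.DataOutputStream;
import java.io.IOException;
import java.io.UncheckedIOException;
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;
import java.util.zip.GZIPInputStream;
import java.util.zip.GZIPOutputStream;

// Hypothetical sketch: class name, method names, and byte layout are assumptions.
final class SSTableLayoutSketch {

    static byte[] gzip(byte[] raw) {
        try {
            ByteArrayOutputStream buf = new ByteArrayOutputStream();
            try (GZIPOutputStream gz = new GZIPOutputStream(buf)) {
                gz.write(raw);
            }
            return buf.toByteArray();
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
    }

    static byte[] gunzip(byte[] compressed) {
        try (GZIPInputStream gz = new GZIPInputStream(new ByteArrayInputStream(compressed))) {
            return gz.readAllBytes(); // requires Java 9+
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
    }

    // Assumed layout: [blockSize][len, gzip(block)]...[gzip(footer)][footerLen]
    static byte[] write(byte[] data, int blockSize) {
        try {
            ByteArrayOutputStream file = new ByteArrayOutputStream();
            DataOutputStream out = new DataOutputStream(file);
            out.writeInt(blockSize); // header: uncompressed block size

            List<Integer> offsets = new ArrayList<>();
            for (int start = 0; start < data.length; start += blockSize) {
                int end = Math.min(start + blockSize, data.length);
                byte[] block = gzip(Arrays.copyOfRange(data, start, end));
                offsets.add(out.size()); // where this block begins in the file
                out.writeInt(block.length);
                out.write(block);
            }

            // Footer: block count followed by each block's offset, then compressed.
            ByteArrayOutputStream footerRaw = new ByteArrayOutputStream();
            DataOutputStream footerOut = new DataOutputStream(footerRaw);
            footerOut.writeInt(offsets.size());
            for (int offset : offsets) {
                footerOut.writeInt(offset);
            }
            byte[] footer = gzip(footerRaw.toByteArray());
            out.write(footer);
            out.writeInt(footer.length); // trailer so a reader can locate the footer
            out.flush();
            return file.toByteArray();
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
    }
}
```

Writing to a byte array rather than a file keeps the sketch storage agnostic: the same bytes could then be handed to a local file or an S3 upload.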

The reader will:

- Read the header
- Read the footer

Blocks are read when a value is requested, and cached if appropriate.
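
The lazy read-and-cache behaviour can be sketched as a small cache in front of a block loader. Everything here is an assumption for illustration: the class `BlockCacheSketch`, the `loader` function (standing in for "fetch and decompress block N from storage"), and the least-recently-used eviction policy, which is just one plausible reading of "cached if appropriate".

```java
import java.util.LinkedHashMap;
import java.util.Map;
import java.util.function.IntFunction;

// Hypothetical sketch: names and the LRU policy are assumptions, not the project's code.
final class BlockCacheSketch {
    private final IntFunction<byte[]> loader; // stands in for read + decompress
    private final Map<Integer, byte[]> cache;
    int loads = 0; // number of loader calls, exposed only for illustration

    BlockCacheSketch(int maxBlocks, IntFunction<byte[]> loader) {
        this.loader = loader;
        // An access-ordered LinkedHashMap evicts the least recently used block.
        this.cache = new LinkedHashMap<Integer, byte[]>(16, 0.75f, true) {
            @Override
            protected boolean removeEldestEntry(Map.Entry<Integer, byte[]> eldest) {
                return size() > maxBlocks;
            }
        };
    }

    // Return the block at the given index, loading it only on a cache miss.
    byte[] block(int index) {
        byte[] cached = cache.get(index);
        if (cached != null) {
            return cached;
        }
        byte[] loaded = loader.apply(index);
        loads++;
        cache.put(index, loaded);
        return loaded;
    }
}
```

With this shape, opening an SSTable only costs the header and footer reads; block I/O and decompression happen on the first request for a value in that block.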
