Skip to content

v0.8.0

Choose a tag to compare

@oritwoen oritwoen released this 31 Dec 18:57
· 34 commits to main since this release

0.8.0 is the next minor release.

👀 Highlights

Persistent storage for TB-scale key analysis! 🎉

📦 Parquet Storage Backend

Store generated keys in Apache Parquet format for efficient analysis of massive datasets:

  • StorageBackend trait - pluggable storage architecture
  • Arrow schema - columnar format for fast analytics
  • Automatic chunk rotation - configurable by record count or file size
  • Hive-style partitioning - organize by transform type and date
  • CLI integration - simple --storage ./results flag
vuke generate --storage ./results --transform milksad range --start 1 --end 1000000

✅ Upgrading

cargo install vuke

👉 Changelog

compare changes

🚀 Enhancements

  • storage: Add StorageBackend trait (#41)
  • storage: Add Arrow schema for results (#42)
  • storage: Implement ParquetBackend (#43)
  • storage: Add automatic chunk rotation (#44)
  • storage: Add Hive-style partitioning (#46)
  • storage: Add --storage CLI flag (#47)

❤️ Contributors