Skip to content

v0.1 — Initial Release

Latest

Choose a tag to compare

@bradcoles-dev bradcoles-dev released this 22 Jun 08:19
· 9 commits to main since this release

delta-doctor v0.1

Seven production-ready PySpark notebooks for Delta table maintenance on Microsoft Fabric.

Notebooks

Notebook Responsibility
doctor_prevention_session_config Spark session baseline by layer
doctor_diagnosis_table_health Health report across all tables in a Lakehouse
doctor_treatment_table_maintenance OPTIMIZE + VACUUM on a single table
doctor_treatment_maintenance_orchestrator OPTIMIZE + VACUUM across all tables in a Lakehouse
doctor_prevention_set_table_properties Delta table properties + liquid clustering on a single table
doctor_prevention_set_properties_orchestrator Table properties across all tables in a Lakehouse
doctor_treatment_rebaseline_orchestrator One-off REORG TABLE APPLY (PURGE) + OPTIMIZE to reset file sizes to the layer target

Getting Started

  1. Download all seven .ipynb files attached to this release
  2. In the Fabric Data Engineering experience, select Import notebook and upload the files
  3. Follow the Getting Started guide in the README

Docs

Full documentation at deltadoctor.dev