# 🚀 Databricks Getting Started - Essential Demos Installation

Welcome to Databricks! This notebook will help you get started by installing and exploring the most important demos that showcase key platform capabilities.

## What You'll Install

This notebook installs 5 essential **dbdemos** that every new Databricks user should explore:

* **Delta Lake** - Learn about the foundation of the Databricks Lakehouse with ACID transactions, time travel, and data versioning
* **dbt on Databricks** - Discover how to orchestrate and run your dbt jobs seamlessly on Databricks
* **Auto Loader** - Master incremental data ingestion from cloud storage with automatic schema evolution
* **AI/BI Portfolio Assistant** - Explore advanced analytics and AI capabilities with dashboards and Genie
* **SQL Warehouse** - Understand data warehousing features including identity columns, primary/foreign keys, and stored procedures

## Prerequisites

* Databricks workspace access
* A cluster you can attach this notebook to, with permission to install libraries and to create folders, tables, and other resources
* Internet connectivity for package installation

## How to Use This Notebook

1. Run each cell sequentially
2. Each demo installation may take 2-5 minutes
3. After installation, explore the generated folders in your workspace
4. Follow the README files in each demo folder for detailed walkthroughs

---

**⚠️ Note**: Demo installations will create new folders, tables, and resources in your workspace. Make sure you have sufficient permissions and storage quota.

In [0]:
# Install the dbdemos package
# This package provides pre-built demos showcasing Databricks capabilities
%pip install dbdemos --quiet

# Restart Python to ensure the package is properly loaded
dbutils.library.restartPython()

In [0]:
import dbdemos

# Display the installed dbdemos version
print(f"dbdemos version: {dbdemos.__version__}")
print("\n📋 Installing 5 essential demos for new Databricks users...")
print("Each installation may take 2-5 minutes depending on demo complexity.")
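
As an alternative to running the per-demo cells one by one, the five installs can be driven from a single loop that records failures instead of stopping at the first one. This is a minimal sketch, not part of dbdemos: `install_all` and `DEMOS` are hypothetical names introduced here, and the only dbdemos call assumed is `dbdemos.install(name, **kwargs)` exactly as it is used elsewhere in this notebook.

```python
# Hypothetical helper (not part of dbdemos): run several installs and
# collect failures so that one broken demo does not block the others.
def install_all(install_fn, demos):
    """Call install_fn(name, **kwargs) for each entry; return (installed, failed)."""
    installed, failed = [], {}
    for name, kwargs in demos:
        try:
            install_fn(name, **kwargs)
            installed.append(name)
        except Exception as exc:  # keep going and report the error at the end
            failed[name] = str(exc)
    return installed, failed


# Demo names and arguments are the ones used in the cells of this notebook.
DEMOS = [
    ("delta-lake", {}),
    ("dbt-on-databricks", {}),
    ("auto-loader", {}),
    ("aibi-portfolio-assistant",
     {"catalog": "main", "schema": "dbdemos_aibi_fsi_portfolio_assistant"}),
    ("sql-warehouse", {}),
]
```

In a notebook where `dbdemos` is already imported, this would be called as `installed, failed = install_all(dbdemos.install, DEMOS)`, after which `failed` maps each demo that raised an exception to its error message. Passing the install function in as a parameter keeps the helper easy to try out with a stub before touching the workspace.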

## 🏗️ Demo 1: Delta Lake - The Foundation of Databricks Lakehouse

**Delta Lake** is the storage layer that brings ACID transactions to Apache Spark and big data workloads.

### What you'll learn:
* ACID transactions for data reliability
* Time travel and data versioning
* Schema enforcement and evolution
* Optimizations like Z-ordering and auto-compaction
* Streaming and batch data processing

### Key Features Demonstrated:
* Creating Delta tables
* Handling schema changes
* Time travel queries
* Merge operations (UPSERT)
* Performance optimizations
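
To give a feel for two of the features listed above before opening the demo, here is a rough sketch of what a merge (upsert) and a time travel query typically look like when issued from Python. The helper names `upsert` and `read_version` and the table names in the usage note are hypothetical and are not taken from the demo; `spark` is assumed to be the session object that Databricks notebooks provide.

```python
# Hypothetical helpers; `spark` is the SparkSession available in a Databricks notebook.
def upsert(spark, target, source, key):
    """MERGE rows from `source` into the Delta table `target`, matching on `key`."""
    return spark.sql(
        f"MERGE INTO {target} AS t "
        f"USING {source} AS s "
        f"ON t.{key} = s.{key} "
        "WHEN MATCHED THEN UPDATE SET * "      # existing keys are overwritten
        "WHEN NOT MATCHED THEN INSERT *"       # new keys are appended
    )


def read_version(spark, table, version):
    """Time travel: query the table as of an earlier commit version."""
    return spark.sql(f"SELECT * FROM {table} VERSION AS OF {int(version)}")
```

For example, `upsert(spark, "main.demo.users", "main.demo.users_updates", "id")` followed by `read_version(spark, "main.demo.users", 0)` would apply the updates and then read the table as it was at version 0 (both table names are made up for illustration). `DESCRIBE HISTORY <table>` lists the versions that are available to query.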

In [0]:
# Install Delta Lake demo
print("🔄 Installing Delta Lake demo...")
dbdemos.install('delta-lake')
print("✅ Delta Lake demo installed successfully!")
print("📁 Check the 'delta-lake' folder in your workspace for notebooks and datasets.")

## 🔧 Demo 2: dbt on Databricks - Modern Data Transformation

**dbt (data build tool)** enables analytics engineers to transform data using SQL and software engineering best practices.

### What you'll learn:
* Setting up dbt projects on Databricks
* Creating dbt models and transformations
* Testing and documentation
* Orchestrating dbt jobs
* Integration with Databricks workflows

### Key Features Demonstrated:
* dbt model development
* Data quality testing
* Incremental models
* Macros and packages
* Job orchestration and scheduling

In [0]:
# Install dbt on Databricks demo
print("🔄 Installing dbt on Databricks demo...")
dbdemos.install('dbt-on-databricks')
print("✅ dbt on Databricks demo installed successfully!")
print("📁 Check the 'dbt-on-databricks' folder in your workspace for dbt projects and workflows.")

## 📥 Demo 3: Auto Loader - Incremental Data Ingestion

**Auto Loader** incrementally and efficiently processes new data files as they arrive in cloud storage.

### What you'll learn:
* Setting up Auto Loader for various file formats
* Automatic schema inference and evolution
* Handling bad records and data quality
* Monitoring and alerting
* Integration with Delta Live Tables

### Key Features Demonstrated:
* Cloud file ingestion (S3, ADLS, GCS)
* Schema evolution handling
* Checkpointing and exactly-once processing
* Error handling and dead letter queues
* Performance optimization techniques
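
As a rough preview of the pattern the demo walks through, an Auto Loader stream is usually a `cloudFiles` read followed by a checkpointed write into a Delta table. The sketch below is an assumption-laden illustration, not code from the demo: the function name `ingest_json_files` and every path or table name passed to it are hypothetical, and `spark` is assumed to be the session object provided by a Databricks notebook.

```python
# Hypothetical helper; all paths and table names are supplied by the caller.
def ingest_json_files(spark, source_path, schema_location,
                      checkpoint_location, target_table):
    """Incrementally load new JSON files from cloud storage into a Delta table."""
    stream = (
        spark.readStream.format("cloudFiles")                  # "cloudFiles" selects Auto Loader
        .option("cloudFiles.format", "json")                   # format of the incoming files
        .option("cloudFiles.schemaLocation", schema_location)  # where the inferred schema is tracked
        .option("cloudFiles.schemaEvolutionMode", "addNewColumns")
        .load(source_path)
    )
    return (
        stream.writeStream
        .option("checkpointLocation", checkpoint_location)     # remembers which files were processed
        .trigger(availableNow=True)                            # process what is there, then stop
        .toTable(target_table)
    )
```

The checkpoint location is what lets a rerun pick up only files that arrived since the previous run, and the schema location is where schema inference and evolution are tracked. Using `availableNow=True` makes the stream behave like an incremental batch job, which is convenient for trying things out.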

In [0]:
# Install Auto Loader demo
print("🔄 Installing Auto Loader demo...")
dbdemos.install('auto-loader')
print("✅ Auto Loader demo installed successfully!")
print("📁 Check the 'auto-loader' folder in your workspace for ingestion patterns and examples.")

## 🤖 Demo 4: AI/BI Portfolio Assistant - Advanced Analytics & AI

**Databricks AI/BI** combines AI with business intelligence. This demo applies it to portfolio analytics in capital markets.

### What you'll learn:
* Building AI-powered dashboards
* Using Genie for natural language queries
* Financial data analysis and modeling
* Real-time portfolio monitoring
* Advanced visualization techniques

### Key Features Demonstrated:
* AI-assisted data exploration
* Natural language to SQL with Genie
* Interactive dashboards
* Financial risk modeling
* Automated insights and alerts

**Note**: This demo is installed with an explicit catalog and schema (`main.dbdemos_aibi_fsi_portfolio_assistant`), which hold its financial services data.

In [0]:
# Install AI/BI Portfolio Assistant demo with custom catalog and schema
print("🔄 Installing AI/BI Portfolio Assistant demo...")
dbdemos.install('aibi-portfolio-assistant', catalog='main', schema='dbdemos_aibi_fsi_portfolio_assistant')
print("✅ AI/BI Portfolio Assistant demo installed successfully!")
print("📁 Check the 'aibi-portfolio-assistant' folder for dashboards and AI-powered analytics.")
print("🗄️ Data stored in: main.dbdemos_aibi_fsi_portfolio_assistant")

## 🏢 Demo 5: SQL Warehouse - Enterprise Data Warehousing

The **SQL Warehouse** demo showcases data warehousing capabilities on Databricks, including modern SQL features such as constraints, stored procedures, and control flow.

### What you'll learn:
* Identity columns and auto-incrementing keys
* Primary and foreign key constraints
* Stored procedures and functions
* Control flow with loops and conditionals
* Advanced SQL patterns and optimizations

### Key Features Demonstrated:
* Table constraints and relationships
* Stored procedure development
* Transaction management
* Performance tuning
* Data governance and security

In [0]:
# Install SQL Warehouse demo
print("🔄 Installing SQL Warehouse demo...")
dbdemos.install('sql-warehouse')
print("✅ SQL Warehouse demo installed successfully!")
print("📁 Check the 'sql-warehouse' folder for advanced SQL examples and stored procedures.")

# 🎉 Installation Complete!

Congratulations! You've successfully installed 5 essential Databricks demos. Here's what to do next:

## 📂 Explore Your New Demo Folders

Check your workspace for these new folders:
* `delta-lake/` - Delta Lake fundamentals and advanced features
* `dbt-on-databricks/` - dbt transformation workflows
* `auto-loader/` - Data ingestion patterns
* `aibi-portfolio-assistant/` - AI/BI analytics and dashboards
* `sql-warehouse/` - Advanced SQL warehousing features

## 🚀 Recommended Learning Path

1. **Start with Delta Lake** - Understanding the storage foundation
2. **Explore Auto Loader** - Learn data ingestion patterns
3. **Try dbt on Databricks** - Modern data transformation
4. **Experiment with SQL Warehouse** - Advanced SQL features
5. **Dive into AI/BI** - Advanced analytics and AI capabilities

## 📚 Additional Resources

* [Databricks Documentation](https://docs.databricks.com/)
* [Databricks Academy](https://academy.databricks.com/)
* [Community Forums](https://community.databricks.com/)
* [GitHub Examples](https://github.com/databricks)

## 💡 Tips for Success

* Each demo folder contains a README with detailed instructions
* Start with the `00-` numbered notebooks in each folder
* Don't hesitate to modify and experiment with the code
* Join the Databricks community for support and best practices

---

**Happy Learning! 🎓**
