Skip to content

databricks-demos/databricks-aws-demos

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

37 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Databricks on AWS - Demos

CLOUD LAB

Description

This repository contains a collection of demos illustrating the integration of Databricks with various AWS native services, focusing on data ingestion, transformation, and serving.

Table of Contents

  1. Databricks Intro
  2. Ingestion
  3. Data Transformation/Enrichment
  4. Data Serving/Consumption
  5. Prerequisites
  6. Setup and Running Demos
  7. Contributing
  8. License
  9. Contact

Databricks Intro

  • What is Databricks
  • Databricks on AWS - networking considerations

Ingestion

  • Ingesting data from S3 with Autoloader (Directory Listing and File Notification)
  • Ingesting data from a Kinesis
    • Structured Streaming
    • Kinesis Firehose + Autoloader
  • Ingesting data from an RDS Database
    • Bulk load using AWS Database Migration Service (DMS)
    • CDC using AWS Database Migration Service (DMS)

Data Transformation/Enrichment

  • Building an ETL Pipeline using Delta Live Tables
  • Building a data pipeline using Databricks Workflows
  • Reduce TCO by using AWS Graviton

Data Serving/Consumption

  • Query your Delta Lake using Amazon Athena
  • Pushing Gold data to DynamoDB for low latency use cases
  • Real-Time ML Inference using Sagemaker Serverless Endpoints
  • Real-Time ML Inference using Databricks Model Serving V2
  • Visualization using Amazon QuickSight

Prerequisites

To run these demos, you will need:

  • An AWS account with necessary permissions to create and manage resources
  • A Databricks account
  • Basic knowledge of AWS services and Databricks

Setup and Running Demos

  1. Clone this repository to your local machine.
  2. Set up your AWS and Databricks credentials.
  3. Follow the individual READMEs in each demo's folder to set up and run the demos.

Contributing

We welcome contributions to this project. Please refer to the CONTRIBUTING.md file for more details.

License

© 2023 Databricks, Inc. All rights reserved. The source in this notebook is provided subject to the Databricks License [https://databricks.com/db-license-source].

Contact

For any questions or feedback, please open an issue on this GitHub repository.


We hope you find these demos useful as you explore the capabilities of Databricks on AWS! Happy data engineering!

Project support

Please note the code in this project is provided for your exploration only, and are not formally supported by Databricks with Service Level Agreements (SLAs). They are provided AS-IS and we do not make any guarantees of any kind. Please do not submit a support ticket relating to any issues arising from the use of these projects. The source in this project is provided subject to the Databricks License. All included or referenced third party libraries are subject to the licenses set forth below.

Any issues discovered through the use of this project should be filed as GitHub Issues on the Repo. They will be reviewed as time permits, but there are no formal SLAs for support.

About

No description, website, or topics provided.

Resources

License

Security policy

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages