Data Lake Foundation on the AWS Cloud with Apache Zeppelin, Amazon RDS, and Other AWS Services
This Quick Start deploys a data lake foundation that integrates various AWS Cloud services and components to help you migrate data to the AWS Cloud, and store, monitor, and analyze the data.
The deployment uses Amazon Simple Storage Service (Amazon S3) as a core service to store the data, and deploys Apache Zeppelin and Kibana for analyzing and visualizing the data. It also integrates with Amazon Relational Database Service (Amazon RDS), AWS Data Pipeline, Amazon Redshift, Amazon Elasticsearch Service (Amazon ES), Amazon Kinesis Firehose, and AWS CloudTrail.
The AWS CloudFormation templates included with the Quick Start automate the following:
- Deploying the data lake foundation into a new VPC
- Deploying the data lake foundation into an existing VPC
You can also use the AWS CloudFormation templates as a starting point for your own implementation.
For architectural details, best practices, step-by-step instructions, and customization options, see the deployment guide.
To post feedback, submit feature ideas, or report bugs, use the Issues section of this GitHub repo. If you'd like to submit code for this Quick Start, please review the AWS Quick Start Contributor's Kit.