Skip to content
Branch: master
Find file History
jasonxebia Update
Fixing broken links
Latest commit 72d4e9e Sep 20, 2019

Data Lake Solution on Amazon EC2


AWS offers a sample Data Lake Solution that shows how you can store both structured and unstructured data in a centralized repository on Amazon Elastic Compute Cloud (EC2), which provides resizable compute capacity in the cloud.

Use this blueprint to deploy the sample Data Lake Solution on EC2 using CloudFormation, which defines the infrastructure that will run on EC2. The release template that the blueprint generates will provision an EC2 instance, deploy the Data Lake Solution to it, and optionally tear the instance down.

Before you get started

If you're new to XebiaLabs blueprints, check out:


To use this blueprint, run xl blueprint and select:


Tools and technologies

This blueprint includes the following tools and technologies:

Minimum required versions

This blueprint version requires at least the following versions of the specified tools to work properly:

XL Release: Version 9.0.0 XL Deploy: Version 9.0.0 XL CLI: Version 9.0.0


To run the YAML that this blueprint generates, you need:

  • XebiaLabs Release Orchestration and Deployment Automation up and running
  • Access to an AWS account to deploy the application to
  • Email address where Data Lake administrator credentials can be sent

Information required

This blueprint requires:

  • AWS credentials
  • An AWS region
  • An email address for Data Lake administrator credentials


This blueprint will output:

  • Sample Data Lake Solution
  • Release template
  • AWS CloudFormation templates
  • A docker-compose setup for XL Release & XL Deploy

Tips and tricks

  • The YAML that the blueprint generates includes optional steps to remove the application and deprovision the infrastructure.


  • Cloud
  • AWS
You can’t perform that action at this time.