
aws-cost-explorer-data-collector

A serverless stack that collects data from the AWS Cost Explorer API, publishes it as Parquet files to Amazon S3, and serves it as an AWS Glue table for further queries

Stack composition

  • IAM service role (Lambda)
  • AWS Lambda function
  • Amazon EventBridge rule
  • AWS Glue database

Journey (process workflow)

data-collector-journey.png

Requirements

  • AWS Data Wrangler and AWS Lambda Powertools deployed as Lambda layers (it is recommended to deploy these packages using the commons-lambda-layers repository)
  • AWS CLI installed
  • IAM user configured in the AWS CLI with permissions to:
    • upload files to an Amazon S3 bucket (to store the zipped packages)
    • create AWS Lambda functions
    • create AWS Glue databases
    • create Amazon EventBridge rules
    • create IAM roles

How the solution operates

Every day at 9 AM, the EventBridge rule fires and invokes the Lambda function responsible for the data collection
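A daily 9 AM trigger is expressed in EventBridge as a cron schedule expression. As an illustrative sketch (the rule name below is hypothetical, not the one used by this stack), the rule could be created with boto3 like so:

```python
def build_schedule_rule(rule_name: str) -> dict:
    """Build the put_rule parameters for a daily 9 AM (UTC) trigger."""
    return {
        "Name": rule_name,
        # EventBridge cron fields: minute hour day-of-month month day-of-week year
        "ScheduleExpression": "cron(0 9 * * ? *)",
        "State": "ENABLED",
    }

# Applying it would look like:
#   import boto3
#   events = boto3.client("events")
#   events.put_rule(**build_schedule_rule("cost-explorer-daily-trigger"))
```

Note that EventBridge cron expressions are evaluated in UTC, so "9 AM" here means 9 AM UTC unless the schedule is adjusted for your time zone.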

The data collection uses the AWS SDK for Python (boto3), while the data transformation, writing to S3, and Glue table management rely on AWS Data Wrangler
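As a sketch of the kind of Cost Explorer request the collector sends (the helper function, metric choice, and dates below are illustrative assumptions, not code taken from this repository), a daily query grouped by the monitored tag could be built like this:

```python
def build_cost_request(monitored_tag: str, start_date: str, end_date: str) -> dict:
    """Build the kwargs for boto3's ce.get_cost_and_usage call,
    grouping daily costs by the monitored tag."""
    return {
        "TimePeriod": {"Start": start_date, "End": end_date},
        "Granularity": "DAILY",
        "Metrics": ["UnblendedCost"],  # assumed metric; Cost Explorer supports others
        "GroupBy": [{"Type": "TAG", "Key": monitored_tag}],
    }

# Calling the API and persisting the results would then look roughly like:
#   import boto3
#   ce = boto3.client("ce")
#   response = ce.get_cost_and_usage(
#       **build_cost_request("SolutionName", "2021-06-01", "2021-06-15"))
#   ...after shaping the response into a DataFrame, AWS Data Wrangler can
#   write it as Parquet and register the Glue table, e.g.
#   wr.s3.to_parquet(df, path=..., dataset=True, database=..., table=...)
```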

The Lambda function expects the name of the tag to be monitored by the Cost Explorer Data Collector solution. Here's an example of the JSON payload used to invoke the function:

{
    "monitoredTag": "SolutionName"
}

If you want to collect data for a specific period of time, you may additionally include the attributes 'startDate' and 'endDate':

{
    "startDate": "2021-06-01",
    "endDate": "2021-06-15",
    "monitoredTag": "SolutionName"
}

If 'startDate' and 'endDate' are not provided, the function collects data for the previous day (D-1 through D0).
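That default can be sketched as follows (the helper name is hypothetical, not the repository's actual code): when the event omits the date attributes, the window resolves to yesterday through today.

```python
from datetime import date, timedelta

def resolve_period(event: dict) -> tuple[str, str]:
    """Return (startDate, endDate) from the event, defaulting to D-1 -> D0."""
    today = date.today()
    start = event.get("startDate", (today - timedelta(days=1)).isoformat())
    end = event.get("endDate", today.isoformat())
    return start, end

# Explicit dates are passed through unchanged:
#   resolve_period({"startDate": "2021-06-01", "endDate": "2021-06-15",
#                   "monitoredTag": "SolutionName"})
#   -> ("2021-06-01", "2021-06-15")
# With only the tag, the window falls back to yesterday -> today.
```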

How to deploy the stack

There are two ways to deploy the stack:

  1. Editing the shell scripts (recommended for new users)
  2. Passing the parameters as arguments in the shell invocation

Step 1

  • Edit the shell script cloudformation_package.sh, setting the name of the bucket where the zipped packages will be uploaded and the AWS CLI profile to use, OR pass these parameters as arguments
  • Run the shell script:
    $ sh commons/cloudformation_package.sh
    

Step 2

  • Edit the shell script cloudformation_deploy.sh, setting the name of the bucket where the packaged template will be uploaded, the AWS CLI profile to use, the CloudFormation stack name, and the chosen region, OR pass these parameters as arguments
  • Run the shell script:
    $ sh commons/cloudformation_deploy.sh
    
