The Mozilla Release Engineering team is producing some estimates of our Continous Integration costs.
This repository contains a work-in-progress for the scripts used to gather and process the data.
pushlog_scanner.py and its dependencies are turned into a
.zip file for use in an AWS
Lambda function. This gives us a place to run the script periodically without worrying about
other infrastructure. The lambda function's role gives it the appropriate access to an S3
bucket for the results.
Updating the Lambda Function
- Clone the repository
- Ensure Python 3.6 is installed (AWS Lambda currently only supports 2.7 and 3.6)
create_lambda_func_in_docker.shIf running on Linux,
create_lambda_func.shwill do. Some modules require compilation, and the target platform is Linux, so installing natively on Mac/Windows is not going to work.
- Copy the resulting zip file to S3:
aws s3 cp measuring_ci.zip s3://mozilla-releng-metrics/measuring_ci.zip
- Visit the Lambda function's config page
(If needed: ARN -
- Under 'Function code' choose a 'Code entry type' of 'Upload a file from Amazon S3' and paste the above s3 url into the box.
- Ensure the Handler is set correctly if not using lambda_handler() in lambda_function.py
- Under 'Basic Settings' ensure the Memory usage is at 512Mb and Timeout is at least 2 minutes.
- Click 'Save' at the top of the page
- Test the lambda function using the 'Test' button. The test event itself doesn't matter as we're not using its data. If a test event is not defined, the basic 'Hello world' template will do.
The direct querying of a parquet file in S3 is a short-term step in order to get data visibility. Longer term we will want to use the generic data ingestion method provided by the data team.