[ci.jenkins.io] Azure billing shows huge cloud cost due to outbound bandwidth #3485

dduportal · 2023-03-31T17:40:07Z

While checking our cloud billing on Azure, we were able to pin point that ci.jenkins.io costed us ~ 1,300 $ for March in.. outbound bandwidth!

The screenshot below is:

For the ci.jenkins.io controller only resource group (1 VM, 2 disks, 1 public IP and 1 NIC in this resource group)
For the span of 1st of March 2023 -> 28 March 2023
The service "Rtn Preference: MGN" is "Internet Egress (routed via Microsoft Premium Global Network)" in https://azure.microsoft.com/en-us/pricing/details/bandwidth/

It means that there are multiple Terabytes of data sent out of ci.jenkins to outside the Azure cloud (we have around 5 $ of cross-region as we use both US East and US East 2 for the infrastructure).

Worst case on price per Gb: South America destination, $0.181 per GB means ~7182 Gb
Worst case for amount of data: EU/US, means ~ 14900 Gb

=> we need to check and understand how to control this cost.

The text was updated successfully, but these errors were encountered:

dduportal · 2023-03-31T17:49:24Z

A few elements after discussing and brainstomring (not exhaustive but great start) to analyse:

We have an Apache server in front of ci.jenkins.io: its logs are important to check.
- todo: check if datadog agent collects these logs (should be the case) and use datadog to check the amount of data Apache reports for outbound network
todo: check the "outbound network bandwidth" in datadog for the VM, to see if it report the same amount
As discussed with @MarkEWaite , it could be in the controller <-> agents communication area as we launch agents in AWS and DigitalOcean for ci.jenkins.io
- The unstash pipeline step could be a great candidate for outbound badnwidth: with a mega war at (optimistic evaluation) 100Mb, with ~200 parallel PCT steps, once a day, it is already 620 Gb of outbound data to AWS/DigitalOcean

dduportal · 2023-03-31T17:52:29Z

Proposal about the stash/unstash: using https://plugins.jenkins.io/artifact-manager-s3/ could help:

For the EKS and EC2 VM agents, that would be really suitable
Digital Ocean provides an s3-compliant storage (https://www.digitalocean.com/products/spaces), let's check if it works with this plugin
For Azure: https://plugins.jenkins.io/azure-artifact-manager/

dduportal · 2023-04-24T10:32:15Z

[ci.jenkins.io] Use Artifact Manager to store archived artifacts (and stashes) #3496 is done and we can already see an effect of it + @jglick help in the bom (that led to [ci.jenkins.io] Use Artifact Manager to store archived artifacts (and stashes) #3496 using AWS S3):

- Next step: creating a dashboard in datadog (enabled by #https://github.com//issues/3514) to measure the outbound bandwidth - Check Apache optimizations (gzip? websockets for agents? etc.)

dduportal · 2023-05-15T17:23:21Z

It seems the unusual consumption is manageable (thanks to the huge work by maintainers in bom along with the S3 Artifact management and persists to be "normal" again:

dduportal added this to the infra-team-sync-next milestone Mar 31, 2023

dduportal added azure ci.jenkins.io billing-report labels Mar 31, 2023

dduportal modified the milestones: infra-team-sync-next, infra-team-sync-2023-04-11 Apr 4, 2023

This was referenced Apr 6, 2023

ci.jenkins.io disk almost full #3492

Closed

[ci.jenkins.io] Use Artifact Manager to store archived artifacts (and stashes) #3496

Closed

Spring 2023: Decrease AWS costs #3502

Closed

dduportal modified the milestones: infra-team-sync-2023-04-11, infra-team-sync-2023-04-18, infra-team-sync-next Apr 11, 2023

jglick mentioned this issue Apr 11, 2023

Avoid large stashes jenkinsci/bom#1955

Merged

This was referenced Apr 13, 2023

chore: ensure that spot instances are used for packer builds jenkins-infra/packer-images#598

Merged

[Puppet] Datadog agent does not collect Apache logs #3514

Closed

[ci.jenkins.io] separate container agent resources between bom and other builds #3521

Closed

dduportal closed this as completed May 15, 2023

dduportal modified the milestones: infra-team-sync-next, infra-team-sync-2023-05-16 May 15, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[ci.jenkins.io] Azure billing shows huge cloud cost due to outbound bandwidth #3485

[ci.jenkins.io] Azure billing shows huge cloud cost due to outbound bandwidth #3485

dduportal commented Mar 31, 2023 •

edited

dduportal commented Mar 31, 2023

dduportal commented Mar 31, 2023

dduportal commented Apr 24, 2023

dduportal commented May 15, 2023

[ci.jenkins.io] Azure billing shows huge cloud cost due to outbound bandwidth #3485

[ci.jenkins.io] Azure billing shows huge cloud cost due to outbound bandwidth #3485

Comments

dduportal commented Mar 31, 2023 • edited

dduportal commented Mar 31, 2023

dduportal commented Mar 31, 2023

dduportal commented Apr 24, 2023

dduportal commented May 15, 2023

dduportal commented Mar 31, 2023 •

edited