Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[Packaging][Release] Reduce disk requirements for linux packaging jobs #35964

Open
raulcd opened this issue Jun 7, 2023 · 2 comments
Open

[Packaging][Release] Reduce disk requirements for linux packaging jobs #35964

raulcd opened this issue Jun 7, 2023 · 2 comments

Comments

@raulcd
Copy link
Member

raulcd commented Jun 7, 2023

Describe the enhancement requested

Currently we have to perform some disk cleanup on the default GH runners system in order to build Linux packaging jobs, otherwise we get out of space.
Example of occurrences in the past:

On the last issue we temporary removed several things from /usr/local/bin, /usr/local/share and /opt/hostedtoolcache.

There are some ideas for improvements proposed on this comment #35947 (comment)
From the comment

Short term workarounds:

  1. Use self-hosted runner like arm64
  2. Disable some large components such as S3 and GCP

Long term solution:

  1. Build deb packages for large dependencies such as aws-sdk-cpp and google-cloud-cpp separately and use them to build our deb packages

FYI: https://github.com/datafusion-contrib/datafusion-c/ provides deb packages at https://apache.jfrog.io/ui/native/arrow/ubuntu/pool/jammy/main/d/datafusion-c/ . Apache Arrow C++ dependencies can use the same approach.

One more idea: #35961

Component(s)

Packaging, Release

@lriggs
Copy link
Contributor

lriggs commented Oct 3, 2023

Are there any updates on work towards a long term solution? I'm running into this problem trying to build arrow jars using the default ubuntu github runner. The problem started recently and I imagine would start to affect other users as well.

Are there any other temporary workaround I could try in the meantime?

@raulcd
Copy link
Member Author

raulcd commented Oct 3, 2023

There is not much update yet, as a temporary workaround we do clean some unused binaries and cached packages on the GitHub runner by using the following script: https://github.com/apache/arrow/blob/main/ci/scripts/util_free_space.sh

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

2 participants