Exclude .git for cloud storage #1494
Hm, tests seem to be broken now due to a Python version issue: #609 (comment)
Could we run some tests and write a Tested section?
Ah yes, I already did some tests. Updated the post.
Thanks @infwinston. I've run the following tests for AWS/S3 only:
» cat test.yaml
run: |
  set -ex
  pwd
  ls -lthr

mkdir dir-with-no-git
sky spot launch test.yaml --cloud aws -n dir-with-no-git --workdir=dir-with-no-git
mkdir dir-with-git-file; cd dir-with-git-file; touch .git
sky spot launch test.yaml --cloud aws -n dir-with-git-file --workdir=dir-with-git-file
Can you help test them on GCS before merging? It'd also be great to run the file mount smoke tests.
I just tested
hmm I just realized our bert_qa example shown on the spot job doc may fail. because One simple solution is to modify the YAML in the doc to use
Great catch!
->
Smoke test passed. Just done an end-to-end test with and without excluding
@concretevitamin I also add some warning if
A bit concerning there's no speed difference.
Maybe try the example in #1463?
Co-authored-by: Zongheng Yang <zongheng.y@gmail.com>
The example in #1463 seems to be related to a local network issue that occasionally happens to my laptop. Now it seems to work well with and without Will keep an eye on this.
45a13de to 1897a35
* s3 exclude .git
* gcs
* comments
* warning and fix doc
* Update sky/data/storage.py
Co-authored-by: Zongheng Yang <zongheng.y@gmail.com>
* fix
Co-authored-by: Zongheng Yang <zongheng.y@gmail.com>
This is a quick patch to make S3 exclude .git for #1485. However, this might be surprising behavior for some users. An alternative is to implement our own code to read .gitignore and generate the corresponding directories to exclude (i.e., parsing patterns such as **/*.pyc ourselves). Should we implement that?
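For discussion, here is a rough sketch of what that alternative could look like. It is not code from this PR: the helper names (read_ignore_patterns, is_excluded, files_to_upload) are hypothetical, and it only approximates .gitignore semantics with fnmatch (no negation with "!", no anchoring rules), which is part of why doing this properly is more work than the quick patch.

```python
import fnmatch
import os
from typing import List


def read_ignore_patterns(workdir: str) -> List[str]:
    """Return non-empty, non-comment lines of <workdir>/.gitignore, if any."""
    path = os.path.join(workdir, '.gitignore')
    if not os.path.isfile(path):
        return []
    with open(path, encoding='utf-8') as f:
        lines = [line.strip() for line in f]
    return [line for line in lines if line and not line.startswith('#')]


def is_excluded(rel_path: str, patterns: List[str]) -> bool:
    """Approximate check: should this '/'-separated relative path be skipped?"""
    parts = rel_path.split('/')
    # Always skip .git itself, whether it is a directory or a plain file.
    if '.git' in parts:
        return True
    for pattern in patterns:
        # Simplification (assumption): drop a trailing '/' and a leading
        # '**/' and then match the pattern against the whole path or
        # against any single path component.
        pattern = pattern.rstrip('/')
        if pattern.startswith('**/'):
            pattern = pattern[3:]
        if fnmatch.fnmatch(rel_path, pattern):
            return True
        if any(fnmatch.fnmatch(part, pattern) for part in parts):
            return True
    return False


def files_to_upload(workdir: str) -> List[str]:
    """List relative paths under workdir that survive the exclusion rules."""
    patterns = read_ignore_patterns(workdir)
    kept = []
    for root, _dirs, files in os.walk(workdir):
        for name in files:
            full = os.path.join(root, name)
            rel = os.path.relpath(full, workdir).replace(os.sep, '/')
            if not is_excluded(rel, patterns):
                kept.append(rel)
    return sorted(kept)
```

With a .gitignore containing **/*.pyc, this would keep train.py and drop pkg/mod.pyc and everything under .git, but real .gitignore files can contain rules this sketch does not handle.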
Note: sky launch will respect .gitignore, and putting .git/ inside it is enough to exclude the directory. Or should we make it the default for all rsync?
Tested
sky spot launch a workdir containing .git
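To make the quick patch concrete, the sketch below shows one way the exclusion could be passed to the cloud CLIs. It is an illustration under assumptions, not the code in sky/data/storage.py: the function names are hypothetical, the default patterns are guesses at what "exclude .git" means, and the gsutil pattern in particular may need adjusting since -x takes a Python regular expression rather than a glob.

```python
import shlex
from typing import List, Sequence


def s3_sync_command(src_dir: str,
                    bucket: str,
                    excludes: Sequence[str] = ('.git/*',)) -> List[str]:
    """Build an `aws s3 sync` argv with one --exclude flag per pattern."""
    cmd = ['aws', 's3', 'sync', src_dir, f's3://{bucket}']
    for pattern in excludes:
        cmd += ['--exclude', pattern]
    return cmd


def gcs_rsync_command(src_dir: str,
                      bucket: str,
                      exclude_regex: str = r'^\.git/.*$') -> List[str]:
    """Build a `gsutil rsync` argv; -x expects a regex (assumed pattern)."""
    return ['gsutil', '-m', 'rsync', '-r', '-x', exclude_regex,
            src_dir, f'gs://{bucket}']


if __name__ == '__main__':
    # Print the commands instead of running them; 'my-bucket' is made up.
    print(shlex.join(s3_sync_command('dir-with-git-file', 'my-bucket')))
    print(shlex.join(gcs_rsync_command('dir-with-git-file', 'my-bucket')))
```

Building the argv as a list and quoting it only for display avoids shell-quoting mistakes with patterns like .git/*.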