Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Unpin tf-nightly version #1140

Merged
merged 1 commit into from Jun 11, 2019

Conversation

2 participants
@alsrgv
Copy link
Collaborator

commented Jun 11, 2019

No description provided.

Unpin tf-nightly version
Signed-off-by: Alex Sergeev <alsrgv@users.noreply.github.com>

@alsrgv alsrgv requested review from tgaddair and abditag2 Jun 11, 2019

@alsrgv alsrgv merged commit 0945a27 into master Jun 11, 2019

3 checks passed

DCO DCO
Details
License Compliance All checks passed.
Details
buildkite/horovod Build #481 passed (44 minutes, 58 seconds)
Details

@alsrgv alsrgv deleted the unpin_tf_v5 branch Jun 11, 2019

sblotner added a commit to sblotner/horovod that referenced this pull request Jun 19, 2019

Unpin tf-nightly version (horovod#1140)
Signed-off-by: Alex Sergeev <alsrgv@users.noreply.github.com>
Signed-off-by: Stephanie Blotner <sblotner@uber.com>

alsrgv added a commit that referenced this pull request Jun 20, 2019

Restructure Horovod doc landing page (#1158)
* Edit Horovod docs (#1119)

Signed-off-by: Stephanie Blotner <sblotner@uber.com>

* Fix tf-nightly-gpu break (#1124)

Signed-off-by: Alex Sergeev <alsrgv@users.noreply.github.com>
Signed-off-by: Stephanie Blotner <sblotner@uber.com>

* Fix minor spacing issue in GPU and summary docs

Signed-off-by: Stephanie Blotner <sblotner@uber.com>

* Update Horovod doc site font, add landing page accordion

Signed-off-by: Stephanie Blotner <sblotner@uber.com>

* Reorder and clean up titles in left navigation

Signed-off-by: Stephanie Blotner <sblotner@uber.com>

* Add basic pages for keras, mxnet, pytorch, and tensorflow

Signed-off-by: Stephanie Blotner <sblotner@uber.com>

* Fix mpirun 4 GPU example

Signed-off-by: Stephanie Blotner <sblotner@uber.com>

* Minor updates to README

Signed-off-by: Stephanie Blotner <sblotner@uber.com>

* Make Open MPI an advanced topic

Signed-off-by: Stephanie Blotner <sblotner@uber.com>

* Fix REAMDE TF link

Signed-off-by: Stephanie Blotner <sblotner@uber.com>

* Resolve conflict

Signed-off-by: Stephanie Blotner <sblotner@uber.com>

* Fix minor spacing issue in GPU and summary docs (#1127)

Signed-off-by: Stephanie Blotner <sblotner@uber.com>

* Updated horovodrun command (#1126)

Signed-off-by: Carsten Jacobsen <carstenj@uber.com>
Signed-off-by: Stephanie Blotner <sblotner@uber.com>

* Ask GCC version when filling out the issue (#1133)

Signed-off-by: Alex Sergeev <alsrgv@users.noreply.github.com>
Signed-off-by: Stephanie Blotner <sblotner@uber.com>

* Pin tf-nightly to 1.14.1.dev20190606 & remove torchvision-nightly (#1137)

* Pin tf-nightly to 1.14.1.dev20190606

Signed-off-by: Alex Sergeev <alsrgv@users.noreply.github.com>

* torchvision-nightly -> torchvision

Signed-off-by: Alex Sergeev <alsrgv@users.noreply.github.com>

* Pin torchvision to a version that does not require CUDA

Signed-off-by: Alex Sergeev <alsrgv@users.noreply.github.com>
Signed-off-by: Stephanie Blotner <sblotner@uber.com>

* Replace .step(synchronize=False) with optimizer.skip_synchronize() (#1132)

* Replace .step(synchronize=False) with optimizer.already_synchronized()

Signed-off-by: Alex Sergeev <alsrgv@users.noreply.github.com>

* Fix docs

Signed-off-by: Alex Sergeev <alsrgv@users.noreply.github.com>

* Rename to skip_synchronize() and fix test

Signed-off-by: Alex Sergeev <alsrgv@users.noreply.github.com>
Signed-off-by: Stephanie Blotner <sblotner@uber.com>

* Unpin tf-nightly version (#1140)

Signed-off-by: Alex Sergeev <alsrgv@users.noreply.github.com>
Signed-off-by: Stephanie Blotner <sblotner@uber.com>

* Bump version to 0.16.4 (#1139)

Signed-off-by: Alex Sergeev <alsrgv@users.noreply.github.com>
Signed-off-by: Stephanie Blotner <sblotner@uber.com>

* remove MSHADOW_USE_F16C (#1141)

Signed-off-by: Lin Yuan <apeforest@gmail.com>
Signed-off-by: Stephanie Blotner <sblotner@uber.com>

* Adding support for multiple CUDA streams for NCCL operations. (#1128)

* Adding support for multiple CUDA streams for NCCL operations.

Signed-off-by: Josh Romero <joshr@nvidia.com>

* Fix compilation without CUDA or NCCL enabled.

Signed-off-by: Josh Romero <joshr@nvidia.com>

* Updating variable names.

Signed-off-by: Josh Romero <joshr@nvidia.com>
Signed-off-by: Stephanie Blotner <sblotner@uber.com>

* Add Singularity example page (#1149)

Signed-off-by: Alex Sergeev <alsrgv@users.noreply.github.com>
Signed-off-by: Stephanie Blotner <sblotner@uber.com>

* Update Gloo api for data layer (#1120)

* Added gloo as a submodule

Signed-off-by: Travis Addair <taddair@uber.com>

* Added cmake build for gloo

Signed-off-by: Travis Addair <taddair@uber.com>

* Added allreduce and broadcast ops for Gloo

Signed-off-by: Travis Addair <taddair@uber.com>

* Enable MPI

Signed-off-by: Travis Addair <taddair@uber.com>

* Fixed transport

Signed-off-by: Travis Addair <taddair@uber.com>

* Use MPI comm from Horovod

Signed-off-by: Travis Addair <taddair@uber.com>

* Changed gloo allreduce to always make use of fusion buffer

Signed-off-by: Travis Addair <taddair@uber.com>

* Copy directly to output buffer

Signed-off-by: Travis Addair <taddair@uber.com>

* Unique ptr to shared ptr

Signed-off-by: Travis Addair <taddair@uber.com>

* Fixed root pointer rank

Signed-off-by: Travis Addair <taddair@uber.com>

* Added float16 support for Gloo

Signed-off-by: Travis Addair <taddair@uber.com>

* Use allgatherv

Signed-off-by: Travis Addair <taddair@uber.com>

* Use GlooAllgather by default

Signed-off-by: Travis Addair <taddair@uber.com>

* Pulled down update to gloo

Signed-off-by: Sihan Zeng <zsh@uber.com>

* update allgather allreduce and broadcast for unified gloo api

Signed-off-by: Sihan Zeng <zsh@uber.com>

* update setup.py & MANIFEST.in

Signed-off-by: Sihan Zeng <zsh@uber.com>

* Add runtime flag to support switching betwee gloo and mpi

Signed-off-by: Sihan Zeng <zsh@uber.com>

* Resolve review

Signed-off-by: Sihan Zeng <zsh@uber.com>

* fix iface issue

Signed-off-by: Sihan Zeng <zsh@uber.com>

* set Gloo to be automatically compiled except on MacOS

Signed-off-by: Sihan Zeng <zsh@uber.com>

* fix code style

Signed-off-by: Sihan Zeng <zsh@uber.com>

* integrate compile flag

Signed-off-by: Sihan Zeng <zsh@uber.com>

* fixed reviews

Signed-off-by: Sihan Zeng <zsh@uber.com>

* remove cmake from require list if system has cmake installed

Signed-off-by: Sihan Zeng <zsh@uber.com>

* cmake becomes a blocking issue, temporarily work it around by skip compiling gloo if cmake is not installed.

Signed-off-by: Sihan Zeng <zsh@uber.com>

* rebase on the latest master

Signed-off-by: Sihan Zeng <zsh@uber.com>

* remove chmod related code

Signed-off-by: Sihan Zeng <zsh@uber.com>

* final fix up

Signed-off-by: Sihan Zeng <zsh@uber.com>
Signed-off-by: Stephanie Blotner <sblotner@uber.com>

* MLSL: move mlsl_init before mpi_init, add mlsl_finalize call (#1156)

Signed-off-by: Mikhail Shiryaev <mikhail.shiryaev@intel.com>
Signed-off-by: Stephanie Blotner <sblotner@uber.com>

* Update Horovod GitHub URL (#1147)

Signed-off-by: Alex Sergeev <alsrgv@users.noreply.github.com>
Signed-off-by: Stephanie Blotner <sblotner@uber.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
You can’t perform that action at this time.