Skip to content
This repository has been archived by the owner on Nov 17, 2023. It is now read-only.

Enable dp4a for the CI #9684

Merged
merged 1 commit into from Feb 4, 2018
Merged

Conversation

KellenSunderland
Copy link
Contributor

Description

Turn on sm61 so that when we add p3 support to the CI env we'll be able to execute the dp4a instruction. This should help test features like quantization in CI.

@@ -176,7 +180,7 @@ ${DOCKER_BINARY} run --rm --pid=host \
-e "CI_BUILD_UID=$(id -u)" \
-e "CI_BUILD_GROUP=$(id -g -n)" \
-e "CI_BUILD_GID=$(id -g)" \
-e "CUDA_ARCH=-gencode arch=compute_52,code=[sm_52,compute_52] --fatbin-options -compress-all" \
-e "CUDA_ARCH=-gencode arch=compute_52,code=[sm_52,compute_52] -gencode arch=compute_61,code=sm_61 --fatbin-options -compress-all" \
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

6.1 is Pascal based GPUs and thus would have no effect on our CI. Could you elaborate?

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This will produce backwards compatible binaries, including the dp4a instruction, that will work on p3 instances.

@marcoabreu marcoabreu merged commit 4c55d06 into apache:master Feb 4, 2018
rahul003 pushed a commit to rahul003/mxnet that referenced this pull request Jun 4, 2018
zheng-da pushed a commit to zheng-da/incubator-mxnet that referenced this pull request Jun 28, 2018
Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

None yet

2 participants