This repository was archived by the owner on Nov 17, 2023. It is now read-only.

Enable dp4a for the CI #9684

Merged: marcoabreu merged 1 commit into apache:master from KellenSunderland:ci_dp4a_support on Feb 4, 2018

@KellenSunderland
Contributor

Description

Turn on sm_61 so that, once we add p3 support to the CI environment, we'll be able to execute the dp4a instruction. This should help test features like quantization in CI.
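For readers unfamiliar with dp4a, the following is a minimal host-side sketch of what the signed variant computes: a dot product of four packed 8-bit integers accumulated into a 32-bit integer. The helper names `pack_int8x4` and `dp4a_reference` are hypothetical and are not part of MXNet or CUDA; the 32-bit wraparound on overflow is an assumption of this sketch. A reference model like this could be used to check the results of a quantized kernel in a test.

```python
import struct


def pack_int8x4(values):
    """Pack four signed 8-bit integers into one 32-bit word (little-endian lanes)."""
    if len(values) != 4:
        raise ValueError("expected exactly four int8 values")
    return struct.unpack("<I", struct.pack("<4b", *values))[0]


def dp4a_reference(a_word, b_word, acc):
    """Hypothetical reference model of a signed dp4a.

    Computes acc + sum(a[i] * b[i]) over the four int8 lanes of each word.
    Assumption: the result wraps to a signed 32-bit integer (no saturation).
    """
    a = struct.unpack("<4b", struct.pack("<I", a_word))
    b = struct.unpack("<4b", struct.pack("<I", b_word))
    total = acc + sum(x * y for x, y in zip(a, b))
    total &= 0xFFFFFFFF  # keep the low 32 bits
    return total - (1 << 32) if total & 0x80000000 else total


a = pack_int8x4([1, -2, 3, -4])
b = pack_int8x4([5, 6, -7, 8])
print(dp4a_reference(a, b, 10))  # -> -50
```

The appeal for quantized inference is that the four multiplies and the accumulate are a single instruction on hardware that supports it, which is why CI coverage for that code path is useful.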

  -e "CI_BUILD_GROUP=$(id -g -n)" \
  -e "CI_BUILD_GID=$(id -g)" \
- -e "CUDA_ARCH=-gencode arch=compute_52,code=[sm_52,compute_52] --fatbin-options -compress-all" \
+ -e "CUDA_ARCH=-gencode arch=compute_52,code=[sm_52,compute_52] -gencode arch=compute_61,code=sm_61 --fatbin-options -compress-all" \
Contributor

Compute capability 6.1 corresponds to Pascal-based GPUs, so this would have no effect on our CI. Could you elaborate?

Contributor Author

This will produce backwards-compatible binaries that include the dp4a instruction and will work on p3 instances.
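To make the question of what the new `CUDA_ARCH` value embeds more concrete, here is a small sketch that lists the targets named in an nvcc flag string. In nvcc's `-gencode` syntax, a `code=sm_XX` entry requests a device binary (SASS) for that architecture and a `code=compute_XX` entry requests PTX that the driver can JIT-compile. The function name `embedded_targets` is hypothetical, and the parser only handles the flag shapes that appear in this change. Whether a given target actually runs on a particular GPU depends on NVIDIA's compatibility rules, which this sketch does not model.

```python
import re


def embedded_targets(cuda_arch_flags):
    """Return [(virtual_arch, [code targets])] for each -gencode clause.

    Hypothetical helper: handles only 'code=X' and 'code=[X,Y]' forms.
    """
    pattern = re.compile(r"-gencode\s+arch=(\w+),code=(\[[^\]]*\]|\w+)")
    targets = []
    for arch, code in pattern.findall(cuda_arch_flags):
        codes = [c.strip() for c in code.strip("[]").split(",")]
        targets.append((arch, codes))
    return targets


flags = (
    "-gencode arch=compute_52,code=[sm_52,compute_52] "
    "-gencode arch=compute_61,code=sm_61 --fatbin-options -compress-all"
)

for arch, codes in embedded_targets(flags):
    for code in codes:
        kind = "SASS (device binary)" if code.startswith("sm_") else "PTX (JIT-compiled by the driver)"
        print(f"{arch} -> {code}: {kind}")
```

For the flags in this change, the listing shows sm_52 SASS, compute_52 PTX, and sm_61 SASS; no compute_61 PTX is requested.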

@marcoabreu merged commit 4c55d06 into apache:master on Feb 4, 2018
rahul003 pushed a commit to rahul003/mxnet that referenced this pull request Jun 4, 2018
zheng-da pushed a commit to zheng-da/incubator-mxnet that referenced this pull request Jun 28, 2018