Commit 392436e

q10 authored and meta-codesync[bot] committed
Fix HSTU build in OSS (#5197)
Summary:
X-link: https://github.com/facebookresearch/FBGEMM/pull/2194

Pull Request resolved: #5197

- Fix HSTU build in OSS

Reviewed By: cthi

Differential Revision: D88696540

fbshipit-source-id: bd5b30cca7c7eb51e80e336ced9de06a2437d956
1 parent d1e0d74 commit 392436e

1 file changed: +6 -2 lines changed

.github/scripts/fbgemm_gpu_build.bash

Lines changed: 6 additions & 2 deletions
@@ -298,11 +298,15 @@ __configure_fbgemm_gpu_build_cuda () {
  # NOTE: Compiling 9.0a code will fail if sm_80 output is also is also
  # enabled, bc the code relies on the following function that is not
  # supported in sm_80:
+ #
  # float4 atomicAdd(float4* address, float4 val);
- local arch_list="8.0;9.0a;10.0a;12.0a"
+ #
+ # See:
+ # https://docs.nvidia.com/cuda/cuda-c-programming-guide/index.html?highlight=atomicAdd#atomicadd
+ local arch_list="9.0a;10.0a;12.0a"
  else
  # NOTE: HSTU requires sm_75 or higher
- local arch_list="8.0;9.0a"
+ local arch_list="9.0a"
  fi
 
  elif [[ $cuda_version_nvcc == *"V13.0"* ]] ||
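
In effect, both changed lines drop the 8.0 target from a semicolon-separated arch list so that only 9.0a and newer remain. A minimal sketch of that filtering follows, for illustration only. The helper name `filter_arch_list` and its arguments are hypothetical and do not appear in `fbgemm_gpu_build.bash`, which hard-codes the lists as shown in the diff.

```bash
#!/usr/bin/env bash

# Hypothetical helper (NOT part of fbgemm_gpu_build.bash): keep only the
# entries of a semicolon-separated arch list whose major version is at
# least the given minimum.
filter_arch_list () {
  local arch_list="$1"   # e.g. "8.0;9.0a;10.0a;12.0a"
  local min_major="$2"   # e.g. 9
  local entries=()
  local kept=()
  local arch major

  # Split on ';' for the rest of this function only.
  local IFS=';'
  read -ra entries <<< "$arch_list"

  for arch in "${entries[@]}"; do
    # Everything before the first '.' is the major version ("10.0a" -> "10").
    major="${arch%%.*}"
    if (( major >= min_major )); then
      kept+=("$arch")
    fi
  done

  # "${kept[*]}" joins the elements with the first character of IFS (';').
  echo "${kept[*]}"
}

# Assumed usage: drop targets below 9.0 from the pre-fix list.
filter_arch_list "8.0;9.0a;10.0a;12.0a" 9
```

With the pre-fix list from the first branch as input, the call prints `9.0a;10.0a;12.0a`, matching the value the commit sets directly.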
