Skip to content

Commit

Permalink
Refresh CUDA arch of ort.
Browse files Browse the repository at this point in the history
- Use 3.5 PTX for both Kepler/Maxwell because there's not much Kepler clusters around with more power than a few 1080Ti.
- Add 8.0 PTX for forward compatibility.
- Set `CMAKE_CUDA_ARCHITECTURES` instead of compiler flags.
  • Loading branch information
xkszltl committed Apr 26, 2022
1 parent 5ada0d6 commit 9f267c2
Showing 1 changed file with 1 addition and 1 deletion.
2 changes: 1 addition & 1 deletion pkgs/ort.sh
Original file line number Diff line number Diff line change
Expand Up @@ -99,7 +99,7 @@
-DCMAKE_AR="$(which "$AR")" \
-DCMAKE_BUILD_TYPE=Release \
-DCMAKE_C_COMPILER="$CC" \
-DCMAKE_CUDA_FLAGS="-gencode=arch=compute_35,code=sm_35 -gencode=arch=compute_37,code=sm_37" \
-DCMAKE_CUDA_ARCHITECTURES='35-virtual;60-real;61-real;70-real;75-real;80-real;80-virtual;86-real' \
-DCMAKE_{CUDA_HOST,CXX}_COMPILER="$CXX" \
-DCMAKE_{C,CXX,CUDA}_COMPILER_LAUNCHER=ccache \
-DCMAKE_C{,XX}_FLAGS="-fdebug-prefix-map='$SCRATCH'='$INSTALL_PREFIX/src' -g $($TOOLCHAIN_CPU_NATIVE || echo '-march=haswell -mtune=generic')" \
Expand Down

0 comments on commit 9f267c2

Please sign in to comment.