What changes when you specify "-arch sm_35" while compiling a cubin file

Date: 2019-12-11

This follows on from the previous blog post.

nvcc -cubin -m64 -arch sm_35   *.cu --use_fast_math  --maxrregcount=32  --ptxas-options=-v -O3 -o *.cubin

Compiling with the command above has two effects:

1. The verbose ptxas output shows that local memory is used.

2. The resulting cubin runs on a Tesla K40 without problems.
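
Whether local memory is used can be read from the report that --ptxas-options=-v prints for each kernel, which lists the stack frame size and the spill stores and loads in bytes. Below is a minimal sketch of a check for that report. The function name uses_local_memory is hypothetical, and the sketch assumes the report uses the usual "N bytes stack frame, N bytes spill stores, N bytes spill loads" wording (some toolkit versions also print "N bytes lmem").

```shell
# Hypothetical helper: reads ptxas verbose output on stdin and succeeds
# (exit status 0) if any stack-frame, spill, or lmem byte count is non-zero.
# Assumes the usual "N bytes stack frame, N bytes spill stores, ..." wording.
uses_local_memory() {
    grep -Eq '[1-9][0-9]* bytes (stack frame|spill stores|spill loads|lmem)'
}

# Usage sketch: ptxas normally reports on stderr, so redirect it into the pipe.
#   nvcc <options from this post> 2>&1 | uses_local_memory && echo "local memory in use"
```

The check only looks for a non-zero byte count, so it answers yes or no for the whole build and does not say which kernel spills.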

nvcc -cubin -m64 *.cu --use_fast_math  --maxrregcount=32  --ptxas-options=-v -O3 -o *.cubin

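If -arch sm_35 is removed, the compiler targets sm_20 by default, and ptxas reports "Compiling entry function '*' for 'sm_20'". This has two effects:

1. With --maxrregcount=32, local memory is not used.

2. The resulting cubin cannot run on a Tesla K40.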
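
The second effect follows from what a cubin is: machine code for one specific architecture, with no PTX for the driver to recompile. The Tesla K40 has compute capability 3.5, so a cubin built for sm_20 cannot be loaded on it. Below is a minimal sketch of deriving the -arch value from a compute capability string. The function name cc_to_arch is hypothetical, and the sketch assumes the capability is written as "major.minor".

```shell
# Hypothetical helper: turn a compute capability such as "3.5" into the
# matching nvcc architecture name such as "sm_35" by dropping the dot.
cc_to_arch() {
    printf 'sm_%s\n' "$(printf '%s' "$1" | tr -d '.')"
}

# The Tesla K40 has compute capability 3.5.
arch=$(cc_to_arch 3.5)
printf '%s\n' "-arch $arch"   # prints "-arch sm_35"
```

On recent drivers the capability string can reportedly be obtained with nvidia-smi --query-gpu=compute_cap --format=csv,noheader, but older nvidia-smi versions may not support that query field.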
