Installing the CUDA driver on CentOS 7.0

Date: 2019-12-05

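00. Introduction to CUDA

CUDA uses the parallel processing power of GPUs to accelerate deep learning and other compute-intensive applications.

01. CPU+GPU cooperative architecture

02. Deployment environment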

[docker@lab-250 ~]$ cat /etc/*release
NAME="Red Hat Enterprise Linux Server"
VERSION="7.0 (Maipo)"
ID="rhel"
ID_LIKE="fedora"
VERSION_ID="7.0"
PRETTY_NAME="Red Hat Enterprise Linux Server 7.0 (Maipo)"
ANSI_COLOR="0;31"
CPE_NAME="cpe:/o:redhat:enterprise_linux:7.0:GA:server"
HOME_URL="https://www.redhat.com/"
BUG_REPORT_URL="https://bugzilla.redhat.com/"

REDHAT_BUGZILLA_PRODUCT="Red Hat Enterprise Linux 7"
REDHAT_BUGZILLA_PRODUCT_VERSION=7.0
REDHAT_SUPPORT_PRODUCT="Red Hat Enterprise Linux"
REDHAT_SUPPORT_PRODUCT_VERSION=7.0
Red Hat Enterprise Linux Server release 7.0 (Maipo)
Red Hat Enterprise Linux Server release 7.0 (Maipo)

[docker@lab-250 ~]$ uname -r
3.10.0-123.el7.x86_64
[docker@lab-250 ~]$ uname -a
Linux lab-250 3.10.0-123.el7.x86_64 #1 SMP Mon May 5 11:16:57 EDT 2014 x86_64 x86_64 x86_64 GNU/Linux

Note: a GPU card must be installed in the server.
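
Whether the server actually has an NVIDIA card can be checked before installing anything. The following is a minimal sketch, assuming `lspci` is available (on RHEL/CentOS it is provided by the pciutils package); the helper name `has_nvidia_device` is hypothetical.

```shell
#!/bin/sh
# Succeed if the lspci-style text on stdin mentions an NVIDIA device.
# has_nvidia_device is a hypothetical helper name used for illustration.
has_nvidia_device() {
    grep -qi 'nvidia'
}

if command -v lspci >/dev/null 2>&1; then
    if lspci | has_nvidia_device; then
        echo "NVIDIA device found"
    else
        echo "no NVIDIA device found"
    fi
else
    echo "lspci not available (install pciutils to run this check)"
fi
```

If no NVIDIA device is listed, the toolkit can still be installed, but the driver will have nothing to talk to.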

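03. Download the CUDA Toolkit

https://developer.nvidia.com/cuda-toolkit-archive

CUDA Toolkit 9.0 (Sept 2017), Online Documentation  // This is the version downloaded for this test. Pick the package that matches your system; the local installer package, which bundles everything, is recommended.

https://developer.nvidia.com/cuda-toolkit

Note: in the installation below, the system is actually RHEL 7.0 but was mistakenly treated as CentOS 7.0, so some RPMs were not installed and had to be downloaded separately. With the matching version, no extra RPM downloads are normally needed.

cuda-repo-rhel7-9-0-local-9.0.176-1.x86_64-rpm  # for CentOS 7; CentOS is an open-source distribution based on RHEL 7, hence "rhel7" in the name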
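
Since the mix-up described in the note comes down to telling RHEL from CentOS, the `ID` field of `/etc/os-release` (shown as `ID="rhel"` in the environment output earlier) can be read before choosing packages and mirrors. A minimal sketch follows; the helper name `os_release_id` is hypothetical.

```shell
#!/bin/sh
# Print the ID field of an os-release file (default: /etc/os-release).
# os_release_id is a hypothetical helper name, not part of any tool.
os_release_id() {
    file="${1:-/etc/os-release}"
    [ -r "$file" ] || return 1
    # "^ID=" matches the ID line only, not ID_LIKE; tr strips optional quotes.
    sed -n 's/^ID=//p' "$file" | tr -d '"'
}

echo "Distribution ID: $(os_release_id || echo unknown)"
```

On the machine in this post the function would print `rhel`, which signals that dependencies should come from RHEL-compatible repositories.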

04. Setup

Installation Instructions:
rpm -i cuda-repo-rhel7-9-0-local-9.0.176-1.x86_64.rpm
yum clean all && yum makecache
yum install cuda

Other installation options are available in the form of meta-packages. 
For example, to install all the library packages, replace "cuda" with the "cuda-libraries-9-0" meta package

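Note: the CUDA installation detects the NVIDIA card automatically; there is no need to set the NVIDIA card as the default display adapter beforehand.

Troubleshooting: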

https://mirrors.aliyun.com/epel/7/aarch64/Packages/d/dkms-2.6.1-1.el7.noarch.rpm
https://mirrors.aliyun.com/centos/7.6.1810/os/x86_64/Packages/libvdpau-1.1.1-3.el7.x86_64.rpm

--> Finished Dependency Resolution
Error: Package: 1:nvidia-kmod-384.81-2.el7.x86_64 (cuda-9-0-local)
           Requires: dkms
 You could try using --skip-broken to work around the problem
 You could try running: rpm -Va --nofiles --nodigest
[root@lab-250 ~]# rz -E
rz waiting to receive.
[root@lab-250 ~]# rpm -ivh dkms-2.6.1-1.el7.noarch.rpm 
warning: dkms-2.6.1-1.el7.noarch.rpm: Header V3 RSA/SHA256 Signature, key ID 352c64e5: NOKEY
error: Failed dependencies:
	elfutils-libelf-devel is needed by dkms-2.6.1-1.el7.noarch
[root@lab-250 ~]# 
[root@lab-250 ~]# yum install -y elfutils-libelf-devel
Resolving Dependencies
--> Running transaction check
---> Package elfutils-libelf-devel.x86_64 0:0.158-3.el7 will be installed
--> Finished Dependency Resolution

Dependencies Resolved

[root@lab-250 ~]# rpm -ivh dkms-2.6.1-1.el7.noarch.rpm
warning: dkms-2.6.1-1.el7.noarch.rpm: Header V3 RSA/SHA256 Signature, key ID 352c64e5: NOKEY
Preparing... ################################# [100%]
Updating / installing...
1:dkms-2.6.1-1.el7 ################################# [100%]
[root@lab-250 ~]#
[root@lab-250 ~]# yum install -y cuda
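
The dependency failures in the log above can be spotted ahead of time by querying the RPM database for the packages that appear in the log and download links (dkms, elfutils-libelf-devel, libvdpau). This is a sketch under that assumption; `list_missing` is a hypothetical helper, and the package list should be adjusted to whatever `yum` actually reports.

```shell
#!/bin/sh
# list_missing QUERY_CMD PKG...
# Print every PKG for which "QUERY_CMD PKG" fails.
# list_missing is a hypothetical helper; QUERY_CMD is word-split on purpose
# so that a command with options such as "rpm -q" can be passed in.
list_missing() {
    query="$1"
    shift
    for pkg in "$@"; do
        $query "$pkg" >/dev/null 2>&1 || printf '%s\n' "$pkg"
    done
}

if command -v rpm >/dev/null 2>&1; then
    echo "Packages still missing before 'yum install cuda':"
    list_missing "rpm -q" dkms elfutils-libelf-devel libvdpau
else
    echo "rpm not found; this check only applies to RPM-based systems"
fi
```

Any package printed here needs to be installed first, either from a configured repository or from a downloaded RPM as shown in the log.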

05. Set environment variables

/usr/local/cuda-9.0  # default install location

vim /etc/profile

export CUDA_HOME="/usr/local/cuda-9.0"
export PATH=$CUDA_HOME/bin:$PATH
export LD_LIBRARY_PATH=$CUDA_HOME/lib64:$LD_LIBRARY_PATH

source /etc/profile
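
As an alternative to editing `/etc/profile` directly, the same exports can live in their own file under `/etc/profile.d/`, which `/etc/profile` sources on RHEL/CentOS. The sketch below assumes a file name such as `/etc/profile.d/cuda.sh` (hypothetical) and adds one small guard that the original exports do not have.

```shell
# Contents for a drop-in file such as /etc/profile.d/cuda.sh (hypothetical name).
export CUDA_HOME="/usr/local/cuda-9.0"
export PATH="$CUDA_HOME/bin:$PATH"
# Append the old value only when it is non-empty, so that no trailing ":"
# (an empty entry, which the loader may treat as the current directory) is left.
export LD_LIBRARY_PATH="$CUDA_HOME/lib64${LD_LIBRARY_PATH:+:$LD_LIBRARY_PATH}"
```

Keeping the settings in a separate file makes them easy to remove or update when a different CUDA version is installed.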

[docker@lab-250 ~]$ nvcc -V  # verify the environment variables
nvcc: NVIDIA (R) Cuda compiler driver
Copyright (c) 2005-2017 NVIDIA Corporation
Built on Fri_Sep__1_21:08:03_CDT_2017
Cuda compilation tools, release 9.0, V9.0.176

[docker@lab-250 ~]$ nvidia-smi  # show the local GPU information; the failure below occurs because the test machine has no GPU card installed
NVIDIA-SMI has failed because it couldn't communicate with the NVIDIA driver. Make sure that the latest NVIDIA driver is installed and running.
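
The two checks above can be combined into one small post-install script. This is a rough sketch: `check_tool` is a hypothetical helper, and the `/proc/modules` test assumes the driver's kernel module is named `nvidia`.

```shell
#!/bin/sh
# Report whether a command is on PATH; returns non-zero when it is not.
# check_tool is a hypothetical helper name used for illustration.
check_tool() {
    if command -v "$1" >/dev/null 2>&1; then
        echo "$1: found at $(command -v "$1")"
    else
        echo "$1: not found on PATH"
        return 1
    fi
}

check_tool nvcc || true
check_tool nvidia-smi || true

# nvidia-smi can only talk to the driver when its kernel module is loaded.
if grep -q '^nvidia ' /proc/modules 2>/dev/null; then
    echo "nvidia kernel module: loaded"
else
    echo "nvidia kernel module: not loaded"
fi
```

On a machine like the one in this post, with the toolkit installed but no GPU present, `nvcc` would be found while the kernel module would be reported as not loaded, which matches the `nvidia-smi` error shown above.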

References:

https://baijiahao.baidu.com/s?id=1610852365402771191&wfr=spider&for=pc

https://www.jianshu.com/p/34a504af8d51
