用DELL服務器的想必都應該知道在Windows環境下的"Server Administrator"經過這個工具能夠很方便的查看出服務器當前硬件的工做狀況。因爲工做需我使用的不少DELL服務器均安裝的Linux操做系統,cpu/內存倒還好說,可硬盤都是經過PERC 6/i SAS陣列控制器建立RAID組而後提供給Linux系統使用的。而平時平常維護工做都是經過ssh文本模式遠程登陸到服務器的,爲了可以在硬盤掛了之後及時亡羊補牢從DELL網站上找到了一個不錯的軟件dell-onlinediags-linux,這個軟件功能很強大本文僅僅使用它的存儲設備檢測功能,其餘的功能你們能夠詳細研究一下幫助文檔,在此就不一一列出了。
第一步,獲取軟件
在ftp://ftp.us.dell.com/diags/找到dell-onlinediags-linux軟件包,如今最新版本是2.17.0.44。已經能夠支持RedHat6了。我是在Centos5.5 x86_64系統上測試的,可使用RHEL5的rpm包。
第二步,解壓安裝(本例下載tar包到/tmp目錄下)
cd /tmp
tar zxvf dell-onlinediags-linux-2.17.0.44.tar.gz
cd onliediags
rpm -ivh RHEL5/srvadmin-hapi-6.4.0-1.42.1.el5.i386.rpm
rpm -ivh x86_64/pediags-storelib-sysfs-6.4.0-1.2.1.el4.i386.rpm (若是是32bit系統安裝RHEL5目錄下的srvadmin-storelib-sysfs-6.4.0-1.4.2.el5.i386.rpm )
rpm -ivh dell-onlinediags-2.17.0-44.i386.rpm
第三步,檢測查詢
cd /opt/dell/onlinediags/oldiags/bin
./pediags sasdevdiag --run quicktest=true
而後屏幕會逐個出現一排小點.......耐心等待,這個要看磁盤多少了。我這裏7塊盤大概等了3分鐘的樣子吧。
而後會出現以下結果:
...................................................................
Device Name : Dell PERC 6/i Integrated RAID Controller 1
Description : Dell PERC 6/i Integrated RAID Controller 1
Device Location : PCI Bus 3, Device 0, Function 0
Test Name : SAS Controller Log Test
Description : Gets the Controller Events Log.
Passes : 1
Result : passed
Start Time : Tue Mar 15 13:48:10 2011
Completion Time : Tue Mar 15 13:48:10 2011
Result Event : The test operation completed successfully. No errors found.
Device Name : Dell PERC 6/i Integrated RAID Controller 1
Description : Dell PERC 6/i Integrated RAID Controller 1
Device Location : PCI Bus 3, Device 0, Function 0
Test Name : Controller Self Test
Description : Runs Self Diagnostics on the controller.
Passes : 1
Result : passed
Start Time : Tue Mar 15 13:48:12 2011
Completion Time : Tue Mar 15 13:49:52 2011
Result Event : The test operation completed successfully. No errors found.
Event String 1 : NVRAM test completed 37 passes successfully
Event String 2 : Memory test completed 2 passes successfully
Event String 3 : DISK test completed 11 passes successfully
Event String 4 : XOR test completed 152 passes successfully
Event String 5 : DMA test completed 18 passes successfully
Event String 6 : Controller Battery Backup Unit Working Normally
Device Name : Dell PERC 6/i Integrated RAID Controller 1
Description : Dell PERC 6/i Integrated RAID Controller 1
Device Location : PCI Bus 3, Device 0, Function 0
Test Name : Controller TTY Log Test
Description : Gets the Controller TTY Log.
Passes : 1
Result : passed
Start Time : Tue Mar 15 13:49:58 2011
Completion Time : Tue Mar 15 13:49:58 2011
Result Event : The test operation completed successfully. No errors found.
Device Name : SAS/SATA backplane 1:0
Description : SAS/SATA backplane 1:0
Device Location : Enclosure 1:0
Test Name : SAS Phy Integrity Test
Description : Gets Enclosure and HDD Phy Error Log Information.
Passes : 1
Result : passed
Start Time : Tue Mar 15 13:48:12 2011
Completion Time : Tue Mar 15 13:49:57 2011
Result Event : The test operation completed successfully. No errors found.
Device Name : Array Disk 1:0:0
Description : SAS TOSHIBA MBF2300RC
Device Location : Slot 0
Test Name : Disk Self Test
Description : Tests the hard drive.
Passes : 1
Result : passed
Start Time : Tue Mar 15 13:48:12 2011
Completion Time : Tue Mar 15 13:49:09 2011
Result Event : The test operation completed successfully. No errors found.
Device Name : Array Disk 1:0:0
Description : SAS TOSHIBA MBF2300RC
Device Location : Slot 0
Test Name : Disk Blink Test
Description : Blinks the hard drive.
Passes : 1
Result : passed
Start Time : Tue Mar 15 13:49:16 2011
Completion Time : Tue Mar 15 13:49:46 2011
Result Event : The test operation completed successfully. No errors found.
Device Name : Array Disk 1:0:1
Description : SAS TOSHIBA MBF2300RC
Device Location : Slot 1
Test Name : Disk Self Test
Description : Tests the hard drive.
Passes : 1
Result : passed
Start Time : Tue Mar 15 13:48:12 2011
Completion Time : Tue Mar 15 13:49:09 2011
Result Event : The test operation completed successfully. No errors found.
Device Name : Array Disk 1:0:1
Description : SAS TOSHIBA MBF2300RC
Device Location : Slot 1
Test Name : Disk Blink Test
Description : Blinks the hard drive.
Passes : 1
Result : passed
Start Time : Tue Mar 15 13:49:16 2011
Completion Time : Tue Mar 15 13:49:46 2011
Result Event : The test operation completed successfully. No errors found.
Device Name : Array Disk 1:0:2
Description : SAS TOSHIBA MBF2300RC
Device Location : Slot 2
Test Name : Disk Self Test
Description : Tests the hard drive.
Passes : 1
Result : passed
Start Time : Tue Mar 15 13:48:12 2011
Completion Time : Tue Mar 15 13:49:04 2011
Result Event : The test operation completed successfully. No errors found.
Device Name : Array Disk 1:0:2
Description : SAS TOSHIBA MBF2300RC
Device Location : Slot 2
Test Name : Disk Blink Test
Description : Blinks the hard drive.
Passes : 1
Result : passed
Start Time : Tue Mar 15 13:49:16 2011
Completion Time : Tue Mar 15 13:49:46 2011
Result Event : The test operation completed successfully. No errors found.
Device Name : Array Disk 1:0:3
Description : SAS TOSHIBA MBF2300RC
Device Location : Slot 3
Test Name : Disk Self Test
Description : Tests the hard drive.
Passes : 1
Result : passed
Start Time : Tue Mar 15 13:48:12 2011
Completion Time : Tue Mar 15 13:49:05 2011
Result Event : The test operation completed successfully. No errors found.
Device Name : Array Disk 1:0:3
Description : SAS TOSHIBA MBF2300RC
Device Location : Slot 3
Test Name : Disk Blink Test
Description : Blinks the hard drive.
Passes : 1
Result : passed
Start Time : Tue Mar 15 13:49:06 2011
Completion Time : Tue Mar 15 13:49:36 2011
Result Event : The test operation completed successfully. No errors found.
Device Name : SAS/SATA backplane 1:0
Description : SAS/SATA backplane 1:0
Device Location : Enclosure 1:0
Test Name : SAS Phy Integrity Test
Description : Gets Enclosure and HDD Phy Error Log Information.
Passes : 1
Result : passed
Start Time : Tue Mar 15 13:50:00 2011
Completion Time : Tue Mar 15 13:51:22 2011
Result Event : The test operation completed successfully. No errors found.
Device Name : Array Disk 1:0:4
Description : SAS TOSHIBA MBF2300RC
Device Location : Slot 4
Test Name : Disk Self Test
Description : Tests the hard drive.
Passes : 1
Result : passed
Start Time : Tue Mar 15 13:48:12 2011
Completion Time : Tue Mar 15 13:49:04 2011
Result Event : The test operation completed successfully. No errors found.
Device Name : Array Disk 1:0:4
Description : SAS TOSHIBA MBF2300RC
Device Location : Slot 4
Test Name : Disk Blink Test
Description : Blinks the hard drive.
Passes : 1
Result : passed
Start Time : Tue Mar 15 13:49:10 2011
Completion Time : Tue Mar 15 13:49:40 2011
Result Event : The test operation completed successfully. No errors found.
Device Name : Array Disk 1:0:5
Description : SAS TOSHIBA MBF2300RC
Device Location : Slot 5
Test Name : Disk Self Test
Description : Tests the hard drive.
Passes : 1
Result : passed
Start Time : Tue Mar 15 13:48:12 2011
Completion Time : Tue Mar 15 13:49:06 2011
Result Event : The test operation completed successfully. No errors found.
Device Name : Array Disk 1:0:5
Description : SAS TOSHIBA MBF2300RC
Device Location : Slot 5
Test Name : Disk Blink Test
Description : Blinks the hard drive.
Passes : 1
Result : passed
Start Time : Tue Mar 15 13:49:12 2011
Completion Time : Tue Mar 15 13:49:42 2011
Result Event : The test operation completed successfully. No errors found.
Device Name : Array Disk 1:0:6
Description : SAS TOSHIBA MBF2300RC
Device Location : Slot 6
Test Name : Disk Self Test
Description : Tests the hard drive.
Passes : 1
Result : passed
Start Time : Tue Mar 15 13:48:12 2011
Completion Time : Tue Mar 15 13:49:04 2011
Result Event : The test operation completed successfully. No errors found.
Device Name : Array Disk 1:0:6
Description : SAS TOSHIBA MBF2300RC
Device Location : Slot 6
Test Name : Disk Blink Test
Description : Blinks the hard drive.
Passes : 1
Result : passed
Start Time : Tue Mar 15 13:49:14 2011
Completion Time : Tue Mar 15 13:49:44 2011
Result Event : The test operation completed successfully. No errors found.
結果中有下劃線標出的那部分就是第一塊硬盤disk1:0:0的測試結果,看徹底部結果很清楚的就能看出來Slot 0—Slot 6
一共7塊硬盤每塊盤測試兩次,每次都是正常的。看到這樣的結果心理終於踏實了,這樣即使人沒有在IDC機房託管服務器前,也同樣能夠知道服務器當前全部硬盤的工做狀態了。寫個腳本crontab每隔3天測試一次而後發封郵件給本身,這樣是否是就不再必擔憂硬盤掛了呢?linux