X3950: NMI, PCI, and LOG errors after shutting down through the MGMT port

Symptoms

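On an IBM X3950 server (type 8878), the LOG LED on the light path diagnostics panel was lit. I logged in through the MGMT port on the back of the server at http://192.168.70.125 (the MGMT port's management IP is 192.168.70.125; user name: USERID, password: PASSW0RD; note that the 0 in the password is the digit zero, so it is not simply an upper-case PASSWORD). There I could see the error log from earlier boots, and I clicked the button in the lower-right corner to clear it. Turning off the LOG LED on the light path diagnostics panel additionally requires shutting the server down and unplugging the power cords.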

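After clearing the log, I immediately powered the server off from the power management options on the management page and then unplugged both of the server's power cords. After a short while I plugged the cords back in and powered the server on. This time three LEDs were lit on the light path diagnostics panel: NMI, PCI, and LOG. All of the server's fans were also running at 97%-100%, very loudly and without stopping.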

I logged in to the MGMT port again to check the log and found the following errors:

22     WARN       SERVPROC      01/03/12   19:06:39   Software NMI

23     ERR SERVPROC      01/03/12   19:06:36   Address of special cycle DPE on PCI primary Chassis#=1 Slot#=2 Bus#=4 Dev.ID=0xfd00 Vend.ID=0x10df Status=0xc238 DevFun#=0x8

24     ERR SERVPROC      01/03/12   19:06:36   System Error PCI Bus

25     ERR SERVPROC      01/03/12   19:06:36   SMI handler has reported a PCI SERR.

26     ERR SERVPROC      01/03/12   19:06:36   Uncorrectable ECC error on PCI primary Chassis#=1 Slot#=2 Bus#=4 Dev.ID=0xfd00 Vend.ID=0x10df Status=0xc238 DevFun#=0x8

27     ERR SERVPROC      01/03/12   19:06:35   Parity Error PCI Bus

28     ERR SERVPROC      01/03/12   19:06:35   SMI handler has reported a PCI PERR.

29     ERR SERVPROC      01/03/12   19:06:35   Additional uncorrectable ECC error on PCI primary Chassis#=1 Slot#=2 Bus#=4 Dev.ID=0xfd00 Vend.ID=0x10df Status=0xc238 DevFun#=0x8

30     ERR SERVPROC      01/03/12   19:06:35   Parity Error PCI Bus

31     ERR SERVPROC      01/03/12   19:06:35   SMI handler has reported a PCI PERR.

32     ERR SERVPROC      01/03/12   19:06:35   Device signaled SERR on PCI primary. Chassis#=1 Slot#=2 Bus#=4 Dev.ID=0x 2a 1 Vend.ID=0x1014 Status=0x64b0 DevFun#=0x0

33     ERR SERVPROC      01/03/12   19:06:35   System Error PCI Bus

34     ERR SERVPROC      01/03/12   19:06:35   SMI handler has reported a PCI SERR.

35     ERR SERVPROC      01/03/12   19:06:35   PCI Bus SERR# Detected Chassis#=1 Slot#=2 Bus#=4 Dev.ID=0x 2a 1 Vend.ID=0x1014 Status=0x64b0 DevFun#=0x0

36     ERR SERVPROC      01/03/12   19:06:34   System Error PCI Bus

37     ERR SERVPROC      01/03/12   19:06:34   SMI handler has reported a PCI SERR.
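When reading a log like this, it can help to pull out the PCI location fields and see which device the errors cluster on. The following is a minimal Python sketch, assuming the log has been copied out of the management page as plain text in the layout shown above. The names `parse_event`, `summarize`, and `KNOWN_VENDORS` are made up for this illustration, and the vendor names are taken from the public PCI ID list, so verify them before relying on them.

```python
import re
from collections import Counter

# Field names exactly as they appear in the event log lines above.
FIELD_NAMES = r"Chassis#|Slot#|Bus#|Dev\.ID|Vend\.ID|Status|DevFun#"
# A value runs until the next "Name=" token or the end of the line, so a
# value that contains spaces (such as "0x 2a 1") is kept as-is.
FIELD_RE = re.compile(rf"({FIELD_NAMES})=(.*?)(?=\s+(?:{FIELD_NAMES})=|\s*$)")
# Columns: index, severity, source, date, time, message.
LINE_RE = re.compile(r"^\s*(\d+)\s+(\w+)\s+(\w+)\s+(\S+)\s+(\S+)\s+(.*)$")

# Assumption: vendor names as listed in the public PCI ID database.
KNOWN_VENDORS = {"0x1014": "IBM", "0x10df": "Emulex", "0x14e4": "Broadcom"}


def parse_event(line):
    """Split one log line into its columns and its Name=value fields."""
    match = LINE_RE.match(line)
    if match is None:
        return None
    index, severity, source, date, time, message = match.groups()
    fields = {name: value.strip() for name, value in FIELD_RE.findall(message)}
    return {
        "index": int(index),
        "severity": severity,
        "source": source,
        "date": date,
        "time": time,
        "message": message.strip(),
        "fields": fields,
    }


def summarize(lines):
    """Count events per (slot, bus, vendor ID, vendor name)."""
    counts = Counter()
    for line in lines:
        event = parse_event(line)
        if event is None or "Vend.ID" not in event["fields"]:
            continue
        fields = event["fields"]
        vendor_id = fields["Vend.ID"].lower()
        vendor = KNOWN_VENDORS.get(vendor_id, "unknown")
        counts[(fields.get("Slot#"), fields.get("Bus#"), vendor_id, vendor)] += 1
    return counts


if __name__ == "__main__":
    sample = [
        "22     WARN       SERVPROC      01/03/12   19:06:39   Software NMI",
        "23     ERR SERVPROC      01/03/12   19:06:36   Address of special cycle DPE "
        "on PCI primary Chassis#=1 Slot#=2 Bus#=4 Dev.ID=0xfd00 Vend.ID=0x10df "
        "Status=0xc238 DevFun#=0x8",
    ]
    for key, count in summarize(sample).items():
        print(key, count)
```

Lines without PCI fields, such as "Software NMI", are parsed but skipped in the summary, so the counts only reflect events that name a specific slot and bus.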

 

Searching for the cause turned up the following:

PCI SERR and Software NMI events in the RSA log, generated by PCIe Unsupported Request and Fatal Flow Control errors, have been reported and investigated. These events occur intermittently during manual or scheduled restarts in Microsoft Windows Server 2003.

The root cause was determined to be memory read/write requests that were inadvertently sent to the on-board Broadcom devices after the devices were already put to the PCIe D3hot low power state in preparation for the restart.

A fix was provided in the Broadcom driver to reject any memory requests to the onboard Broadcom devices when they are in the D3hot state. The fix is included in Broadcom driver version 4.6.55 or higher as seen in the Broadcom Advanced Control Suite (BACS). See the image below for an example of how to see the driver version in BACS.
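The "4.6.55 or higher" condition is easy to get wrong when comparing version strings by eye (for example, 4.10.0 is higher than 4.6.55). A minimal Python sketch of the comparison follows, assuming the installed version has been read from BACS and typed in as a dotted string. The function names `parse_version` and `has_d3hot_fix` are made up for this illustration.

```python
def parse_version(text):
    """Turn a dotted version string such as '4.6.55' into a tuple of ints."""
    return tuple(int(part) for part in text.strip().split("."))


# Minimum driver version that contains the fix, as stated in the tip above.
MIN_FIXED_VERSION = parse_version("4.6.55")


def has_d3hot_fix(installed_version):
    """Return True if the installed driver version is 4.6.55 or higher."""
    # Tuples compare element by element, so (4, 10, 0) > (4, 6, 55).
    return parse_version(installed_version) >= MIN_FIXED_VERSION


if __name__ == "__main__":
    for version in ("4.4.15", "4.6.55", "4.10.0"):
        print(version, has_d3hot_fix(version))
```

Comparing integer tuples rather than raw strings avoids the trap where "4.10.0" sorts before "4.6.55" alphabetically.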

The system may be any of the following IBM servers:

· System x3850 M2, type 7141, any model

· System x3850 M2, type 7144, any model

· System x3850 M2, type 7233, any model

· System x3850 M2, type 7234, any model

· System x3950 M2, type 7141, any model

· System x3950 M2, type 7233, any model

· System x3950 M2, type 7234, any model

This tip is not option specific.

· The Windows device driver for the on-board Broadcom 5709 is affected.

The system is configured with at least one of the following:

· Microsoft Windows 2003 Server for 32-bit Servers, any service pack

· Microsoft Windows 2003 Server for 64-bit Servers, any service pack

· Microsoft Windows 2003 Server, EE x64, any service pack

· Microsoft Windows 2003 Server, x64 Edition, any service pack

Note: This does not imply that the network operating system will work under all combinations of hardware and software.

Please see the compatibility page for more information:

http://www.ibm.com/servers/eserver/serverproven/compat/us/

Solution

This symptom is resolved in the Broadcom Windows driver available for download at the following URL:

http://www.ibm.com/support/docview.wss?uid=psg1MIGR-5070012

 

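In my case, the errors produced by the control traffic that was generated when I powered the server off from the remote management page left the NIC in a high-power state, which is why all the fans were running.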

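The fix is to update the NIC driver, but when I searched the IBM website for the X3950 Broadcom NIC driver, the link was dead. I called the IBM 400 support line, only to learn that the server was already out of warranty, so I was out of luck. I then uninstalled the NIC driver on the server and rescanned for hardware, unplugged every network cable from the server, cleared the log, shut the server down, unplugged the power cords, and started it up again. The fans ran normally, and all the LEDs on the light path diagnostics panel were off.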
