戴尔2950的不可预知的启动失败II

我们最近购买了翻新的戴尔2950 II作为我们实验室的开发盒。

在首次安装操作系统(Debian Wheezy)和引导之后,我在DRAC中收到以下错误,主机意外重启:

Critical 08/09/2014 03:13:50 CPU 2 has an internal error (IERR). Critical 08/09/2014 03:13:50 CPU 1 has an internal error (IERR). 

之后,在下一次启动的过程中,我收到以下内容(以相反的顺序):

 Critical 08/09/2014 03:15:41 A fatal IO error detected on a component at OK 08/09/2014 03:15:41 An OEM diagnostic event has occurred. OK 08/09/2014 03:15:41 An OEM diagnostic event has occurred. OK 08/09/2014 03:15:41 An OEM diagnostic event has occurred. OK 08/09/2014 03:15:41 An OEM diagnostic event has occurred. OK 08/09/2014 03:15:41 An OEM diagnostic event has occurred. OK 08/09/2014 03:15:40 An OEM diagnostic event has occurred. Non-Recoverable 08/09/2014 03:15:40 CPU 2 machine check detected. Non-Recoverable 08/09/2014 03:15:40 CPU 2 machine check detected. Critical 08/09/2014 03:15:40 A fatal IO error detected on a component at OK 08/09/2014 03:15:40 An OEM diagnostic event has occurred. Critical 08/09/2014 03:15:40 A fatal IO error detected on a component at OK 08/09/2014 03:15:40 An OEM diagnostic event has occurred. OK 08/09/2014 03:15:40 An OEM diagnostic event has occurred. OK 08/09/2014 03:15:40 An OEM diagnostic event has occurred. OK 08/09/2014 03:15:40 An OEM diagnostic event has occurred. OK 08/09/2014 03:15:40 An OEM diagnostic event has occurred. OK 08/09/2014 03:15:40 An OEM diagnostic event has occurred. Non-Recoverable 08/09/2014 03:15:39 CPU 1 machine check detected. OK 08/09/2014 03:14:05 CPU 1 is operating correctly. OK 08/09/2014 03:14:05 CPU 2 is operating correctly. OK 08/09/2014 03:14:05 CPU 1 is operating correctly. 

现在有一个奇怪的是,大约有75%的时间,启动失败,显示错误,另外25%的启动正常。

重启总是在GRUB菜单之后发生,但是在Debian开始发布典型的syslog / boot消息之前。

一如往常,任何帮助将不胜感激,谢谢!

尝试在GRUB设置中禁用帧缓冲区。 您可以在引导string的末尾添加“nofb”。 我们在戴尔2950 II或III上遇到了这个问题。