我最近安装了第二个CPU到服务器。 CPU与第一个相同,我确认在安装之前CPU处于良好的工作状态。
服务器启动,我安装了CentOS 6没有问题。
POST屏幕可以识别两个CPU – 屏幕截图: http : //pasteboard.co/bOY8M04.png
但是,第一次启动时,我注意到在控制台上显示一个奇怪的错误消息“CPU1:卡住??” – 截图: http : //pasteboard.co/bOWvk1c.png
在挖掘/ var / log / messages时,我发现了更多的错误输出:
Sep 13 18:01:58 customer kernel: Kernel panic - not syncing: Fatal exception Sep 13 18:01:58 customer kernel: Pid: 0, comm: swapper Tainted: GD --------------- 2.6.32-431.29.2.el6.x86_64 #1 Sep 13 18:01:58 customer kernel: Call Trace: Sep 13 18:01:58 customer kernel: [<ffffffff8152873c>] ? panic+0xa7/0x16f Sep 13 18:01:58 customer kernel: [<ffffffff8152ca74>] ? oops_end+0xe4/0x100 Sep 13 18:01:58 customer kernel: [<ffffffff81010e0b>] ? die+0x5b/0x90 Sep 13 18:01:58 customer kernel: [<ffffffff8152c552>] ? do_general_protection+0x152/0x160 Sep 13 18:01:58 customer kernel: [<ffffffff8152bd25>] ? general_protection+0x25/0x30 Sep 13 18:01:58 customer kernel: [<ffffffff8103eb79>] ? native_write_cr4+0x9/0x10 Sep 13 18:01:58 customer kernel: [<ffffffff81050a2e>] ? syscall32_cpu_init+0x6e/0x80 Sep 13 18:01:58 customer kernel: [<ffffffff8151bea2>] ? xsave_init+0x31/0x48 Sep 13 18:01:58 customer kernel: [<ffffffff8151be45>] ? fpu_init+0x7e/0xaa Sep 13 18:01:58 customer kernel: [<ffffffff8151df1b>] ? cpu_init+0x309/0x35f Sep 13 18:01:58 customer kernel: [<ffffffff81521fcd>] ? start_secondary+0xd/0x2ef Sep 13 18:01:58 customer kernel: [<ffffffff81521fc0>] ? start_secondary+0x0/0x2ef Sep 13 18:01:58 customer kernel: CPU1: Stuck ?? Sep 13 18:01:58 customer kernel: #2 #3 Sep 13 18:01:58 customer kernel: general protection fault: 0000 [#2] SMP Sep 13 18:01:58 customer kernel: last sysfs file: Sep 13 18:01:58 customer kernel: CPU 3 Sep 13 18:01:58 customer kernel: Modules linked in: Sep 13 18:01:58 customer kernel: Sep 13 18:01:58 customer kernel: Pid: 0, comm: swapper Tainted: GD --------------- 2.6.32-431.29.2.el6.x86_64 #1 Supermicro X7DWT/X7DWT
以下是引导过程中/ var / log / messages的完整输出: http : //pastebin.com/b3wfmLX6
系统启动后,如果运行cat /proc/cpuinfo则只显示四个内核。
有谁知道什么可能会导致这些错误?
嗯,看起来像一个超微。 你确定你的硬件是健康的吗?
这只需要解决问题的步骤。