OpenBSD 4.4的ichiic0错误

我有四个相同的节点构build在运行OpenBSD 4.4的Supermicro 1025TC-TB硬件上。 其中两个节点是路由器,另外两个是负载均衡器。 每个集合都以主动/被动configuration运行,并由CARP提供故障转移。

活动的路由器节点以约每小时两个的速率logging这些错误:5月5日04:09:30 gw / bsd:ichiic0:中止失败,状态0x0

偶尔会popup:5月5日03:59:41 gw / bsd:ichiic0:中止失败,状态0x40

我还没有find这些logging的模式。 ichiic是英特尔SMBus驱动程序。 这款主板还具有I / OATfunction,但OpenBSD不支持这些function。 我正在运行最新的BIOS。

我怀疑这些logging是在特别高的中断尖峰期间logging的,但是systat足够慢,以至于我无法直接关联它。 我也无法手动复制这个问题。 负载平衡器不受影响。

有任何想法吗?

dmesg的:

OpenBSD 4.4 (GENERIC) #1021: Tue Aug 12 17:16:55 MDT 2008 deraadt@i386.openbsd.org:/usr/src/sys/arch/i386/compile/GENERIC cpu0: Intel(R) Xeon(R) CPU E5405 @ 2.00GHz ("GenuineIntel" 686-class) 2.01 GHz cpu0: FPU,V86,DE,PSE,TSC,MSR,PAE,MCE,CX8,APIC,SEP,MTRR,PGE,MCA,CMOV,PAT,PSE36,CFLUSH,DS,ACPI,MMX,FXSR,SSE,SSE2,SS,HTT,TM,SBF,SSE3,MWAIT,DS-CPL,VMX,TM2,CX16,xTPR real mem = 3756871680 (3582MB) avail mem = 3648520192 (3479MB) mainbus0 at root bios0 at mainbus0: AT/286+ BIOS, date 08/20/08, BIOS32 rev. 0 @ 0xfdb90, SMBIOS rev. 2.5 @ 0xdff5f000 (39 entries) bios0: vendor Phoenix Technologies LTD version "1.1" date 08/20/2008 bios0: Supermicro X7DCT acpi0 at bios0: rev 2 acpi0: tables DSDT FACP APIC MCFG BOOT SPCR ERST HEST BERT EINJ SLIC SSDT SSDT SSDT SSDT SSDT SSDT SSDT SSDT SSDT acpi0: wakeup devices P0P2(S5) P0P4(S5) USB4(S1) USB5(S1) USB7(S1) ESB2(S1) EXP1(S5) EXP5(S5) EXP6(S5) USB1(S1) USB2(S1) USB3(S1) USB6(S1) ESB1(S1) PCIB(S5) COM1(S5) COM2(S5) acpitimer0 at acpi0: 3579545 Hz, 24 bits acpiprt0 at acpi0: bus 1 (P0P2) acpiprt1 at acpi0: bus 2 (P0P4) acpiprt2 at acpi0: bus 0 (PCI0) acpiprt3 at acpi0: bus 3 (EXP1) acpiprt4 at acpi0: bus 4 (EXP5) acpiprt5 at acpi0: bus 5 (EXP6) acpiprt6 at acpi0: bus 6 (PCIB) acpicpu0 at acpi0 acpibtn0 at acpi0: PWRB bios0: ROM list: 0xc0000/0xb000 ipmi at mainbus0 not configured cpu0 at mainbus0 pci0 at mainbus0 bus 0: configuration mode 1 (no bios) pchb0 at pci0 dev 0 function 0 "Intel 5100 Host" rev 0x90 ppb0 at pci0 dev 2 function 0 "Intel 5100 PCIE" rev 0x90 pci1 at ppb0 bus 1 ppb1 at pci0 dev 4 function 0 "Intel 5100 PCIE" rev 0x90 pci2 at ppb1 bus 2 "Intel I/OAT SCNB" rev 0x90 at pci0 dev 8 function 0 not configured pchb1 at pci0 dev 16 function 0 "Intel 5100 FSB" rev 0x90 pchb2 at pci0 dev 16 function 1 "Intel 5100 FSB" rev 0x90 pchb3 at pci0 dev 16 function 2 "Intel 5100 FSB" rev 0x90 pchb4 at pci0 dev 17 function 0 "Intel 5100 Reserved" rev 0x90 pchb5 at pci0 dev 19 function 0 "Intel 5100 Reserved" rev 0x90 pchb6 at pci0 dev 21 function 0 "Intel 5100 DDR" rev 0x90 pchb7 at pci0 dev 22 function 0 "Intel 5100 DDR" rev 0x90 uhci0 at pci0 dev 26 function 0 "Intel 82801I USB" rev 0x02: irq 5 ehci0 at pci0 dev 26 function 7 "Intel 82801I USB" rev 0x02: irq 11 usb0 at ehci0: USB revision 2.0 uhub0 at usb0 "Intel EHCI root hub" rev 2.00/1.00 addr 1 ppb2 at pci0 dev 28 function 0 "Intel 82801I PCIE" rev 0x02: irq 10 pci3 at ppb2 bus 3 ppb3 at pci0 dev 28 function 4 "Intel 82801I PCIE" rev 0x02: irq 10 pci4 at ppb3 bus 4 em0 at pci4 dev 0 function 0 "Intel PRO/1000MT (82573E)" rev 0x03: irq 10, address 00:30:48:d4:62:ea ppb4 at pci0 dev 28 function 5 "Intel 82801I PCIE" rev 0x02: irq 10 pci5 at ppb4 bus 5 em1 at pci5 dev 0 function 0 "Intel PRO/1000MT (82573L)" rev 0x00: irq 10, address 00:30:48:d4:62:eb uhci1 at pci0 dev 29 function 0 "Intel 82801I USB" rev 0x02: irq 7 uhci2 at pci0 dev 29 function 1 "Intel 82801I USB" rev 0x02: irq 11 uhci3 at pci0 dev 29 function 2 "Intel 82801I USB" rev 0x02: irq 10 ehci1 at pci0 dev 29 function 7 "Intel 82801I USB" rev 0x02: irq 7 usb1 at ehci1: USB revision 2.0 uhub1 at usb1 "Intel EHCI root hub" rev 2.00/1.00 addr 1 ppb5 at pci0 dev 30 function 0 "Intel 82801BA Hub-to-PCI" rev 0x92 pci6 at ppb5 bus 6 vga1 at pci6 dev 1 function 0 "ATI ES1000" rev 0x02 wsdisplay0 at vga1 mux 1: console (80x25, vt100 emulation) wsdisplay0: screen 1-5 added (80x25, vt100 emulation) drm at vga1 unsupported ichpcib0 at pci0 dev 31 function 0 "Intel 82801IR LPC" rev 0x02: PM disabled pciide0 at pci0 dev 31 function 2 "Intel 82801I SATA" rev 0x02: DMA, channel 0 configured to native-PCI, channel 1 configured to native-PCI pciide0: using irq 10 for native-PCI interrupt wd0 at pciide0 channel 0 drive 0: <WDC WD1600BEKT-00F3T0> wd0: 16-sector PIO, LBA48, 152627MB, 312581808 sectors wd0(pciide0:0:0): using PIO mode 4, Ultra-DMA mode 5 ichiic0 at pci0 dev 31 function 3 "Intel 82801I SMBus" rev 0x02: irq 10 iic0 at ichiic0 lm1 at iic0 addr 0x2d: W83627HF wbng0 at iic0 addr 0x2f: w83793g pciide1 at pci0 dev 31 function 5 "Intel 82801I SATA" rev 0x02: DMA, channel 0 wired to native-PCI, channel 1 wired to native-PCI pciide1: using irq 11 for native-PCI interrupt usb2 at uhci0: USB revision 1.0 uhub2 at usb2 "Intel UHCI root hub" rev 1.00/1.00 addr 1 usb3 at uhci1: USB revision 1.0 uhub3 at usb3 "Intel UHCI root hub" rev 1.00/1.00 addr 1 usb4 at uhci2: USB revision 1.0 uhub4 at usb4 "Intel UHCI root hub" rev 1.00/1.00 addr 1 usb5 at uhci3: USB revision 1.0 uhub5 at usb5 "Intel UHCI root hub" rev 1.00/1.00 addr 1 isa0 at ichpcib0 isadma0 at isa0 com0 at isa0 port 0x3f8/8 irq 4: ns16550a, 16 byte fifo com1 at isa0 port 0x2f8/8 irq 3: ns16550a, 16 byte fifo pckbc0 at isa0 port 0x60/5 pckbd0 at pckbc0 (kbd slot) pckbc0: using irq 1 for kbd slot wskbd0 at pckbd0: console keyboard, using wsdisplay0 pcppi0 at isa0 port 0x61 midi0 at pcppi0: <PC speaker> spkr0 at pcppi0 wbsio0 at isa0 port 0x2e/2: W83627HF rev 0x41 wbsio0 port 0x290/2 not configured npx0 at isa0 port 0xf0/16: reported by CPUID; using exception 16 biomask fde5 netmask fde5 ttymask ffff mtrr: Pentium Pro MTRR support softraid0 at root root on wd0a swap on wd0b dump on wd0b 

你正在运行最新的BIOS? 这将是我会检查的第一件事。

就个人而言,我从来没有看到这个错误(我运行一个OBSD 4.4开发框)。 我为你研究了一下,我可以告诉你,不止一个人遇到同样的问题(甚至在ODBS 4.4之前,我相信你也发现了他们的post),但是没有确定的答案。 有些将其归因于超线程,有些归于失败的主板。

我会把这些留在我待清理的事情清单上。 对不起,我不能有进一步的帮助。

该错误出现在ichiic_i2c_exec()函数ichiic_i2c_exec() ,并指示等待传输完成的超时。 如果非常罕见,我不会担心。 但是我会留意日志(日志检查?)。 如果开始发生的次数更多,我会怀疑硬件。

你可以尝试升级到快照,看看情况是否仍然存在? 如果不能解决问题,可以把它报告给开发者。

在你使用的BT版本中有一个令人讨厌的DoS漏洞 。