我刚才提到我的proxmox系统在执行备份任务(bacula,而不是proxmox-backup-system)时会不时重新启动。
日志显示如下:
Mar 8 02:30:01 proxmox-server kernel: [1922972.974633] audit: type=1400 audit(1457400601.487:129743): apparmor="DENIED" operation="ptrace" profile="lxc-container-default" pid=22456 comm="atop" requested_mask="trace" denied_mask="trace" peer="unconfined" Mar 8 02:30:01 proxmox-server kernel: [1922972.974665] audit: type=1400 audit(1457400601.487:129744): apparmor="DENIED" operation="ptrace" profile="lxc-container-default" pid=22456 comm="atop" requested_mask="read" denied_mask="read" peer="unconfined" Mar 8 02:30:01 proxmox-server kernel: [1922973.110758] audit: type=1400 audit(1457400601.623:129746): apparmor="DENIED" operation="ptrace" profile="lxc-container-default" pid=22532 comm="atop" requested_mask="read" denied_mask="read" peer="unconfined" Mar 8 02:30:01 proxmox-server kernel: [1922973.111619] audit: type=1400 audit(1457400601.627:129747): apparmor="DENIED" operation="ptrace" profile="lxc-container-default" pid=22532 comm="atop" requested_mask="trace" denied_mask="trace" peer="unconfined" Mar 8 02:31:33 proxmox-server kernel: [1923064.989437] audit: type=1400 audit(1457400693.511:129762): apparmor="DENIED" operation="ptrace" profile="lxc-container-default" pid=11598 comm="monit" requested_mask="trace" denied_mask="trace" peer="unconfined" Mar 8 02:31:33 proxmox-server kernel: [1923064.989461] audit: type=1400 audit(1457400693.511:129763): apparmor="DENIED" operation="ptrace" profile="lxc-container-default" pid=11598 comm="monit" requested_mask="trace" denied_mask="trace" peer="unconfined" Mar 8 02:31:34 proxmox-server puppet-agent[31158]: Finished catalog run in 0.63 seconds Mar 8 02:33:09 proxmox-server kernel: [1923160.542757] audit: type=1400 audit(1457400789.071:129768): apparmor="DENIED" operation="ptrace" profile="lxc-container-default" pid=29153 comm="monit" requested_mask="trace" denied_mask="trace" peer="unconfined" Mar 8 02:33:09 proxmox-server kernel: [1923160.542962] audit: type=1400 audit(1457400789.071:129769): apparmor="DENIED" operation="ptrace" profile="lxc-container-default" pid=29153 comm="monit" requested_mask="trace" denied_mask="trace" peer="unconfined" Mar 8 02:34:23 proxmox-server kernel: [1923235.217380] audit: type=1400 audit(1457400863.751:129770): apparmor="DENIED" operation="ptrace" profile="lxc-container-default" pid=31188 comm="monit" requested_mask="trace" denied_mask="trace" peer="unconfined" Mar 8 02:36:11 proxmox-server kernel: [1923342.872770] audit: type=1400 audit(1457400971.415:129771): apparmor="DENIED" operation="ptrace" profile="lxc-container-default" pid=9471 comm="monit" requested_mask="trace" denied_mask="trace" peer="unconfined" Mar 8 02:36:13 proxmox-server kernel: [1923344.924216] audit: type=1400 audit(1457400973.463:129772): apparmor="DENIED" operation="ptrace" profile="lxc-container-default" pid=11037 comm="sw-collectd" requested_mask="trace" denied_mask="trace" peer="unconfined" Mar 8 02:36:13 proxmox-server kernel: [1923344.924251] audit: type=1400 audit(1457400973.463:129773): apparmor="DENIED" operation="ptrace" profile="lxc-container-default" pid=11037 comm="sw-collectd" requested_mask="read" denied_mask="read" peer="unconfined" Mar 8 02:36:28 proxmox-server kernel: [1923360.048495] audit: type=1400 audit(1457400988.591:129777): apparmor="DENIED" operation="ptrace" profile="lxc-container-default" pid=15979 comm="sw-collectd" requested_mask="read" denied_mask="read" peer="unconfined" Mar 8 02:36:30 proxmox-server kernel: [1923361.994794] audit: type=1400 audit(1457400990.536:129778): apparmor="DENIED" operation="ptrace" profile="lxc-container-default" pid=16377 comm="sw-collectd" requested_mask="trace" denied_mask="trace" peer="unconfined" Mar 8 02:36:49 proxmox-server kernel: [1923380.457157] audit: type=1400 audit(1457401009.000:129784): apparmor="DENIED" operation="ptrace" profile="lxc-container-default" pid=2842 comm="ps" requested_mask="trace" denied_mask="trace" peer="unconfined" Mar 8 02:36:49 proxmox-server kernel: [1923380.821127] audit: type=1400 audit(1457401009.364:129790): apparmor="DENIED" operation="ptrace" profile="lxc-container-default" pid=2872 comm="ps" requested_mask="trace" denied_mask="trace" peer="unconfined" Mar 8 02:38:11 proxmox-server kernel: [1923462.649203] audit_printk_skb: 54 callbacks suppressed Mar 8 02:38:11 proxmox-server kernel: [1923462.649206] audit: type=1400 audit(1457401091.200:129810): apparmor="DENIED" operation="ptrace" profile="lxc-container-default" pid=29153 comm="monit" requested_mask="trace" denied_mask="trace" peer="unconfined" Mar 8 02:38:11 proxmox-server kernel: [1923462.649433] audit: type=1400 audit(1457401091.200:129811): apparmor="DENIED" operation="ptrace" profile="lxc-container-default" pid=29153 comm="monit" requested_mask="trace" denied_mask="trace" peer="unconfined" Mar 8 02:41:25 proxmox-server rsyslogd: [origin software="rsyslogd" swVersion="8.4.2" x-pid="920" x-info="http://www.rsyslog.com"] start Mar 8 02:41:25 proxmox-server systemd-modules-load[303]: Module 'fuse' is builtin Mar 8 02:41:25 proxmox-server systemd-modules-load[303]: Module 'loop' is builtin Mar 8 02:41:25 proxmox-server systemd-modules-load[303]: Inserted module 'vhost_net' Mar 8 02:41:25 proxmox-server hdparm[324]: Setting parameters of disc: (none). Mar 8 02:41:25 proxmox-server keyboard-setup[325]: Setting preliminary keymap...done. Mar 8 02:41:25 proxmox-server systemd-fsck[482]: /dev/sda3: Journal wird wiederhergestellt Mar 8 02:41:25 proxmox-server systemd-fsck[482]: /dev/sda3: sauber, 331/62592 Dateien, 64258/250112 Blöcke
这是我的proxmox设置:
proxmox-ve: 4.1-39 (running kernel: 4.2.8-1-pve) pve-manager: 4.1-15 (running version: 4.1-15/8cd55b52) pve-kernel-4.2.6-1-pve: 4.2.6-36 pve-kernel-2.6.32-43-pve: 2.6.32-166 pve-kernel-4.2.8-1-pve: 4.2.8-39 pve-kernel-4.2.2-1-pve: 4.2.2-16 pve-kernel-2.6.32-26-pve: 2.6.32-114 pve-kernel-4.2.3-2-pve: 4.2.3-22 lvm2: 2.02.116-pve2 corosync-pve: 2.3.5-2 libqb0: 1.0-1 pve-cluster: 4.0-33 qemu-server: 4.0-62 pve-firmware: 1.1-7 libpve-common-perl: 4.0-49 libpve-access-control: 4.0-11 libpve-storage-perl: 4.0-42 pve-libspice-server1: 0.12.5-2 vncterm: 1.2-1 pve-qemu-kvm: 2.5-8 pve-container: 1.0-46 pve-firewall: 2.0-18 pve-ha-manager: 1.0-23 ksm-control-daemon: 1.2-1 glusterfs-client: 3.5.2-2+deb8u1 lxc-pve: 1.1.5-7 lxcfs: 2.0.0-pve1 cgmanager: 0.39-pve1 criu: 1.6.0-1
任何想法呢?
通常当零星重新启动时,它不是操作系统,这是一个物理问题。
我会从电源开始。 这是一个好的,还是便宜? 服务器上是否有冗余电源,还是低端服务器或台式机? 功率是否超载?
有可能是有一堆磁盘和这些磁盘都旋转和寻找做高功耗备份时。 尽pipe整个PSU都很好,但是这会导致电源的一个特定轨道(又称电压总线)变得过载。
例如,如果+ 5V的最大功率是20安培,并且每个2.1安培有10个磁盘,那么您将超载电源,并closures或重新启动。