zhy1899 发表于 2017-1-5 08:51:23

第二个节点老是dead状态

安装步骤如下:
两个节点
web1172.16.100.8
web2172.16.100.9
VIP:172.16.100.100


web1的配置如下
# cat /etc/hosts
hosts      hosts.allowhosts.deny   
# cat /etc/hosts
hosts      hosts.allowhosts.deny   
# cat /etc/hosts
127.0.0.1   localhost localhost.localdomain localhost4 localhost4.localdomain4
::1         localhost localhost.localdomain localhost6 localhost6.localdomain6
172.16.100.8 web1.sdzy.cn web1
172.16.100.9 web2.sdzy.cn web2
# cat /etc/sysconfig/network
NETWORKING=yes
HOSTNAME=web1.sdzy.cn
# uname -n
web1.sdzy.cn
# ifconfig
eth1      Link encap:EthernetHWaddr 00:50:56:2A:84:EB
          inet addr:172.16.100.8Bcast:172.16.100.255Mask:255.255.255.0
          inet6 addr: fe80::250:56ff:fe2a:84eb/64 Scope:Link
          UP BROADCAST RUNNING MULTICASTMTU:1500Metric:1
          RX packets:1884 errors:0 dropped:0 overruns:0 frame:0
          TX packets:1534 errors:0 dropped:0 overruns:0 carrier:0
          collisions:0 txqueuelen:1000
          RX bytes:341923 (333.9 KiB)TX bytes:292151 (285.3 KiB)


lo      Link encap:Local Loopback
          inet addr:127.0.0.1Mask:255.0.0.0
          inet6 addr: ::1/128 Scope:Host
          UP LOOPBACK RUNNINGMTU:16436Metric:1
          RX packets:0 errors:0 dropped:0 overruns:0 frame:0
          TX packets:0 errors:0 dropped:0 overruns:0 carrier:0
          collisions:0 txqueuelen:0
          RX bytes:0 (0.0 b)TX bytes:0 (0.0 b)


web2的配置如下
# cat /etc/hosts
127.0.0.1   localhost localhost.localdomain localhost4 localhost4.localdomain4
::1         localhost localhost.localdomain localhost6 localhost6.localdomain6
172.16.100.8 web1.sdzy.cn web1
172.16.100.9 web2.sdzy.cn web2
# cat /etc/sysconfig/network
NETWORKING=yes
HOSTNAME=web2.sdzy.cn
# ifconfig
eth1      Link encap:EthernetHWaddr 00:50:56:34:E7:9B
          inet addr:172.16.100.9Bcast:172.16.255.255Mask:255.255.0.0
          inet6 addr: fe80::250:56ff:fe34:e79b/64 Scope:Link
          UP BROADCAST RUNNING MULTICASTMTU:1500Metric:1
          RX packets:2869 errors:0 dropped:0 overruns:0 frame:0
          TX packets:2469 errors:0 dropped:0 overruns:0 carrier:0
          collisions:0 txqueuelen:1000
          RX bytes:486685 (475.2 KiB)TX bytes:472672 (461.5 KiB)

lo      Link encap:Local Loopback
          inet addr:127.0.0.1Mask:255.0.0.0
          inet6 addr: ::1/128 Scope:Host
          UP LOOPBACK RUNNINGMTU:16436Metric:1
          RX packets:22 errors:0 dropped:0 overruns:0 frame:0
          TX packets:22 errors:0 dropped:0 overruns:0 carrier:0
          collisions:0 txqueuelen:0
          RX bytes:1166 (1.1 KiB)TX bytes:1166 (1.1 KiB)

所有节点安装的heartbeat的软件列表如下
-rw-r--r--1 root root   165292 Dec32013 heartbeat-3.0.4-2.el6.x86_64.rpm
-rw-r--r--1 root root   269468 Dec32013 heartbeat-libs-3.0.4-2.el6.x86_64.rpm
-rw-r--r--1 root root   114888 Sep22014 heat-cfntools-1.2.6-4.el6.noarch.rpm
web1的ha的配置如下
# cat ha.cf | grep -v "^#.*"
logfacility   local0
keepalive 1
bcasteth1
auto_failback on
node    web1.sdzy.cn
node    web2.sdzy.cn
ping 172.16.100.2




# cat haresources | grep -v "^#.*"
web1.sdzy.cn IPaddr2::172.16.100.4/24/eth1 httpd
# cat authkeys | grep -v "^#.*"
auth 1
1 md5 2a8559a2d3c8404fed986b16fbd49cb4
#
web2 的ha配置

# scp ha.cf authkeys haresourcesweb2:/etc/ha.d/
ha.cf                                                                                             100%   10KB10.3KB/s   00:00   
authkeys                                                                                          100%691   0.7KB/s   00:00   
haresources

现在的问题为:
1。web2 经常dead

heartbeat的日志


Jan5 08:49:13 web1 heartbeat: : info: Pacemaker support: false
Jan5 08:49:13 web1 heartbeat: : WARN: Logging daemon is disabled --enabling logging daemon is recommended
Jan5 08:49:13 web1 heartbeat: : info: **************************
Jan5 08:49:13 web1 heartbeat: : info: Configuration validated. Starting heartbeat 3.0.4
Jan5 08:49:13 web1 heartbeat: : info: heartbeat: version 3.0.4
Jan5 08:49:13 web1 heartbeat: : info: Heartbeat generation: 1483526594
Jan5 08:49:13 web1 heartbeat: : info: glib: UDP Broadcast heartbeat started on port 694 (694) interface eth1
Jan5 08:49:13 web1 heartbeat: : info: glib: UDP Broadcast heartbeat closed on port 694 interface eth1 - Status: 1
Jan5 08:49:13 web1 heartbeat: : info: glib: ping heartbeat started.
Jan5 08:49:13 web1 heartbeat: : info: G_main_add_TriggerHandler: Added signal manual handler
Jan5 08:49:13 web1 heartbeat: : info: G_main_add_TriggerHandler: Added signal manual handler
Jan5 08:49:13 web1 heartbeat: : info: G_main_add_SignalHandler: Added signal handler for signal 17
Jan5 08:49:13 web1 heartbeat: : info: Local status now set to: 'up'
Jan5 08:49:13 web1 heartbeat: : info: Link 172.16.100.2:172.16.100.2 up.
Jan5 08:49:13 web1 heartbeat: : info: Status update for node 172.16.100.2: status ping
Jan5 08:49:13 web1 heartbeat: : info: Link web1.sdzy.cn:eth1 up.
Jan5 08:49:44 web1 heartbeat: : WARN: node web2.sdzy.cn: is dead
Jan5 08:49:44 web1 heartbeat: : info: Comm_now_up(): updating status to active
Jan5 08:49:44 web1 heartbeat: : info: Local status now set to: 'active'
Jan5 08:49:44 web1 heartbeat: : WARN: No STONITH device configured.
Jan5 08:49:44 web1 heartbeat: : WARN: Shared disks are not protected.
Jan5 08:49:44 web1 heartbeat: : info: Resources being acquired from web2.sdzy.cn.
Jan5 08:49:44 web1 harc(default): info: Running /etc/ha.d//rc.d/status status
Jan5 08:49:44 web1 mach_down(default): info: /usr/share/heartbeat/mach_down: nice_failback: foreign resources acquired
Jan5 08:49:44 web1 mach_down(default): info: mach_down takeover complete for node web2.sdzy.cn.
Jan5 08:49:44 web1 heartbeat: : info: mach_down takeover complete.
Jan5 08:49:44 web1 heartbeat: : info: Initial resource acquisition complete (mach_down)
Jan5 08:49:45 web1 /usr/lib/ocf/resource.d//heartbeat/IPaddr2(IPaddr2_172.16.100.4): INFO:Resource is stopped
Jan5 08:49:45 web1 heartbeat: : info: Local Resource acquisition completed.
Jan5 08:49:45 web1 harc(default): info: Running /etc/ha.d//rc.d/ip-request-resp ip-request-resp
Jan5 08:49:45 web1 ip-request-resp(default): received ip-request-resp IPaddr2::172.16.100.4/24/eth1 OK yes
Jan5 08:49:45 web1 ResourceManager(default): info: Acquiring resource group: web1.sdzy.cn IPaddr2::172.16.100.4/24/eth1 httpd
Jan5 08:49:45 web1 /usr/lib/ocf/resource.d//heartbeat/IPaddr2(IPaddr2_172.16.100.4): INFO:Resource is stopped
Jan5 08:49:45 web1 ResourceManager(default): info: Running /etc/ha.d/resource.d/IPaddr2 172.16.100.4/24/eth1 start
Jan5 08:49:45 web1 IPaddr2(IPaddr2_172.16.100.4): INFO: Adding inet address 172.16.100.4/24 with broadcast address 172.16.100.255 to device eth1
Jan5 08:49:45 web1 IPaddr2(IPaddr2_172.16.100.4): INFO: Bringing device eth1 up
Jan5 08:49:45 web1 IPaddr2(IPaddr2_172.16.100.4): INFO: /usr/libexec/heartbeat/send_arp -i 200 -r 5 -p /var/run/resource-agents/send_arp-172.16.100.4 eth1 172.16.100.4 auto not_used not_used
Jan5 08:49:45 web1 /usr/lib/ocf/resource.d//heartbeat/IPaddr2(IPaddr2_172.16.100.4): INFO:Success
Jan5 08:49:45 web1 ResourceManager(default): info: Running /etc/init.d/httpdstart



我在web1的ifconfig和ipaddr的信息

# ifconfig
eth1      Link encap:EthernetHWaddr 00:50:56:2A:84:EB
          inet addr:172.16.100.8Bcast:172.16.100.255Mask:255.255.255.0
          inet6 addr: fe80::250:56ff:fe2a:84eb/64 Scope:Link
          UP BROADCAST RUNNING MULTICASTMTU:1500Metric:1
          RX packets:4712 errors:0 dropped:0 overruns:0 frame:0
          TX packets:4110 errors:0 dropped:0 overruns:0 carrier:0
          collisions:0 txqueuelen:1000
          RX bytes:836834 (817.2 KiB)TX bytes:902984 (881.8 KiB)

lo      Link encap:Local Loopback
          inet addr:127.0.0.1Mask:255.0.0.0
          inet6 addr: ::1/128 Scope:Host
          UP LOOPBACK RUNNINGMTU:16436Metric:1
          RX packets:0 errors:0 dropped:0 overruns:0 frame:0
          TX packets:0 errors:0 dropped:0 overruns:0 carrier:0
          collisions:0 txqueuelen:0
          RX bytes:0 (0.0 b)TX bytes:0 (0.0 b)

# ip addr show
1: lo: <LOOPBACK,UP,LOWER_UP> mtu 16436 qdisc noqueue state UNKNOWN
    link/loopback 00:00:00:00:00:00 brd 00:00:00:00:00:00
    inet 127.0.0.1/8 scope host lo
    inet6 ::1/128 scope host
       valid_lft forever preferred_lft forever
2: eth0: <BROADCAST,MULTICAST> mtu 1500 qdisc noop state DOWN qlen 1000
    link/ether 00:50:56:3d:8c:d0 brd ff:ff:ff:ff:ff:ff
3: eth1: <BROADCAST,MULTICAST,UP,LOWER_UP> mtu 1500 qdisc pfifo_fast state UP qlen 1000
    link/ether 00:50:56:2a:84:eb brd ff:ff:ff:ff:ff:ff
    inet 172.16.100.8/24 brd 172.16.100.255 scope global eth1
    inet 172.16.100.4/24 brd 172.16.100.255 scope global secondary eth1
    inet6 fe80::250:56ff:fe2a:84eb/64 scope link
       valid_lft forever preferred_lft forever
#



web2的


# ifconfig
eth1      Link encap:EthernetHWaddr 00:50:56:34:E7:9B
          inet addr:172.16.100.9Bcast:172.16.255.255Mask:255.255.0.0
          inet6 addr: fe80::250:56ff:fe34:e79b/64 Scope:Link
          UP BROADCAST RUNNING MULTICASTMTU:1500Metric:1
          RX packets:4955 errors:0 dropped:0 overruns:0 frame:0
          TX packets:4570 errors:0 dropped:0 overruns:0 carrier:0
          collisions:0 txqueuelen:1000
          RX bytes:899479 (878.3 KiB)TX bytes:885811 (865.0 KiB)

lo      Link encap:Local Loopback
          inet addr:127.0.0.1Mask:255.0.0.0
          inet6 addr: ::1/128 Scope:Host
          UP LOOPBACK RUNNINGMTU:16436Metric:1
          RX packets:22 errors:0 dropped:0 overruns:0 frame:0
          TX packets:22 errors:0 dropped:0 overruns:0 carrier:0
          collisions:0 txqueuelen:0
          RX bytes:1166 (1.1 KiB)TX bytes:1166 (1.1 KiB)

# ip addr show
1: lo: <LOOPBACK,UP,LOWER_UP> mtu 16436 qdisc noqueue state UNKNOWN
    link/loopback 00:00:00:00:00:00 brd 00:00:00:00:00:00
    inet 127.0.0.1/8 scope host lo
    inet6 ::1/128 scope host
       valid_lft forever preferred_lft forever
2: eth0: <BROADCAST,MULTICAST> mtu 1500 qdisc noop state DOWN qlen 1000
    link/ether 00:50:56:29:d6:a0 brd ff:ff:ff:ff:ff:ff
3: eth1: <BROADCAST,MULTICAST,UP,LOWER_UP> mtu 1500 qdisc pfifo_fast state UP qlen 1000
    link/ether 00:50:56:34:e7:9b brd ff:ff:ff:ff:ff:ff
    inet 172.16.100.9/16 brd 172.16.255.255 scope global eth1
    inet 172.16.100.4/24 brd 172.16.255.255 scope global eth1
    inet6 fe80::250:56ff:fe34:e79b/64 scope link
       valid_lft forever preferred_lft forever
#

请问,为什么web2老是dead








页: [1]
查看完整版本: 第二个节点老是dead状态