Проблема с heartbeat

Есть и такой ОС.

Модератор: weec

Правила форума
Убедительная просьба юзать теги [cоde] при оформлении листингов.
Сообщения не оформленные должным образом имеют все шансы быть незамеченными.
100matolog
ст. сержант
Сообщения: 309
Зарегистрирован: 2008-05-30 12:11:16
Откуда: kiev
Контактная информация:

Проблема с heartbeat

Непрочитанное сообщение 100matolog » 2010-09-06 17:57:01

Есть два сервера. Два реальных айпи.

Код: Выделить всё

cat  /etc/ha.d/ha.conf
logfile	/var/log/ha-log
logfacility	local0
keepalive 1
deadtime 8
warntime 4
initdead 120
udpport	694
ucast eth1 172.16.0.7
auto_failback on
node	srv-4-1.server 
node	srv-4-0.server
В логе на srv-4-0.server постоянно

Код: Выделить всё

ail -f /var/log/ha-log 
heartbeat[3943]: 2010/09/06_17:38:05 WARN: nodename srv-4-0.server uuid changed to srv-4-1.server
heartbeat[3943]: 2010/09/06_17:38:05 ERROR: should_drop_message: attempted replay attack [srv-4-1.server]? [gen = 1271845467, curgen = 1271845471]
heartbeat[3943]: 2010/09/06_17:38:06 WARN: nodename srv-4-1.server uuid changed to srv-4-0.server
heartbeat[3943]: 2010/09/06_17:38:06 WARN: nodename srv-4-0.server uuid changed to srv-4-1.server
heartbeat[3943]: 2010/09/06_17:38:06 ERROR: HBDoMsg_T_ACK: corrupted ackseq current hiseq = 32 ackseq =672 in this message
heartbeat[3943]: 2010/09/06_17:38:06 ERROR: should_drop_message: attempted replay attack [srv-4-1.server]? [gen = 1271845467, curgen = 1271845471]
heartbeat[3943]: 2010/09/06_17:38:07 WARN: nodename srv-4-1.server uuid changed to srv-4-0.server
heartbeat[3943]: 2010/09/06_17:38:07 WARN: nodename srv-4-0.server uuid changed to srv-4-1.server
heartbeat[3943]: 2010/09/06_17:38:07 ERROR: should_drop_message: attempted replay attack [srv-4-1.server]? [gen = 1271845467, curgen = 1271845471]
heartbeat[3943]: 2010/09/06_17:38:08 WARN: nodename srv-4-1.server uuid changed to srv-4-0.server
heartbeat[3943]: 2010/09/06_17:38:08 WARN: nodename srv-4-0.server uuid changed to srv-4-1.server
heartbeat[3943]: 2010/09/06_17:38:08 ERROR: should_drop_message: attempted replay attack [srv-4-1.server]? [gen = 1271845467, curgen = 1271845471]
heartbeat[3943]: 2010/09/06_17:38:09 WARN: nodename srv-4-1.server uuid changed to srv-4-0.server
heartbeat[3943]: 2010/09/06_17:38:09 WARN: nodename srv-4-0.server uuid changed to srv-4-1.server
heartbeat[3943]: 2010/09/06_17:38:09 ERROR: should_drop_message: attempted replay attack [srv-4-1.server]? [gen = 1271845467, curgen = 1271845471]
heartbeat[3943]: 2010/09/06_17:38:10 WARN: nodename srv-4-1.server uuid changed to srv-4-0.server
heartbeat[3943]: 2010/09/06_17:38:10 WARN: nodename srv-4-0.server uuid changed to srv-4-1.server
heartbeat[3943]: 2010/09/06_17:38:10 ERROR: should_drop_message: attempted replay attack [srv-4-1.server]? [gen = 1271845467, curgen = 1271845471]
heartbeat[3943]: 2010/09/06_17:38:11 WARN: nodename srv-4-1.server uuid changed to srv-4-0.server
heartbeat[3943]: 2010/09/06_17:38:11 WARN: nodename srv-4-0.server uuid changed to srv-4-1.server
heartbeat[3943]: 2010/09/06_17:38:11 ERROR: should_drop_message: attempted replay attack [srv-4-1.server]? [gen = 1271845467, curgen = 1271845471]
heartbeat[3943]: 2010/09/06_17:38:12 WARN: nodename srv-4-1.server uuid changed to srv-4-0.server
heartbeat[3943]: 2010/09/06_17:38:12 WARN: nodename srv-4-0.server uuid changed to srv-4-1.server
heartbeat[3943]: 2010/09/06_17:38:12 ERROR: should_drop_message: attempted replay attack [srv-4-1.server]? [gen = 1271845467, curgen = 1271845471]
heartbeat[3943]: 2010/09/06_17:38:13 WARN: nodename srv-4-1.server uuid changed to srv-4-0.server
heartbeat[3943]: 2010/09/06_17:38:13 WARN: nodename srv-4-0.server uuid changed to srv-4-1.server
heartbeat[3943]: 2010/09/06_17:38:13 ERROR: should_drop_message: attempted replay attack [srv-4-1.server]? [gen = 1271845467, curgen = 1271845471]
heartbeat[3943]: 2010/09/06_17:38:14 WARN: nodename srv-4-1.server uuid changed to srv-4-0.serve
И Когда падает один сервер - то второй айпи на втором сервере не поднимается. Живет только на одном

Хостинговая компания Host-Food.ru
Хостинг HostFood.ru
 

Услуги хостинговой компании Host-Food.ru

Хостинг HostFood.ru

Тарифы на хостинг в России, от 12 рублей: https://www.host-food.ru/tariffs/hosting/
Тарифы на виртуальные сервера (VPS/VDS/KVM) в РФ, от 189 руб.: https://www.host-food.ru/tariffs/virtualny-server-vps/
Выделенные сервера, Россия, Москва, от 2000 рублей (HP Proliant G5, Intel Xeon E5430 (2.66GHz, Quad-Core, 12Mb), 8Gb RAM, 2x300Gb SAS HDD, P400i, 512Mb, BBU):
https://www.host-food.ru/tariffs/vydelennyi-server-ds/
Недорогие домены в популярных зонах: https://www.host-food.ru/domains/

daltinn
проходил мимо
Сообщения: 2
Зарегистрирован: 2010-09-06 23:23:54

Re: Проблема с heartbeat

Непрочитанное сообщение daltinn » 2010-09-06 23:30:51

ha.conf идентичны на серверах?

100matolog
ст. сержант
Сообщения: 309
Зарегистрирован: 2008-05-30 12:11:16
Откуда: kiev
Контактная информация:

Re: Проблема с heartbeat

Непрочитанное сообщение 100matolog » 2010-09-07 7:36:35

Код: Выделить всё

[root@srv-4-1 ha.d]# cat ha.cf
logfile /var/log/ha-log
logfacility     local0
keepalive 1
deadtime 5
warntime 4
initdead 120
udpport 694
ucast eth1 172.16.0.8
auto_failback on
node    srv-4-1.server
node    srv-4-0.server

Код: Выделить всё

[root@srv-4-1 ~]# /sbin/ifconfig         
eth0      Link encap:Ethernet  HWaddr 00:30:48:66:E0:66  
          inet addr:*.*.*.20  Bcast:*.*.*.31  Mask:255.255.255.224
          inet6 addr: fe80::230:48ff:fe66:e066/64 Scope:Link
          UP BROADCAST RUNNING MULTICAST  MTU:1500  Metric:1
          RX packets:25353473 errors:0 dropped:0 overruns:0 frame:0
          TX packets:32979955 errors:0 dropped:0 overruns:0 carrier:0
          collisions:0 txqueuelen:1000 
          RX bytes:5732660289 (5.3 GiB)  TX bytes:40352739060 (37.5 GiB)
          Memory:d8420000-d8440000 

eth0:0    Link encap:Ethernet  HWaddr 00:30:48:66:E0:66  
          inet addr:*.*.*.13  Bcast:*.*.*.31  Mask:255.255.255.224
          UP BROADCAST RUNNING MULTICAST  MTU:1500  Metric:1
          Memory:d8420000-d8440000 

eth1      Link encap:Ethernet  HWaddr 00:30:48:66:E0:67  
          inet addr:172.16.0.8  Bcast:172.16.0.255  Mask:255.255.255.0
          inet6 addr: fe80::230:48ff:fe66:e067/64 Scope:Link
          UP BROADCAST RUNNING MULTICAST  MTU:1500  Metric:1
          RX packets:993435765 errors:0 dropped:0 overruns:0 frame:0
          TX packets:880917799 errors:0 dropped:0 overruns:0 carrier:0
          collisions:0 txqueuelen:1000 
          RX bytes:813841008191 (757.9 GiB)  TX bytes:310367816399 (289.0 GiB)
          Memory:d8460000-d8480000 

lo        Link encap:Local Loopback  
          inet addr:127.0.0.1  Mask:255.0.0.0
          inet6 addr: ::1/128 Scope:Host
          UP LOOPBACK RUNNING  MTU:16436  Metric:1
          RX packets:392897 errors:0 dropped:0 overruns:0 frame:0
          TX packets:392897 errors:0 dropped:0 overruns:0 carrier:0
          collisions:0 txqueuelen:0 
          RX bytes:327795296 (312.6 MiB)  TX bytes:327795296 (312.6 MiB)

Код: Выделить всё

[root@srv-4-0 ha.d]# cat ha.cf
logfile /var/log/ha-log
logfacility     local0
keepalive 1
deadtime 5
warntime 4
initdead 120
udpport 694
ucast eth1 172.16.0.8
auto_failback on
node    srv-4-0.server
node    srv-4-1.server

Код: Выделить всё

[root@srv-4-0 ha.d]# /sbin/ifconfig 
eth0      Link encap:Ethernet  HWaddr 00:30:48:62:D0:FC  
          inet addr:*.*.*.16  Bcast:*.*.*.31  Mask:255.255.255.224
          inet6 addr: fe80::230:48ff:fe62:d0fc/64 Scope:Link
          UP BROADCAST RUNNING MULTICAST  MTU:1500  Metric:1
          RX packets:935972 errors:0 dropped:0 overruns:0 frame:0
          TX packets:898291 errors:0 dropped:0 overruns:0 carrier:0
          collisions:0 txqueuelen:1000 
          RX bytes:430470122 (410.5 MiB)  TX bytes:963060236 (918.4 MiB)
          Memory:d8420000-d8440000 

eth0:0    Link encap:Ethernet  HWaddr 00:30:48:62:D0:FC  
          inet addr:*.*.*.14  Bcast:*.*.*.31  Mask:255.255.255.224
          UP BROADCAST RUNNING MULTICAST  MTU:1500  Metric:1
          Memory:d8420000-d8440000 

eth0:1    Link encap:Ethernet  HWaddr 00:30:48:62:D0:FC  
          inet addr:*.*.*.13  Bcast:*.*.*.31  Mask:255.255.255.224
          UP BROADCAST RUNNING MULTICAST  MTU:1500  Metric:1
          Memory:d8420000-d8440000 

eth1      Link encap:Ethernet  HWaddr 00:30:48:62:D0:FD  
          inet addr:172.16.0.7  Bcast:172.16.0.255  Mask:255.255.255.0
          inet6 addr: fe80::230:48ff:fe62:d0fd/64 Scope:Link
          UP BROADCAST RUNNING MULTICAST  MTU:1500  Metric:1
          RX packets:18995204 errors:0 dropped:0 overruns:0 frame:0
          TX packets:17387412 errors:0 dropped:0 overruns:0 carrier:0
          collisions:0 txqueuelen:1000 
          RX bytes:15691877151 (14.6 GiB)  TX bytes:6765710791 (6.3 GiB)
          Memory:d8460000-d8480000 

lo        Link encap:Local Loopback  
          inet addr:127.0.0.1  Mask:255.0.0.0
          inet6 addr: ::1/128 Scope:Host
          UP LOOPBACK RUNNING  MTU:16436  Metric:1
          RX packets:1601 errors:0 dropped:0 overruns:0 frame:0
          TX packets:1601 errors:0 dropped:0 overruns:0 carrier:0
          collisions:0 txqueuelen:0 
          RX bytes:494035 (482.4 KiB)  TX bytes:494035 (482.4 KiB)

daltinn
проходил мимо
Сообщения: 2
Зарегистрирован: 2010-09-06 23:23:54

Re: Проблема с heartbeat

Непрочитанное сообщение daltinn » 2010-09-07 9:15:10

на 4-1 в ha.cf

Код: Выделить всё

ucast eth1 172.16.0.7

node    srv-4-1.server   srv-4-0.server
проверьте соответствие srv-4-1.server имени хоста

Код: Выделить всё

man ha.cf:
node nodename1 nodename2 ...
Node names in the directive must match the "uname -n" of that machine.