OVH Cloud OVH Cloud

heartbeat et drbd

1 réponse
Avatar
Franck
Salut la liste,

je cherche à mettre en place une redondance HA sous sarge
master = mail1 (10.1.4.2)
slave = nfs2 (10.1.4.4)

Les fichiers heartbeat sont identiques sur les 2 machines, drbd marche
bien et ce n'est pas la première fois que j'utilise HA.
Le but est de remonter /home/vpopmail qui est en drbd mais je me retrouve
avec plein d'erreurs dans les logs.

Si quelqu'un a eu ce même soucis, je suis preneur de la solution

# haresources
# ===========
mail1 10.1.4.1 drbddisk::r0
mail1 10.1.4.1 Filesystem::/dev/drbd0::/home/vpopmail::reiserfs

# LES ERREURS
# ===========
/etc/init.d/heartbeat start
heartbeat: 2005/09/26_17:05:52 info: **************************
heartbeat: 2005/09/26_17:05:52 info: Configuration validated. Starting
heartbeat 1.2.3
heartbeat: 2005/09/26_17:05:52 info: heartbeat: version 1.2.3
heartbeat: 2005/09/26_17:05:52 info: Heartbeat generation: 42
heartbeat: 2005/09/26_17:05:52 info: UDP Broadcast heartbeat started on
port 694 (694) interface eth1
heartbeat: 2005/09/26_17:05:52 info: pid 8758 locked in memory.
heartbeat: 2005/09/26_17:05:52 info: Local status now set to: 'up'
heartbeat: 2005/09/26_17:05:53 info: pid 8761 locked in memory.
heartbeat: 2005/09/26_17:05:53 info: pid 8763 locked in memory.
heartbeat: 2005/09/26_17:05:53 info: pid 8762 locked in memory.
heartbeat: 2005/09/26_17:05:53 info: Link mail1:eth1 up.
heartbeat: 2005/09/26_17:06:22 WARN: node nfs2: is dead
heartbeat: 2005/09/26_17:06:22 info: Local status now set to: 'active'
heartbeat: 2005/09/26_17:06:22 WARN: No STONITH device configured.
heartbeat: 2005/09/26_17:06:22 WARN: Shared disks are not protected.
heartbeat: 2005/09/26_17:06:22 info: Resources being acquired from nfs2.
heartbeat: 2005/09/26_17:06:22 info: Running /etc/ha.d/rc.d/status status
heartbeat: 2005/09/26_17:06:22 info: /usr/lib/heartbeat/mach_down:
nice_failback: foreign resources acquired
heartbeat: 2005/09/26_17:06:22 info: Initial resource acquisition complete
(T_RESOURCES(us))
heartbeat: 2005/09/26_17:06:22 info: mach_down takeover complete.
heartbeat: 2005/09/26_17:06:22 info: mach_down takeover complete for node
nfs2.
heartbeat: 2005/09/26_17:06:22 info: Local Resource acquisition completed.
heartbeat: 2005/09/26_17:06:22 info: Running
/etc/ha.d/rc.d/ip-request-resp ip-request-resp
heartbeat: 2005/09/26_17:06:22 received ip-request-resp 10.1.4.1 OK yes
heartbeat: 2005/09/26_17:06:23 info: Acquiring resource group: mail1
10.1.4.1 drbddisk::r0 mail1 10.1.4.1
Filesystem::/dev/drbd0::/home/vpopmail::reiserfs
heartbeat: 2005/09/26_17:06:23 info: Running /etc/ha.d/resource.d/IPaddr
10.1.4.1 start
heartbeat: 2005/09/26_17:06:23 info: /sbin/ifconfig eth1:1 10.1.4.1
netmask 255.0.0.0 broadcast 10.255.255.255
heartbeat: 2005/09/26_17:06:23 info: Sending Gratuitous Arp for 10.1.4.1
on eth1:1 [eth1]
heartbeat: 2005/09/26_17:06:23 /usr/lib/heartbeat/send_arp -i 1010 -r 5 -p
/var/lib/heartbeat/rsctmp/send_arp/send_arp-10.1.4.1 eth1 10.1.4.1 auto
10.1.4.1
heartbeat: 2005/09/26_17:06:23 info: Running /etc/ha.d/resource.d/drbddisk
r0 start
heartbeat: 2005/09/26_17:06:23 ERROR: Cannot locate resource script mail1
heartbeat: 2005/09/26_17:06:23 ERROR: Cannot locate resource script mail1
heartbeat: 2005/09/26_17:06:23 ERROR: Cannot locate resource script mail1
heartbeat: 2005/09/26_17:06:23 info: Running
/etc/ha.d/resource.d/Filesystem /dev/drbd0 /home/vpopmail reiserfs start
heartbeat: 2005/09/26_17:06:26 info: Running
/etc/ha.d/rc.d/ip-request-resp ip-request-resp
heartbeat: 2005/09/26_17:06:26 received ip-request-resp 10.1.4.1 OK yes
heartbeat: 2005/09/26_17:06:26 info: Acquiring resource group: mail1
10.1.4.1 drbddisk::r0 mail1 10.1.4.1
Filesystem::/dev/drbd0::/home/vpopmail::reiserfs
heartbeat: 2005/09/26_17:06:26 ERROR: Cannot locate resource script mail1
heartbeat: 2005/09/26_17:06:26 ERROR: Cannot locate resource script mail1
heartbeat: 2005/09/26_17:06:26 ERROR: Cannot locate resource script mail1
heartbeat: 2005/09/26_17:06:33 info: Local Resource acquisition completed.
(none)
heartbeat: 2005/09/26_17:06:33 info: local resource transition completed.
###########

y'a des erreurs mais c'est monté


# LES ERREURS
# ===========
/etc/init.d/heartbeat stop
heartbeat: 2005/09/26_17:08:54 info: Heartbeat shutdown in progress. (8758)
heartbeat: 2005/09/26_17:08:54 info: Giving up all HA resources.
heartbeat: 2005/09/26_17:08:54 info: Releasing resource group: mail1
10.1.4.1 drbddisk::r0 mail1 10.1.4.1
Filesystem::/dev/drbd0::/home/vpopmail::reiserfs
heartbeat: 2005/09/26_17:08:54 info: Running
/etc/ha.d/resource.d/Filesystem /dev/drbd0 /home/vpopmail reiserfs stop
heartbeat: 2005/09/26_17:08:54 info: Running /etc/ha.d/resource.d/IPaddr
10.1.4.1 stop
heartbeat: 2005/09/26_17:08:54 info: /sbin/route -n del -host 10.1.4.1
heartbeat: 2005/09/26_17:08:54 info: /sbin/ifconfig eth1:1 down
heartbeat: 2005/09/26_17:08:54 info: IP Address 10.1.4.1 released
heartbeat: 2005/09/26_17:08:54 ERROR: Cannot locate resource script mail1
heartbeat: 2005/09/26_17:08:54 ERROR: Cannot locate resource script mail1
heartbeat: 2005/09/26_17:08:55 info: Retrying failed stop operation [mail1]
heartbeat: 2005/09/26_17:08:55 ERROR: Cannot locate resource script mail1
heartbeat: 2005/09/26_17:08:55 ERROR: Cannot locate resource script mail1
heartbeat: 2005/09/26_17:08:56 info: Retrying failed stop operation [mail1]
heartbeat: 2005/09/26_17:08:56 ERROR: Cannot locate resource script mail1
heartbeat: 2005/09/26_17:08:56 ERROR: Cannot locate resource script mail1
heartbeat: 2005/09/26_17:08:57 info: Retrying failed stop operation [mail1]
heartbeat: 2005/09/26_17:08:57 ERROR: Cannot locate resource script mail1
heartbeat: 2005/09/26_17:08:57 ERROR: Cannot locate resource script mail1
heartbeat: 2005/09/26_17:08:58 info: Retrying failed stop operation [mail1]
heartbeat: 2005/09/26_17:08:58 ERROR: Cannot locate resource script mail1
heartbeat: 2005/09/26_17:08:58 ERROR: Cannot locate resource script mail1
heartbeat: 2005/09/26_17:08:59 info: Retrying failed stop operation [mail1]
heartbeat: 2005/09/26_17:08:59 ERROR: Cannot locate resource script mail1
heartbeat: 2005/09/26_17:08:59 ERROR: Cannot locate resource script mail1
heartbeat: 2005/09/26_17:09:00 info: Retrying failed stop operation [mail1]
heartbeat: 2005/09/26_17:09:00 ERROR: Cannot locate resource script mail1
heartbeat: 2005/09/26_17:09:00 ERROR: Cannot locate resource script mail1
heartbeat: 2005/09/26_17:09:01 info: Retrying failed stop operation [mail1]
heartbeat: 2005/09/26_17:09:01 ERROR: Cannot locate resource script mail1
heartbeat: 2005/09/26_17:09:01 ERROR: Cannot locate resource script mail1
heartbeat: 2005/09/26_17:09:02 info: Retrying failed stop operation [mail1]
heartbeat: 2005/09/26_17:09:02 ERROR: Cannot locate resource script mail1
heartbeat: 2005/09/26_17:09:02 ERROR: Cannot locate resource script mail1
heartbeat: 2005/09/26_17:09:03 info: Retrying failed stop operation [mail1]
heartbeat: 2005/09/26_17:09:03 ERROR: Cannot locate resource script mail1
heartbeat: 2005/09/26_17:09:03 ERROR: Cannot locate resource script mail1
heartbeat: 2005/09/26_17:09:04 info: Retrying failed stop operation [mail1]
heartbeat: 2005/09/26_17:09:04 ERROR: Cannot locate resource script mail1
heartbeat: 2005/09/26_17:09:04 ERROR: Cannot locate resource script mail1
heartbeat: 2005/09/26_17:09:04 ERROR: Cannot locate resource script mail1
heartbeat: 2005/09/26_17:09:04 ERROR: Resource script for mail1 probably
not LSB-compliant.
heartbeat: 2005/09/26_17:09:05 WARN: it (mail1) MUST succeed on a stop
when already stopped
heartbeat: 2005/09/26_17:09:05 WARN: Machine reboot narrowly avoided!
heartbeat: 2005/09/26_17:09:05 info: Running /etc/ha.d/resource.d/drbddisk
r0 stop
heartbeat: 2005/09/26_17:09:05 info: Running /etc/ha.d/resource.d/IPaddr
10.1.4.1 stop
heartbeat: 2005/09/26_17:09:05 info: Releasing resource group: mail1
10.1.4.1 drbddisk::r0 mail1 10.1.4.1
Filesystem::/dev/drbd0::/home/vpopmail::reiserfs
heartbeat: 2005/09/26_17:09:05 info: Running
/etc/ha.d/resource.d/Filesystem /dev/drbd0 /home/vpopmail reiserfs stop
heartbeat: 2005/09/26_17:09:06 WARNING: Filesystem /home/vpopmail not
mounted?
heartbeat: 2005/09/26_17:09:06 info: Running /etc/ha.d/resource.d/IPaddr
10.1.4.1 stop
heartbeat: 2005/09/26_17:09:06 ERROR: Cannot locate resource script mail1
heartbeat: 2005/09/26_17:09:06 ERROR: Cannot locate resource script mail1
heartbeat: 2005/09/26_17:09:07 info: Retrying failed stop operation [mail1]
heartbeat: 2005/09/26_17:09:07 ERROR: Cannot locate resource script mail1
heartbeat: 2005/09/26_17:09:07 ERROR: Cannot locate resource script mail1
heartbeat: 2005/09/26_17:09:08 info: Retrying failed stop operation [mail1]
heartbeat: 2005/09/26_17:09:08 ERROR: Cannot locate resource script mail1
heartbeat: 2005/09/26_17:09:08 ERROR: Cannot locate resource script mail1
heartbeat: 2005/09/26_17:09:09 info: Retrying failed stop operation [mail1]
heartbeat: 2005/09/26_17:09:09 ERROR: Cannot locate resource script mail1
heartbeat: 2005/09/26_17:09:09 ERROR: Cannot locate resource script mail1
heartbeat: 2005/09/26_17:09:10 info: Retrying failed stop operation [mail1]
heartbeat: 2005/09/26_17:09:10 ERROR: Cannot locate resource script mail1
heartbeat: 2005/09/26_17:09:10 ERROR: Cannot locate resource script mail1
heartbeat: 2005/09/26_17:09:11 info: Retrying failed stop operation [mail1]
heartbeat: 2005/09/26_17:09:11 ERROR: Cannot locate resource script mail1
heartbeat: 2005/09/26_17:09:11 ERROR: Cannot locate resource script mail1
heartbeat: 2005/09/26_17:09:12 info: Retrying failed stop operation [mail1]
heartbeat: 2005/09/26_17:09:12 ERROR: Cannot locate resource script mail1
heartbeat: 2005/09/26_17:09:12 ERROR: Cannot locate resource script mail1
heartbeat: 2005/09/26_17:09:13 info: Retrying failed stop operation [mail1]
heartbeat: 2005/09/26_17:09:13 ERROR: Cannot locate resource script mail1
heartbeat: 2005/09/26_17:09:13 ERROR: Cannot locate resource script mail1
heartbeat: 2005/09/26_17:09:14 info: Retrying failed stop operation [mail1]
heartbeat: 2005/09/26_17:09:14 ERROR: Cannot locate resource script mail1
heartbeat: 2005/09/26_17:09:14 ERROR: Cannot locate resource script mail1
heartbeat: 2005/09/26_17:09:15 info: Retrying failed stop operation [mail1]
heartbeat: 2005/09/26_17:09:15 ERROR: Cannot locate resource script mail1
heartbeat: 2005/09/26_17:09:15 ERROR: Cannot locate resource script mail1
heartbeat: 2005/09/26_17:09:16 info: Retrying failed stop operation [mail1]
heartbeat: 2005/09/26_17:09:16 ERROR: Cannot locate resource script mail1
heartbeat: 2005/09/26_17:09:16 ERROR: Cannot locate resource script mail1
heartbeat: 2005/09/26_17:09:16 ERROR: Cannot locate resource script mail1
heartbeat: 2005/09/26_17:09:16 ERROR: Resource script for mail1 probably
not LSB-compliant.
heartbeat: 2005/09/26_17:09:16 WARN: it (mail1) MUST succeed on a stop
when already stopped
heartbeat: 2005/09/26_17:09:16 WARN: Machine reboot narrowly avoided!
heartbeat: 2005/09/26_17:09:16 info: Running /etc/ha.d/resource.d/drbddisk
r0 stop
heartbeat: 2005/09/26_17:09:16 info: Running /etc/ha.d/resource.d/IPaddr
10.1.4.1 stop
heartbeat: 2005/09/26_17:09:16 info: All HA resources relinquished.
heartbeat: 2005/09/26_17:09:17 info: killing HBFIFO process 8761 with
signal 15
heartbeat: 2005/09/26_17:09:17 info: killing HBWRITE process 8762 with
signal 15
heartbeat: 2005/09/26_17:09:17 info: killing HBREAD process 8763 with
signal 15
heartbeat: 2005/09/26_17:09:17 info: Core process 8761 exited. 3 remaining
heartbeat: 2005/09/26_17:09:17 info: Core process 8762 exited. 2 remaining
heartbeat: 2005/09/26_17:09:17 info: Core process 8763 exited. 1 remaining
heartbeat: 2005/09/26_17:09:17 info: Heartbeat shutdown complete.
################

Excusez de la longueur mais comme ça, vous avez tous les éléments

merci de toutes vos pistes

Franck
--
http://www.linuxpourtous.com


--
Pensez à lire la FAQ de la liste avant de poser une question :
http://wiki.debian.net/?DebianFrench

Pensez à rajouter le mot ``spam'' dans vos champs "From" et "Reply-To:"

To UNSUBSCRIBE, email to debian-user-french-REQUEST@lists.debian.org
with a subject of "unsubscribe". Trouble? Contact listmaster@lists.debian.org

1 réponse

Avatar
pingouin osmolateur
--- Franck a écrit :

Salut la liste,



Salut


je cherche à mettre en place une redondance HA sous
sarge
master = mail1 (10.1.4.2)
slave = nfs2 (10.1.4.4)



Qu'est ce que tu veux dire par la la machine d'adresse
IP 10.1.4.2 est le maitre et qu'il founit le service
mail1 ?
Et nfs2 est un service offert par slave ?

Je comprends pas trop ce que tu veux dire ?

De mon coté je l'utilise pour mysql et ldap

Les fichiers heartbeat sont identiques sur les 2
machines, drbd marche
bien et ce n'est pas la première fois que j'utilise
HA.



Peux tu nous fournir le fichier de conf de heartbeat
et de drbd, vérifie que les 2 fichiers sont identique
avec un diff

Le but est de remonter /home/vpopmail qui est en
drbd mais je me retrouve
avec plein d'erreurs dans les logs.

Si quelqu'un a eu ce même soucis, je suis preneur de
la solution

# haresources
# ========== > mail1 10.1.4.1 drbddisk::r0
mail1 10.1.4.1
Filesystem::/dev/drbd0::/home/vpopmail::reiserfs



Ok en voila un bout. Ok je commence à comprendre :
mail1 s'est le nom de ta machine qui va etre master et
qui va utiliser la partition /home/vpopmail sur une
partition drbd0. L'adresse IP aliasé sera 10.1.4.1

[..]

heartbeat: 2005/09/26_17:06:22 WARN: node nfs2: is
dead



Il semble que le slave (de nom nfs2 ne soit pas
joignable.
Utilises-tu un cable serie ou un cable ethernet ou
toute autre chose qui puisse faire le dialogue entre
les deux noeuds (Si c'est un fourchette ça marchera
pas :-))

heartbeat: 2005/09/26_17:06:22 info: Local status
now set to: 'active'
heartbeat: 2005/09/26_17:06:22 WARN: No STONITH
device configured.
heartbeat: 2005/09/26_17:06:22 WARN: Shared disks
are not protected.
heartbeat: 2005/09/26_17:06:22 info: Resources being
acquired from nfs2.
heartbeat: 2005/09/26_17:06:22 info: Running
/etc/ha.d/rc.d/status status
heartbeat: 2005/09/26_17:06:22 info:
/usr/lib/heartbeat/mach_down:
nice_failback: foreign resources acquired
heartbeat: 2005/09/26_17:06:22 info: Initial
resource acquisition complete
(T_RESOURCES(us))
heartbeat: 2005/09/26_17:06:22 info: mach_down
takeover complete.
heartbeat: 2005/09/26_17:06:22 info: mach_down
takeover complete for node
nfs2.
heartbeat: 2005/09/26_17:06:22 info: Local Resource
acquisition completed.
heartbeat: 2005/09/26_17:06:22 info: Running
/etc/ha.d/rc.d/ip-request-resp ip-request-resp
heartbeat: 2005/09/26_17:06:22 received
ip-request-resp 10.1.4.1 OK yes
heartbeat: 2005/09/26_17:06:23 info: Acquiring
resource group: mail1
10.1.4.1 drbddisk::r0 mail1 10.1.4.1
Filesystem::/dev/drbd0::/home/vpopmail::reiserfs



Attention on dirait que ton fichier de conf à un pb
car heartbeat essaie de lancer le script "10.1.4.1
drbddisk::r0 mail1 10.1.4.1" comme si il n'avait pas
vu qu'il y avait un fin de ligne

heartbeat: 2005/09/26_17:06:23 info: Running
/etc/ha.d/resource.d/IPaddr
10.1.4.1 start
heartbeat: 2005/09/26_17:06:23 info: /sbin/ifconfig
eth1:1 10.1.4.1
netmask 255.0.0.0 broadcast 10.255.255.255
heartbeat: 2005/09/26_17:06:23 info: Sending
Gratuitous Arp for 10.1.4.1
on eth1:1 [eth1]
heartbeat: 2005/09/26_17:06:23
/usr/lib/heartbeat/send_arp -i 1010 -r 5 -p
/var/lib/heartbeat/rsctmp/send_arp/send_arp-10.1.4.1
eth1 10.1.4.1 auto
10.1.4.1
heartbeat: 2005/09/26_17:06:23 info: Running
/etc/ha.d/resource.d/drbddisk
r0 start



la il lance r0 ce qui est normal


heartbeat: 2005/09/26_17:06:23 ERROR: Cannot locate
resource script mail1



Et la il essaye de lancer le mail1

heartbeat: 2005/09/26_17:06:23 ERROR: Cannot locate
resource script mail1
heartbeat: 2005/09/26_17:06:23 ERROR: Cannot locate
resource script mail1
heartbeat: 2005/09/26_17:06:23 info: Running
/etc/ha.d/resource.d/Filesystem /dev/drbd0
/home/vpopmail reiserfs start
heartbeat: 2005/09/26_17:06:26 info: Running
/etc/ha.d/rc.d/ip-request-resp ip-request-resp
heartbeat: 2005/09/26_17:06:26 received
ip-request-resp 10.1.4.1 OK yes
heartbeat: 2005/09/26_17:06:26 info: Acquiring
resource group: mail1
10.1.4.1 drbddisk::r0 mail1 10.1.4.1
Filesystem::/dev/drbd0::/home/vpopmail::reiserfs
heartbeat: 2005/09/26_17:06:26 ERROR: Cannot locate
resource script mail1
heartbeat: 2005/09/26_17:06:26 ERROR: Cannot locate
resource script mail1
heartbeat: 2005/09/26_17:06:26 ERROR: Cannot locate
resource script mail1
heartbeat: 2005/09/26_17:06:33 info: Local Resource
acquisition completed.
(none)
heartbeat: 2005/09/26_17:06:33 info: local resource
transition completed.
###########

y'a des erreurs mais c'est monté



Voila ce que je pense.

bonne nuit
AC






___________________________________________________________________________
Appel audio GRATUIT partout dans le monde avec le nouveau Yahoo! Messenger
Téléchargez cette version sur http://fr.messenger.yahoo.com


--
Pensez à lire la FAQ de la liste avant de poser une question :
http://wiki.debian.net/?DebianFrench

Pensez à rajouter le mot ``spam'' dans vos champs "From" et "Reply-To:"

To UNSUBSCRIBE, email to
with a subject of "unsubscribe". Trouble? Contact