[Debian-ha-maintainers] corosync-blackbox qb_rb_chunk_read failed

braun braun at dc.uni-due.de
Wed Jul 22 13:47:03 BST 2015


I want to use corosync/pacemaker in production. Is this a serious error?


corosync-blackbox ends with an error:

[debug] shm size:8392704; real_size:8392704; rb->word_size:2098176
[debug] read total of: 8392724
ERROR: qb_rb_chunk_read failed: Connection timed out
[trace] ENTERING qb_rb_close()
[debug] Free'ing ringbuffer: /dev/shm/qb-create_from_file-header
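
The sizes printed in the debug lines above can be cross-checked with a short sketch. It uses only the three numbers from that output; the 4-byte word size and the helper name `check_sizes` are assumptions made for illustration and are not taken from libqb itself.

```python
# Numbers copied from the corosync-blackbox debug output above.
SHM_SIZE = 8392704      # "shm size" / "real_size", in bytes
WORD_SIZE = 2098176     # "rb->word_size", in words
READ_TOTAL = 8392724    # "read total of", in bytes

BYTES_PER_WORD = 4      # assumption: one ring-buffer word is a 32-bit value


def check_sizes(shm_size, word_size, read_total, bytes_per_word=BYTES_PER_WORD):
    """Return (sizes_consistent, extra_bytes) for a blackbox dump.

    sizes_consistent: whether word_size * bytes_per_word equals shm_size.
    extra_bytes: how many bytes were read beyond the buffer size.
    """
    sizes_consistent = word_size * bytes_per_word == shm_size
    extra_bytes = read_total - shm_size
    return sizes_consistent, extra_bytes


consistent, extra = check_sizes(SHM_SIZE, WORD_SIZE, READ_TOTAL)
print(consistent, extra)
```

With these values the word count matches the buffer size, and the read covers 20 bytes more than the buffer. That would fit a small header stored in front of the data, but the output alone does not show whether this is related to the failed read.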

root@willi:~/errorblackbox# df -kl /dev/shm
Filesystem     1K-blocks    Used Available  Use% Mounted on
tmpfs           16462300   14448  16447852    1% /dev/shm


I followed the wiki at
https://wiki.debian.org/Debian-HA/ClustersFromScratch and use the
following packages:

corosync:
   Installed: 2.3.4-1
   Candidate: 2.3.4-1
   Version table:
  *** 2.3.4-1 0
         500 http://ppa.mmogp.com/apt/debian/ jessie/main amd64 Packages
         100 /var/lib/dpkg/status
      1.4.6-1.1 0
         500 http://debian.uni-duisburg-essen.de/debian/ jessie/main amd64 Packages
         500 http://ftp.de.debian.org/debian/ testing/main amd64 Packages
pacemaker:
   Installed: 1.1.12-1
   Candidate: 1.1.12-1
   Version table:
  *** 1.1.12-1 0
         500 http://ppa.mmogp.com/apt/debian/ jessie/main amd64 Packages
         100 /var/lib/dpkg/status
fence-agents:
   Installed: 4.0.18-1
   Candidate: 4.0.18-1
   Version table:
  *** 4.0.18-1 0
         500 http://ftp.de.debian.org/debian/ testing/main amd64 Packages
         100 /var/lib/dpkg/status
      4.0.17-2 0
         500 http://ppa.mmogp.com/apt/debian/ jessie/main amd64 Packages
      3.1.5-2 0
         500 http://debian.uni-duisburg-essen.de/debian/ jessie/main amd64 Packages
libqb0:
   Installed: 0.17.1-4
   Candidate: 0.17.1-4
   Version table:
  *** 0.17.1-4 0
         500 http://ftp.de.debian.org/debian/ testing/main amd64 Packages
         100 /var/lib/dpkg/status
      0.17.1-1 0
         500 http://ppa.mmogp.com/apt/debian/ jessie/main amd64 Packages
      0.11.1-2 0
         500 http://debian.uni-duisburg-essen.de/debian/ jessie/main amd64 Packages
libqb-dev:
   Installed: 0.17.1-4
   Candidate: 0.17.1-4
   Version table:
  *** 0.17.1-4 0
         500 http://ftp.de.debian.org/debian/ testing/main amd64 Packages
         100 /var/lib/dpkg/status
      0.17.1-1 0
         500 http://ppa.mmogp.com/apt/debian/ jessie/main amd64 Packages
      0.11.1-2 0
         500 http://debian.uni-duisburg-essen.de/debian/ jessie/main amd64 Packages

corosync.conf (modified from my wheezy setup, where I did not try corosync-blackbox):

totem {
     version: 2

         ip_version: ipv4

     # How long before declaring a token lost (ms)
     token: 4000

     # How many token retransmits before forming a new configuration
     token_retransmits_before_loss_const: 10

     # How long to wait for join messages in the membership protocol (ms)
     join: 1100

     # How long to wait for consensus to be achieved before starting a new round of membership configuration (ms)
     consensus: 3600

     # How long to wait before checking for a partition when no multicast
     merge: 1000

     # Turn off the virtual synchrony filter
     vsftype: none

     # Number of messages that may be sent by one processor on receipt of the token
     max_messages: 100

     # Limit generated nodeids to 31-bits (positive signed integers)
     clear_node_high_bit: yes

     crypto_cipher: aes256
     crypto_hash: sha1

###    # Disable encryption
###     secauth: off

     # How many threads to use for encryption/decryption
     ### we have 4 CPU cores C.B.
      threads: 4

     # Optionally assign a fixed node id (integer)
     ### I took this from the IP address; we do not actually need it with IPv4 C.B.
#    nodeid: 22

     # This specifies the mode of redundant ring, which may be none, active, or passive.
      rrp_mode: none

### used unicast for the interfaces, since there are only 2 hosts and it is simpler than multicast
### but somehow it does not work
      interface {
         # The following values need to be set based on your environment
         # for diskmirror
         ringnumber: 0
         bindnetaddr: 192.168.63.0
         broadcast: yes
         ##mcastaddr: 230.168.63.1
         #mcastport: 5405
         mcastport: 5405
         }
         transport: udpu
}

nodelist {
               node {
         ring0_addr: 192.168.63.22
         nodeid: 22
                 }
               node {
         ring0_addr: 192.168.63.24
         nodeid: 24
                 }
}


quorum {
         provider: corosync_votequorum
         two_node: 1
         expected_votes: 2
}



logging {
         fileline: off
         to_stderr: yes
         to_logfile: yes
         to_syslog: yes
     logfile: /var/log/corosync.log
     syslog_facility: daemon
         debug: off
         timestamp: on
         logger_subsys {
                 subsys: QUORUM
                 debug: off
         }
}

qb    {
     ipc_type: shm
     }
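
Because several totem timeouts above are set by hand, a quick sanity check of their relationship may be useful. The sketch below follows the description of `consensus` in corosync.conf(5): the value defaults to 1.2 * token when unset, and for two-node clusters a consensus larger than join but smaller than token is described as safe. Whether that wording applies unchanged to this 2.3.4 build is an assumption, and the helper names are hypothetical.

```python
# Values copied from the totem section above (milliseconds).
TOKEN = 4000
JOIN = 1100
CONSENSUS = 3600


def default_consensus(token_ms):
    """Consensus used when none is configured: 1.2 * token (integer math)."""
    return token_ms * 12 // 10


def two_node_consensus_ok(join_ms, consensus_ms, token_ms):
    """Two-node rule of thumb from the man page: join < consensus < token."""
    return join_ms < consensus_ms < token_ms


print(default_consensus(TOKEN))
print(two_node_consensus_ok(JOIN, CONSENSUS, TOKEN))
```

For the values in this file the two-node condition holds, so under these assumptions the hand-set consensus of 3600 ms is within the documented range for a two-node cluster.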

corosync.log:

Jul 22 13:59:32 [1431] willi corosync info    [MAIN  ] Corosync built-in features: dbus testagents rdma watchdog augeas systemd upstart xmlconf qdevices snmp pie relro bindnow
Jul 22 13:59:32 [1431] willi corosync notice  [TOTEM ] Initializing transport (UDP/IP Unicast).
Jul 22 13:59:32 [1431] willi corosync notice  [TOTEM ] Initializing transmit/receive security (NSS) crypto: aes256 hash: sha1
Jul 22 13:59:32 [1431] willi corosync notice  [TOTEM ] The network interface [192.168.63.24] is now up.
Jul 22 13:59:32 [1431] willi corosync notice  [SERV  ] Service engine loaded: corosync configuration map access [0]
Jul 22 13:59:32 [1431] willi corosync info    [QB    ] server name: cmap
Jul 22 13:59:32 [1431] willi corosync notice  [SERV  ] Service engine loaded: corosync configuration service [1]
Jul 22 13:59:32 [1431] willi corosync info    [QB    ] server name: cfg
Jul 22 13:59:32 [1431] willi corosync notice  [SERV  ] Service engine loaded: corosync cluster closed process group service v1.01 [2]
Jul 22 13:59:32 [1431] willi corosync info    [QB    ] server name: cpg
Jul 22 13:59:32 [1431] willi corosync notice  [SERV  ] Service engine loaded: corosync profile loading service [4]
Jul 22 13:59:32 [1431] willi corosync info    [WD    ] Watchdog is now been tickled by corosync.
Jul 22 13:59:32 [1431] willi corosync info    [WD    ] no resources configured.
Jul 22 13:59:32 [1431] willi corosync notice  [SERV  ] Service engine loaded: corosync watchdog service [7]
Jul 22 13:59:32 [1431] willi corosync notice  [QUORUM] Using quorum provider corosync_votequorum
Jul 22 13:59:32 [1431] willi corosync notice  [VOTEQ ] Waiting for all cluster members. Current votes: 1 expected_votes: 2
Jul 22 13:59:32 [1431] willi corosync notice  [SERV  ] Service engine loaded: corosync vote quorum service v1.0 [5]
Jul 22 13:59:32 [1431] willi corosync info    [QB    ] server name: votequorum
Jul 22 13:59:32 [1431] willi corosync notice  [SERV  ] Service engine loaded: corosync cluster quorum service v0.1 [3]
Jul 22 13:59:32 [1431] willi corosync info    [QB    ] server name: quorum
Jul 22 13:59:32 [1431] willi corosync notice  [TOTEM ] adding new UDPU member {192.168.63.22}
Jul 22 13:59:32 [1431] willi corosync notice  [TOTEM ] adding new UDPU member {192.168.63.24}
Jul 22 13:59:32 [1431] willi corosync notice  [TOTEM ] A new membership (192.168.63.24:30668) was formed. Members joined: 24
Jul 22 13:59:32 [1431] willi corosync notice  [VOTEQ ] Waiting for all cluster members. Current votes: 1 expected_votes: 2
Jul 22 13:59:32 [1431] willi corosync notice  [VOTEQ ] Waiting for all cluster members. Current votes: 1 expected_votes: 2
Jul 22 13:59:32 [1431] willi corosync notice  [VOTEQ ] Waiting for all cluster members. Current votes: 1 expected_votes: 2
Jul 22 13:59:32 [1431] willi corosync notice  [QUORUM] Members[1]: 24
Jul 22 13:59:32 [1431] willi corosync notice  [MAIN  ] Completed service synchronization, ready to provide service.
Jul 22 13:59:40 [1431] willi corosync notice  [TOTEM ] A new membership (192.168.63.22:30672) was formed. Members joined: 22
Jul 22 13:59:40 [1431] willi corosync notice  [QUORUM] This node is within the primary component and will provide service.
Jul 22 13:59:40 [1431] willi corosync notice  [QUORUM] Members[2]: 22 24
Jul 22 13:59:40 [1431] willi corosync notice  [MAIN  ] Completed service synchronization, ready to provide service.
root@willi:~/errorblackbox#

-- 
**********************************************
* email: braun at dc.uni-due.de
* Christina Braun
* WiWI/ICB/Informatik
* University DUE
* Schuetzenbahn 70  tel.: + 49    201 183-3929
* D-45127 Essen     fax.: + 49    201 183-2419
**********************************************