HiveBrain v1.2.0
Get Started
← Back to all entries
debugsqlMinor

Galera SST fails

Submitted by: @import:stackexchange-dba··
0
Viewed 0 times
galerafailssst

Problem

I am using MariaDB 10.2

I have node2 (2.2.2.2) running as a slave to some external database.
I bootstrapped node2 as the first member of a Galera cluster "my_cluster"

When I try to join node1 (1.1.1.1) to the cluster I get failures.

I see 4 rsync processes running on node1:

mysql    20458  0.0  0.0   4504   788 ?        S    10:49   0:00 sh -c wsrep_sst_rsync --role 'joiner' --address '1.1.1.1:4444' --datadir '/var/mysql/datadir/'   --parent '20440' --binlog '/var/mysql/log/mysql-bin' 
mysql    20459  0.0  0.0   4504  1712 ?        S    10:49   0:00 /bin/sh -ue /usr//bin/wsrep_sst_rsync --role joiner --address 1.1.1.1:4444 --datadir /var/mysql/datadir/ --parent 20440 --binlog /var/mysql/log/mysql-bin
mysql    20500  0.0  0.0  12784  2636 ?        S    10:49   0:00 rsync --daemon --no-detach --port 4444 --config /var/mysql/datadir//rsync_sst.conf
mysql    20755  0.0  0.0  26528  2844 ?        S    10:49   0:00 rsync --daemon --no-detach --port 4444 --config /var/mysql/datadir//rsync_sst.conf
mysql    20779  9.8  0.0  26788  1460 ?        R    10:49   1:00 rsync --daemon --no-detach --port 4444 --config /var/mysql/datadir//rsync_sst.conf


and on node2:

```
mysql 25860 0.0 0.0 4504 748 ? S 10:49 0:00 sh -c wsrep_sst_rsync --role 'donor' --address '1.1.1.1:4444/rsync_sst' --socket '/var/run/mysqld/mysqld.sock' --datadir '/var/mysql/datadir/' --binlog '/var/mysql/log/mysql-bin' --gtid '09e3b6c8-343c-11e8-87cf-07a9813fdf95:0' --gtid-domain-id '0'
mysql 25861 0.0 0.0 4504 1704 ? S 10:49 0:00 /bin/sh -ue /usr//bin/wsrep_sst_rsync --role donor --address 1.1.1.1:4444/rsync_sst --socket /var/run/mysqld/mysqld.sock --datadir /var/mysql/datadir/ --binlog /var/mysql/log/mysql-bin --gtid 09e3b6c8-343c-11e8-87cf-07a9813fdf95:0 --gtid-domain-id 0
mysql 25909 0.0 0.0 6468 1960 ? S 10:49 0:00 xargs -I{} -0 -P 8 rsync --owner --group --perms --links --specials --ignore-times --inplace --recursive --delete --qu

Solution

wsrep_sst_donor=2.2.2.2


wsrep_sst_donor should have the node name, not the IP address.

A similar case was reported in the MariaDB Jira as MDEV-13687.

Code Snippets

wsrep_sst_donor=2.2.2.2

Context

StackExchange Database Administrators Q#202846, answer score: 3

Revisions (0)

No revisions yet.