MySQL Forums
Forum List  »  NDB clusters

Could difference in network speed cause cluster crash?
Posted by: Anton Aleksandrov
Date: July 21, 2005 01:24PM

I don't want to open a new theme, because this is probably the same question.
After 6 hours of hard testing (selecting and updating lots of records) we got these messages in cluster's log file:

2005-07-21 17:25:35 [MgmSrvr] WARNING -- Node 7: Transporter to node 12 reported error 0x16
2005-07-21 17:25:35 [MgmSrvr] WARNING -- Node 7: Transporter to node 12 reported error 0x16 - Repeated 3 times
2005-07-21 17:25:35 [MgmSrvr] WARNING -- Node 7: Transporter to node 14 reported error 0x16
2005-07-21 17:25:56 [MgmSrvr] WARNING -- Node 7: Transporter to node 13 reported error 0x16

There are thousands of these records there. Nodes 12, 13 and 14 are API nodes with mysqld. Node 7 is one of db nodes. Error messages were coming from different db nodes, but not from all of them. After some time (approx. after 3.5 hours) cluster (db nodes only) crashed with few of these records, coming from db nodes:

2005-07-21 20:45:53 [MgmSrvr] INFO -- Node 10: Possible bug in Dbdih::execBLOCK_COMMIT_ORD c_blockCommit = 1 c_blockCommitNo = 7 sig->failNo =

MGM and DB nodes are on Gbit lan, but API nodes are on 100Mbit lan. They all are running 5.0.9-beta. I know it's still a beta, but we wanted to see how it will be (works great, except this crash). If it's our fault, could it happen because of different network speed?

I have posted a bug report, but seems that client will ask me to downgrade to 4.1.13 tomorrow morning. He doesn't like when database is crashing and wants to be sure it will not happen when page goes life.

Regards,
Anton.

Options: ReplyQuote




Sorry, you can't reply to this topic. It has been closed.

Content reproduced on this site is the property of the respective copyright holders. It is not reviewed in advance by Oracle and does not necessarily represent the opinion of Oracle or any other party.