Continuent Blog: Zero Downtime: Upgrading MySQL Server in a Tungsten Cluster

Blog

Introduction

Tungsten Clustering allows for many types of maintenance to happen with no downtime at all.

This blog post will explore how to upgrade the actual MySQL Server on all cluster nodes with zero downtime. These same steps may be used to upgrade the MySQL server or patch the OS.

Zero-Downtime Upgrade Steps

This is a zero-downtime upgrade:

Pick a replica node and shun it, and take all replication offline
- cctrl> datasource NODENAME shun
- tungsten@shell> trepctl -all-services offline
Upgrade the MySQL Server on that node
Bring up replication and verify health
- tungsten@shell> trepctl -all-services online
- tungsten@shell> trepctl services
Ensure that the Replicator has time to catch up after being offline for a period of time
- tungsten@shell> trepctl services
Welcome the replica node back into the cluster
- cctrl> datasource NODENAME welcome
Repeat for the other Replica node
Perform a switch of the Primary to one of the upgraded Replicas
- cctrl> switch to NODENAME
Upgrade the old Primary with the same steps
Done!

The complete procedure is detailed here.

Note

There is always a brief hang when a switch is performed and the Connectors re-route the client traffic to a different node. Additionally, a reconnect will be required when running the Connectors in Bridge mode.

Important

The above procedure is NOT the correct way to upgrade the Tungsten software! For the correct Tungsten software upgrade procedure, please visit:

Zero-Downtime Best Practices

The best practice is to leave the cluster in Automatic mode, because shunning a node takes care of preventing both failover to the node and Connector reads from it.

Using Maintenance mode degrades the service level, because in Maintenance mode, there is no High Availability (HA) providing automatic failover if the MySQL Server dies.

In Automatic mode, there is HA/automatic failover, because the Manager still provides quorum even though the node is shunned.

Take the Replicator offline gracefully using `trepctl offline` before taking down the database.
Bring the Replicator online and verify status using `trepctl online` and `trepctl services` or `trepctl status`.

Wrap-Up

In this post we summarized the zero-downtime upgrade procedure for MySQL Server in a Tungsten Cluster. We also covered best practices for upgrading.

Smooth sailing!

Published In

Categories:

Cluster Management, Database Administration, Zero Downtime Maintenance

Series:

Tungsten University

Tags:

MySQL, MariaDB, zero downtime, Maintenance, update, High Availability

Author

Eric M. Stone

COO and VP of Product Management

Eric is a veteran of fast-paced, large-scale enterprise environments with 40 years of Information Technology experience. With a focus on HA/DR, from building data centers and trading floors to world-wide deployments, Eric has architected, coded, deployed and administered systems for a wide variety of disparate customers, from Fortune 500 financial institutions to SMB’s.

View All Eric M.’s Posts

Zero Downtime: Upgrading MySQL Server in a Tungsten Cluster

Introduction

Zero-Downtime Upgrade Steps

Zero-Downtime Best Practices

Wrap-Up

Published In

Author

Comments

Add new comment

Filtered HTML