Week 6

Cluster and High Availability

OPS3 - Virtualization and Cloud Infrastructure

Welcome to Week 6!

What You'll Learn This Week

1. Creating a Cluster

1.1 Initialization

1.2 Joining a Node

Section 1 Checkpoint

Summary:

Reflection:

Resources:

2. Quorum: The Rule of Majority Algorithm

2.1 The Split Brain Condition

"Split Brain" is a catastrophic failure state in a clustered environment where network communication is severed between nodes, yet the nodes themselves remain operational.

Consider a two-node cluster (Node A and Node B) where the heartbeat connection fails:

Figure 3: Split Brain Scenario - A network cut leads to dual active masters ensuring data corruption without quorum logic

2.2 Quorum Logic

To prevent Split Brain, the Proxmox Cluster Manager (pvecm) enforces a strictly democratic requirement: operations can only proceed if a strict majority of nodes are present. The formula for this is (Total Votes / 2) + 1.

Section 2 Checkpoint

Summary:

Reflection:

Resources:

3. High Availability (HA) Manager

3.1 Architecture Components

The HA system is composed of two primary agents that work in tandem to maintain service availability.

3.2 Fencing Mechanism

The HA mechanism relies on absolute certainty. Before the cluster can steal VMs from a non-responsive node, it must be 100% sure that the node is truly dead.

Figure 5: The Fencing Process - How the cluster physically isolates a failed node before recovering its workloads

Section 3 Checkpoint

Summary:

Reflection:

Resources:

4. Troubleshooting the Cluster

4.1 Check Quorum

4.2 Check Corosync

4.3 Force Quorum (Emergency Only)

In a catastrophic scenario where you have a 2-node cluster and one node permanently fails, the survivor will lose quorum (1 vote < 2 required). To recover management capability on the survivor, you can artificially lower the expected vote count.

Warning: This command tells the survivor, "Pretend we only expected 1 vote." This allows it to become quorate alone. You must only do this if you are absolutely certain the other node is dead. If the other node comes back online while this is active, you will cause a Split Brain scenario.

Section 4 Checkpoint

Summary:

Reflection:

Resources:

5. Live Migration CLI

Section 5 Checkpoint

Summary:

Reflection:

Resources:

6. Enterprise Shared Storage Architectures

6.1 Distributed Storage: Ceph (Advanced)

While ZFS is the gold standard for local storage, modern data centers often span multiple servers. Ceph is a massively scalable, distributed, self-healing file system that runs across a cluster of Proxmox nodes.

6.1.1 Architecture Components Ceph is not just software; it is a living ecosystem made of daemons:

6.1.2 Implementation in Proxmox (HCI) Proxmox VE is unique because it integrates Ceph directly into the hypervisor (Hyper-Converged Infrastructure). You do not need external storage servers. The architecture diagram below shows how OSDs, MONs, and MGRs work together across a Ceph cluster:

Figure 8: Ceph Distributed Storage - OSDs manage disks, MONs maintain cluster maps, and MGRs collect metrics across multiple nodes

6.2 External Shared Storage (SAN & NAS)

While Ceph is great for internal storage, many enterprises already have massive external storage arrays (SANs). Proxmox connects to these using standard protocols.

Section 6 Checkpoint

Summary:

Reflection:

Resources:

7. Proxmox Backups (VZDump)

7.1 Backups (VZDump) vs Snapshots

Figure 9: Snapshot vs. Backup - Snapshots are dependent save points for testing; Backups are independent archives for disaster recovery

7.2 Proxmox Backup Modes

When performing a backup, the state of the VM determines the consistency of the data.

Figure 10: Proxmox Backup Modes - Live (Snapshot), Suspend (Frozen), and Stop (Consistent) modes balance uptime vs. data consistency

Section 7 Checkpoint

Summary:

8. Managing Storage via CLI (pvesm)

Section 8 Checkpoint

Summary:

9. Additional Resources

10. Lab Exercises

Summary

Review the key concepts covered in this week's material

Questions?