Background
The end of the raw device mapping tyranny on Windows Server Failover Clusters (WSFC) is nigh! If you manage virtual environments that contain Windows clusters, you probably know that these are a different kind of beast compared to your usual virtual machines equipped with thin or thick virtual disks.
WSFC on vSphere has always required the use of RDM disks. This is because WSFC uses SCSI-3 PR (Persistent Reservation) commands to coordinate access to a clustered disk resource. It essentially uses these commands to lock the volume so that only the active node is allowed to write to it.
But because VMFS has its own locking mechanisms, these SCSI commands are intercepted and discarded when issued against traditional virtual disks. Hence the need for RDM disks, which act as a mapping device to the physical LUN.
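As a rough illustration of what those reservations look like, if you happen to have a Linux box zoned to the same LUN (with sg3_utils installed), you can dump the registered keys and the active reservation. This is only a sketch; the device path is a placeholder.

```python
import subprocess

lun = "/dev/sdb"  # placeholder device backing the clustered disk

# Registered keys (one per registered initiator/node)
subprocess.run(["sg_persist", "--in", "--read-keys", lun], check=True)

# The active reservation (type and holder key)
subprocess.run(["sg_persist", "--in", "--read-reservation", lun], check=True)
```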
Managing these RDM disks isn’t that big a deal, but it is an exotic configuration in the environment: you have to stay in sync with the storage team, it adds a bunch of LUNs, and host boot times get longer… Things have gotten better since vSphere 6.5, which added the ability to vMotion running WSFC nodes.
Clustered VMDK
Anyway, vSphere 7.0 brings an amazing new feature called Clustered VMDK that allows SCSI-3 PR commands to be issued to a virtual disk, meaning you will no longer need a dedicated physical LUN to spin up a Windows failover cluster.
Unfortunately I can’t demonstrate it as it is only available on FC-backed datastores at the time of this writing and I don’t have that in my lab. However, here are a few things and caveats to know about clustered VMDK:
- It is enabled at the datastore level by setting the attribute “Clustered VMDK Supported” to “yes”. VMFS-6 is required (a quick check is sketched right after this list).
- The Windows cluster parameter “QuorumArbitrationTimeMax” must be set to 60 (see the one-liner after the list).
- The LUN backing the datastore must support ATS SCSI commands (most arrays do).
- The LUN must support Write Exclusive All Registrants (WEAR) type reservations.
- The VMDKs must be Eager Zeroed Thick and the VMs must use virtual hardware compatibility vSphere 7.0 or later (a device-spec sketch also follows the list).
- Don’t present LUNs used for clustered VMDK to hosts running a version lower than 7.0.
- Do not combine clustered and non-clustered VMDKs on the same clustered datastore.
- Dedicate one datastore per WSFC, as in don’t share a clustered datastore between multiple WSFC instances.
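For the datastore-level bits (first and third bullets), a minimal pyVmomi sketch along these lines can read the relevant flags back from vCenter. The vCenter address, credentials and datastore name are placeholders, and the clusteredVmdkSupported capability flag only shows up on recent vSphere/API versions, so treat it as a sanity check.

```python
import ssl
from pyVim.connect import SmartConnect, Disconnect
from pyVmomi import vim

# Lab-only: skip certificate verification; host/user/pwd/datastore are placeholders.
si = SmartConnect(host="vcenter.lab.local", user="administrator@vsphere.local",
                  pwd="VMware1!", sslContext=ssl._create_unverified_context())
try:
    content = si.RetrieveContent()
    view = content.viewManager.CreateContainerView(content.rootFolder, [vim.Datastore], True)
    for ds in view.view:
        if ds.name != "WSFC-DS01":   # placeholder datastore name
            continue
        # Capability flag reported for the datastore (vSphere 7.0+)
        print("Clustered VMDK supported:", getattr(ds.capability, "clusteredVmdkSupported", "n/a"))
        # Clustered VMDK requires VMFS-6
        print("VMFS version:", ds.info.vmfs.version if isinstance(ds.info, vim.host.VmfsDatastoreInfo) else "n/a")
    view.DestroyView()
finally:
    Disconnect(si)
```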
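For the QuorumArbitrationTimeMax bullet, the setting lives on the Windows cluster itself; one common way is a PowerShell one-liner run on a cluster node. It is wrapped in Python here purely to keep all the examples in one language.

```python
import subprocess

# Run on (or remote into) one of the WSFC nodes; requires the FailoverClusters module.
ps_cmd = "(Get-Cluster).QuorumArbitrationTimeMax = 60; (Get-Cluster).QuorumArbitrationTimeMax"
result = subprocess.run(
    ["powershell.exe", "-NoProfile", "-Command", ps_cmd],
    capture_output=True, text=True, check=True,
)
print("QuorumArbitrationTimeMax is now:", result.stdout.strip())
```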
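And for the Eager Zeroed Thick requirement, here is a rough pyVmomi sketch of the device-spec pieces you would feed to ReconfigVM_Task on the first node: a dedicated PVSCSI controller with physical bus sharing plus an eager zeroed thick VMDK. Keys, paths, size and disk mode are placeholders, not a full provisioning script.

```python
from pyVmomi import vim

# A dedicated PVSCSI controller with physical bus sharing for the shared disk(s)
controller = vim.vm.device.ParaVirtualSCSIController(
    key=-101, busNumber=1, sharedBus="physicalSharing",
)
ctrl_spec = vim.vm.device.VirtualDeviceSpec(
    operation=vim.vm.device.VirtualDeviceSpec.Operation.add,
    device=controller,
)

# Eager zeroed thick = eagerlyScrub True and thinProvisioned False
backing = vim.vm.device.VirtualDisk.FlatVer2BackingInfo(
    diskMode="independent_persistent",            # placeholder; follow VMware's WSFC guide
    eagerlyScrub=True,
    thinProvisioned=False,
    fileName="[WSFC-DS01] shared/quorum.vmdk",    # placeholder datastore/path
)
disk = vim.vm.device.VirtualDisk(
    key=-102, controllerKey=-101, unitNumber=0,
    capacityInKB=10 * 1024 * 1024,                # 10 GB
    backing=backing,
)
disk_spec = vim.vm.device.VirtualDeviceSpec(
    operation=vim.vm.device.VirtualDeviceSpec.Operation.add,
    fileOperation=vim.vm.device.VirtualDeviceSpec.FileOperation.create,
    device=disk,
)

config_spec = vim.vm.ConfigSpec(deviceChange=[ctrl_spec, disk_spec])
# vm.ReconfigVM_Task(spec=config_spec)   # 'vm' would be the first WSFC node VM
```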
Here is a small table of the maximums. As you can see it is quite limited so far and only available with FC storage, but there is no doubt that future releases will improve those numbers as usual.
There is also a fairly long list of limitations that you can review here. The main ones being:
- Cluster in a Box (CIB) configuration is not supported (you must use DRS anti-affinity rules to keep the cluster node VMs on different ESXi hosts; a sketch follows this list).
- Hot expansion of a VMDK that is associated with a clustered VM is not supported (not sure how that case would be handled).
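Since CIB is out, the anti-affinity rule mentioned above could be created with something like the pyVmomi sketch below. It assumes you already hold references to the compute cluster and the two node VMs; the rule name is a placeholder.

```python
from pyVmomi import vim

def keep_wsfc_nodes_apart(cluster, vm_node1, vm_node2):
    """Add a mandatory DRS anti-affinity rule for the two WSFC node VMs.

    'cluster' is a vim.ClusterComputeResource, the vm_* args are
    vim.VirtualMachine objects you already looked up.
    """
    rule = vim.cluster.AntiAffinityRuleSpec(
        name="wsfc-keep-nodes-apart",   # placeholder rule name
        enabled=True,
        mandatory=True,                 # hard rule: never co-locate the nodes
        vm=[vm_node1, vm_node2],
    )
    rule_spec = vim.cluster.RuleSpec(operation="add", info=rule)
    spec = vim.cluster.ConfigSpecEx(rulesSpec=[rule_spec])
    return cluster.ReconfigureComputeResource_Task(spec=spec, modify=True)
```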
Note that VMware provides a guide on how to migrate your WSFC-enabled RDM disks to clustered VMDK and it’s THERE.