TL;DR – How to Fix!

If your Health service is in a Failed state, you most likely cannot get your SDDC Management service to start up. You also probably cannot manage the cluster via Windows Admin Center (WAC).

It also means you cannot run the Stop-ClusterPerformanceHistory command, as that relies on these services to be running.

If you do not care about the historical data of your cluster’s stats, you can purge it all and recreate everything by running the following commands on a S2D cluster node directly:

Before running any code you find on a random website, please read through it and run it at your own risk.

A Little more about SDDC Management

Since Windows Server 2019, when you deploy a Storage Spaces Direct (S2D) cluster, you get a few out-of-the-box resources for managing the S2D components via WAC:

This slightly changed in Windows Server 2022 (and Azure Stack HCI), where Microsoft moved these resources from the Core Cluster Group to its own SDDC Group:

Failover Cluster Manager sometimes does not show the SDDC Group in the GUI due to a display bug. Microsoft has made it clear that they have no interest in fixing this since you should be using PowerShell or Windows Admin Center… 🤷‍♂️

The SDDC Management resource is a grouping of “microservices” responsible for relaying information about the cluster, its member nodes, networking, and storage to whatever is querying the API. This is the main way that Windows Admin Center (WAC) and other tooling gets information about the cluster.

4 thoughts on “Repairing Cluster Health and SDDC Resources in an Azure Stack HCI or Storage Spaces Direct (S2D) Cluster

  1. Is there any information out there with regards to what the Providers specifically do? We have a cluster where we cannot add all the health providers, and in fact, specifically provider 29d1f3ee-dbcf-44e9-b0cc-085bfa362499 causes the health service to fail to start. We can add all the other providers and it works OK, but we are unable to create the Cluster Performance History volume. Running the start command has no result.

    Surprisingly there seems to be very little to go off of in Event Viewer as well 🙁

    1. Sorry, I have no idea what they do. My assumption is there’s a relationship between those provider GUIDs and the various “things” the health service tracks (Storage Scale Units, etc). But I honestly do not know.

Leave a Reply to Braxton Credeur Cancel reply

Your email address will not be published. Required fields are marked *