Skip to main content
Version: Next

Task 03: Enable HCI Insights

Runbook Azure

DOCUMENT CATEGORY: Runbook SCOPE: HCI Insights monitoring integration PURPOSE: Enable Azure Local Insights workbook for cluster health and performance monitoring MASTER REFERENCE: Microsoft Learn - Azure Local Insights

Status: Active


Azure Local Insights provides a rich, pre-built Azure Monitor Workbook that visualizes cluster health, node status, storage performance, and VM state. Enabling Insights automatically installs the Azure Monitor Agent (if not already present) and configures the required Data Collection Rules for HCI-specific telemetry.

Prerequisites

RequirementDescriptionValidation
Log Analytics WorkspaceCreated in Step 1Workspace accessible
Azure Monitor AgentInstalled on cluster nodes (Step 2)Extension status: Succeeded
Arc-Enabled ClusterAzure Local cluster registeredPortal shows cluster resource
RBAC PermissionsMonitoring ContributorRole assignment verified

Variables from variables.yml

VariableConfig PathExample
AZURE_SUBSCRIPTION_IDazure.subscription.id00000000-0000-0000-0000-000000000000
AZURE_SUBSCRIPTION_NAMEazure.subscription.nameAzure Local Production
AZURE_RESOURCE_GROUPazure.resource_group.namerg-azurelocal-prod-eus2
CLUSTER_NAMEcluster.nameazl-dal-cl01
LOG_ANALYTICS_WORKSPACE_NAMEmonitoring.log_analytics.workspace_namelaw-azl-DAL-prod-01
SITE_CODEsite.codeDAL
CLUSTER_NODE_01_NAMEnodes[0].nameazl-dal-node-01

Overview

HCI Insights collects data from specific Windows Event Log channels and performance counters:

Data TypeSourcePurpose
Health EventsMicrosoft-Windows-Health/OperationalNode and component health status
SDDC EventsMicrosoft-Windows-SDDC-Management/OperationalCluster management events
MemoryMemory\Available BytesMemory utilization
NetworkNetwork Interface(*)\Bytes Total/secNetwork throughput
CPUProcessor(_Total)\% Processor TimeCPU utilization
RDMARDMA Activity(*)\RDMA Inbound/Outbound Bytes/secStorage network performance

Configuration Options

Step 3.1: Enable Insights from Cluster Resource

  1. Navigate to Azure PortalAzure Local → Select your cluster
  2. Under Capabilities tab, locate the Insights tile
  3. Click Insights to open the configuration page
  4. Click Get Started

Step 3.2: Configure Data Collection Rule

  1. On the Insights configuration page:
  • Select an existing DCR from the dropdown, OR
  • Click Create New to create a dedicated Insights DCR
  1. For New DCR:
SettingValue
Subscription{{AZURE_SUBSCRIPTION_NAME}}
DCR NameAzureStackHCI-{{CLUSTER_NAME}}-dcr
Data Collection Endpointdce-{{SITE_CODE}}-azl-01
  1. Click Review + create
Automatic AMA Installation

When you configure Insights, Azure Monitor Agent is automatically installed on all cluster nodes if not already present. The DCRs created by Insights have the prefix AzureStackHCI-.

Step 3.3: Verify Insights Configuration

  1. Return to your cluster's Capabilities tab
  2. The Insights tile should now show Configured
  3. Click Insights to view the workbook

Using the Insights Workbook

Once enabled, the Insights workbook provides several views:

Cluster Health Overview

MetricDescriptionAlert Threshold
Cluster HealthOverall cluster statusWarning/Critical
Node HealthIndividual node statusAny node unhealthy
Storage HealthStorage pool and volume statusDegraded/Unhealthy
VM HealthVirtual machine statesFailed VMs

Performance Monitoring

The workbook displays:

  • CPU utilization across all nodes
  • Memory availability trends
  • Network throughput (total bytes/sec)
  • RDMA performance for storage traffic
  • Storage latency for CSV volumes
TabContent
OverviewCluster summary, health status, quick stats
NodesPer-node CPU, memory, network metrics
StorageVolume health, capacity, latency
VMsVM count, state distribution

Validation

Verify Insights Status

# Check cluster Insights configuration
$cluster = Get-AzStackHciCluster `
-ResourceGroupName "{{AZURE_RESOURCE_GROUP}}" `
-ClusterName "{{CLUSTER_NAME}}"

# Check for AMA extensions on nodes
$nodes = @("{{CLUSTER_NODE_01_NAME}}", "{{CLUSTER_NODE_02_NAME}}")
foreach ($node in $nodes) {
$ext = Get-AzConnectedMachineExtension `
-ResourceGroupName "{{AZURE_RESOURCE_GROUP}}" `
-MachineName $node `
-Name "AzureMonitorWindowsAgent"

Write-Host "$node AMA Status: $($ext.ProvisioningState)"
}

Verify Data Collection

Run these queries in Log Analytics to confirm data is flowing:

// Check Health events
Event
| where Source == "Microsoft-Windows-Health"
| where TimeGenerated > ago(1h)
| summarize count() by Computer
| order by Computer asc

// Check SDDC Management events
Event
| where Source == "Microsoft-Windows-SDDC-Management"
| where TimeGenerated > ago(1h)
| summarize count() by Computer

Sample Health Query

// Cluster health summary
Event
| where Source == "Microsoft-Windows-SDDC-Management"
| where EventID == 3000 // Server health event
| where TimeGenerated > ago(24h)
| extend ParsedData = parse_json(RenderedDescription)
| project TimeGenerated, Computer, HealthState = ParsedData.HealthState
| summarize LatestHealth = arg_max(TimeGenerated, *) by Computer

Troubleshooting

IssuePossible CauseResolution
Insights shows "Not configured"AMA not installedCheck Extensions on cluster nodes
No data in workbookDCR not associatedVerify DCR associations in Monitor → DCRs
Stale data (>15 min old)Agent connectivity issueCheck azcmagent show on nodes
Missing health eventsEvent log channel not enabledVerify Windows Event Log settings
Cluster shows "Other" statusRecent Arc reconnectionWait for next health check cycle

Event Log Verification

On each cluster node, verify the required event logs are enabled:

# Run on each cluster node
$logs = @(
"Microsoft-Windows-Health/Operational",
"Microsoft-Windows-SDDC-Management/Operational"
)

foreach ($log in $logs) {
$logInfo = Get-WinEvent -ListLog $log -ErrorAction SilentlyContinue
if ($logInfo) {
Write-Host "✅ $log - Enabled: $($logInfo.IsEnabled)" -ForegroundColor Green
} else {
Write-Host "❌ $log - Not found" -ForegroundColor Red
}
}

Variables Reference

VariableDescriptionExample
{{CLUSTER_NAME}}Azure Local cluster nameazl-dal-cluster-01
{{CLUSTER_NODE_01_NAME}}First node hostnameazl-dal-n01
{{LOG_ANALYTICS_WORKSPACE_NAME}}Workspace namelaw-azl-dal-prod-01

Next Steps

After enabling HCI Insights:

  1. ➡️ Task 4: Setup Alerting — Configure alert rules based on Insights data
  2. Review the Insights workbook for baseline understanding
  3. Bookmark key workbook views for operational monitoring
  4. Consider enabling at scale using Azure Policy for multiple clusters

PreviousUpNext
← Task 02: Azure Monitor AgentPhase 02: Monitoring & ObservabilityTask 04: Setup Alerting →

VersionDateAuthorChanges
1.0.02026-03-24Azure Local Cloudnology TeamInitial release