Node Monitoring
Monitor node group costs, resource utilization, and instance type efficiency.
Overview
Node Monitoring
Node Monitoring shows infrastructure-level details for a node group (auto-scaling group or managed node pool). The page includes cost and efficiency metrics, analytical charts, and a list of individual nodes.
Access: Select cluster > Dashboard > Nodes tab > Click a node group
Metric Cards
Four cards at the top:
- Nodes - Count of nodes in the group with trend indicator
- Monthly Cost - Combined monthly cost with trend indicator
- Average Efficiency - Average of CPU and memory efficiency across the group
- Total Resources - Aggregate CPU cores and memory
Charts
Four analytical charts provide insight into the node group:
- Pod Density Over Time - Bar chart showing how many pods are scheduled per node over time. Includes a formula explanation for density calculation.
- Cost per Node Type - Horizontal bar chart comparing costs across different instance types in the group.
- Availability Zone Distribution - Dual-axis bar chart showing node count and cost per AZ. Helps identify imbalanced distributions.
- Ghost Town Analysis - Area chart comparing provisioned capacity against actual utilization. Identifies off-hours waste where resources sit idle.
Tabs
Two tabs organize node group data:
- Nodes - Table of individual nodes. Columns: name, instance type, status, CPU/memory utilization, hourly cost, pods scheduled. Includes filters for search, instance type, spot/on-demand toggle, and GPU-enabled toggle.
- Details - Infrastructure metadata including provider, instance type, region, zones, architecture, OS, GPU availability, and total CPU/memory/pod capacity.
Individual Node Metrics
Each node in the table shows:
- Name - Node hostname
- Instance Type - EC2/GCE instance size (e.g. m5.2xlarge, t3.large)
- Status - Ready, NotReady, or Unknown
- CPU Usage - Current utilization percentage
- Memory Usage - Current utilization percentage
- Hourly Cost - Total hourly cost for this node
- Pods - Number of pods scheduled on this node
For efficiency score calculations, see Resource Efficiency.