How to change a health threshold
ENA starts managing a Cisco APIC as soon as the APIC has been added to your network. The sections below describe how it is managed.
Measuring Cisco ACI health:
There are 10 measurements available from the APIC. These are:
Metric Name | Metric Description |
---|---|
APIC CPU | CPU utilization % of the Cisco APIC. |
APIC Memory | Memory utilization % of the Cisco APIC. |
EPG Health | Based on the health score of each EPG component, as well as the number of learned endpoints on the leaf. |
FabricNode Health | Health of the fabric node. |
FanTray Health | Health of the fan tray. |
LineCard Health | Health of the line card. |
Pod Health | Based on the health scores of the leaf and spine switches, as well as the number of learned endpoints on the leaf switches. |
PSU Health | Health of the power supply unit. |
SupervisorCard Health | Health of the supervisor card. |
Tenant Health | Aggregate of the tenant-wide logical object health scores, across the infrastructure they use. |
ENA uses the APIC concept of health. For all metrics (except for APIC CPU and APIC Memory), APIC health is measured as a percentage:
- 100% health = healthy.
- 0% health = unhealthy.
Cisco advises that if health drops below 90%, then there is a problem. Therefore, ENA sets 90% as the default health threshold. When a metric's health drops below 90%, ENA generates an event on that metric.
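The default threshold rule described above can be sketched in a few lines of Python. This is an illustration only, not ENA's implementation: the function name and the sample readings are hypothetical, and the only value taken from this article is the 90% default.

```python
# Illustrative sketch only; ENA's internal logic is not shown in this article.
DEFAULT_HEALTH_THRESHOLD = 90.0  # default health threshold stated in this article


def breaches_health_threshold(health_pct: float,
                              threshold: float = DEFAULT_HEALTH_THRESHOLD) -> bool:
    """Return True when a health score (0-100%) is below the threshold."""
    if not 0.0 <= health_pct <= 100.0:
        raise ValueError(f"health must be between 0 and 100, got {health_pct}")
    return health_pct < threshold


# Hypothetical readings for three of the health-based metrics listed above.
sample_readings = {
    "Pod Health": 97.0,
    "Tenant Health": 88.5,
    "PSU Health": 90.0,
}

# Metrics whose health is below the threshold would have an event generated.
metrics_with_events = [
    name for name, health in sample_readings.items()
    if breaches_health_threshold(health)
]
```

With these sample readings, only Tenant Health falls below 90%. A reading of exactly 90% does not count as a breach, because the rule is "drops below 90%".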
For the APIC CPU and APIC Memory metrics, ENA also uses a percentage, but these metrics are not based on health. Instead, they are based on usage:
- 100% usage = unhealthy.
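The difference between health-based and usage-based metrics can be sketched as follows. This is a hedged illustration rather than ENA's actual logic: the function name is hypothetical, and the usage threshold of 80% is a placeholder chosen for the example, because this article does not state the default usage threshold.

```python
# Illustrative sketch only. The usage threshold below is a made-up placeholder;
# this article does not state ENA's default for APIC CPU or APIC Memory.
USAGE_METRICS = {"APIC CPU", "APIC Memory"}
HEALTH_THRESHOLD = 90.0             # default health threshold from this article
HYPOTHETICAL_USAGE_THRESHOLD = 80.0  # assumption for illustration only


def is_unhealthy(metric: str, value_pct: float,
                 health_threshold: float = HEALTH_THRESHOLD,
                 usage_threshold: float = HYPOTHETICAL_USAGE_THRESHOLD) -> bool:
    """Interpret a percentage according to the kind of metric.

    Usage metrics get worse as the value rises; health metrics get worse
    as the value falls.
    """
    if metric in USAGE_METRICS:
        return value_pct > usage_threshold
    return value_pct < health_threshold
```

For example, a value of 95% is a concern for APIC CPU but a good score for Pod Health, which is why the two kinds of metric are compared in opposite directions.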
Cisco ACI faults:
Cisco ACI faults are displayed in the ACI Faults dashlet. In this dashlet, you can see the Cisco ACI faults as displayed by the device.
In ENA, ACI faults are a separate concept from events. ACI faults are a pure APIC concept, and are not turned into ENA events and incidents. However, they can be useful for seeing what could be causing a Cisco APIC health-related event or incident.
- For example, if events and incidents have been raised against a particular Cisco APIC health metric (as described in the section above), you can use the ACI Faults dashlet to view the ACI faults reported for that component.
How to change a health threshold:
- Navigate to the Cisco APIC for which you want to change the health thresholds.
- Open its Threshold dashboard, and ensure the Thresholds Related To field is set to ACI.
- Follow the instructions in this article on overriding thresholds to change the health thresholds for the selected Cisco APIC.