If you are using the current version of Cumulus NetQ, the content on this page may not be up to date. The current version of the documentation is available here. If you are redirected to the main page of the user guide, then this page may have been renamed; please search for it there.

Monitor Critical Events

You can easily monitor critical events occurring across your network using the Alarms card. You can determine the number of events for the various system, interface, and network protocols and services components in the network. The content of the cards in the workflow is described first, and then followed by common tasks you would perform using this card workflow.

Alarms Card Workflow Summary

The small Alarms card displays:

Item Description
Indicates data is for all critical severity events in the network
Alarm trend Trend of alarm count, represented by an arrow:
  • Pointing upward and bright pink: alarm count is higher than the last two time periods, an increasing trend
  • Pointing downward and green: alarm count is lower than the last two time periods, a decreasing trend
  • No arrow: alarm count is unchanged over the last two time periods, trend is steady
Alarm score Current count of alarms during the designated time period
Alarm rating Count of alarms relative to the average count of alarms during the designated time period:
  • Low: Count of alarms is below the average count; a nominal count
  • Med: Count of alarms is in range of the average count; some room for improvement
  • High: Count of alarms is above the average count; user intervention recommended
Chart Distribution alarms received during the designated time period and a total count of all alarms present in the system

The medium Alarms card displays:

Item Description
Time period Range of time in which the displayed data was collected; applies to all card sizes
Indicates data is for all critical events in the network
Count Total number of alarms received during the designated time period
Alarm score Current count of alarms received from each category (overall, system, interface, and network services) during the designated time period
Chart Distribution of all alarms received from each category during the designated time period

The large Alarms card has one tab.

The Alarm Summary tab displays:

Item Description
Time period Range of time in which the displayed data was collected; applies to all card sizes
Indicates data is for all system, trace and interface critical events in the network
Alarm Distribution

Chart: Distribution of all alarms received from each category during the designated time period:

  • NetQ Agent
  • BTRFS Information
  • CL Support
  • Config Diff
  • CL License
  • Installed Packages
  • Link
  • LLDP
  • MTU
  • Node
  • Port
  • Resource
  • Running Config Diff
  • Sensor
  • Services
  • SSD Utilization
  • TCA Interface Stats
  • TCA Resource Utilization
  • TCA Sensors
The category with the largest number of alarms is shown at the top, followed by the next most, down to the chart with the fewest alarms.

Count: Total number of alarms received from each category during the designated time period

Table Listing of items that match the filter selection for the selected alarm categories:
  • Events by Most Recent: Most recent event are listed at the top
  • Devices by Event Count: Devices with the most events are listed at the top
Show All Events Opens full screen Events | Alarms card with a listing of all events

The full screen Alarms card provides tabs for all events.

Item Description
Title Events | Alarms
Closes full screen card and returns to workbench
Default Time Range of time in which the displayed data was collected
Displays data refresh status. Click to pause data refresh. Click to resume data refresh. Current refresh rate is visible by hovering over icon.
Results Number of results found for the selected tab
All Events Displays all events (both alarms and info) received in the time period. By default, the requests list is sorted by the date and time that the event occurred (Time). This tab provides the following additional data about each request:
  • Source: Hostname of the given event
  • Message: Text describing the alarm or info event that occurred
  • Type: Name of network protocol and/or service that triggered the given event
  • Severity: Importance of the event-critical, warning, info, or debug
Table Actions Select, export, or filter the list. Refer to Table Settings.

View Alarm Status Summary

A summary of the critical alarms in the network includes the number of alarms, a trend indicator, a performance indicator, and a distribution of those alarms.

To view the summary, open the small Alarms card.

In this example, there are a small number of alarms (2), the number of alarms is decreasing (down arrow), and there are fewer alarms right now than the average number of alarms during this time period. This would indicate no further investigation is needed. Note that with such a small number of alarms, the rating may be a bit skewed.

View the Distribution of Alarms

It is helpful to know where and when alarms are occurring in your network. The Alarms card workflow enables you to see the distribution of alarms based on its source-network services, interfaces, or other system services. You can also view the trend of alarms in each source category.

To view the alarm distribution, open the medium Alarms card. Scroll down to view all of the charts.

Monitor System and Interface Alarm Details

The Alarms card workflow enables users to easily view and track critical severity system and interface alarms occurring anywhere in your network.

View All System and Interface Alarms

You can view the alarms associated with the system and interfaces using the Alarms card workflow. You can sort alarms based on their occurrence or view devices with the most network services alarms.

To view network services alarms, open the large Alarms card.

From this card, you can view the distribution of alarms for each of the categories over time. The charts are sorted by total alarm count, with the highest number of alarms i a category listed at the top. Scroll down to view any hidden charts. A list of the associated alarms is also displayed. By default, the list of the most recent alarms for the systems and interfaces is displayed when viewing the large cards.

View Devices with the Most Alarms

You can filter instead for the devices that have the most alarms.

To view devices with the most alarms, open the large Alarms card, and then select Devices by event count from the dropdown.

Filter Alarms by Category

You can focus your view to include alarms for one or more selected alarm categories.

To filter for selected categories:

  1. Click the checkbox to the left of one or more charts to remove that set of alarms from the table on the right.

  2. Select the Devices by event count to view the devices with the most alarms for the selected categories.

  3. Switch back to most recent events by selecting Events by most recent.

  4. Click the checkbox again to return a category’s data to the table.

In this example, we removed the Services from the event listing.

Compare Alarms with a Prior Time

You can change the time period for the data to compare with a prior time. If the same devices are consistently indicating the most alarms, you might want to look more carefully at those devices using the Switches card workflow.

To compare two time periods:

  1. Open a second Alarm Events card. Remember it goes to the bottom of the workbench.

  2. Switch to the large size view.

  3. Move the card to be next to the original Alarm Events card. Note that moving large cards can take a few extra seconds since they contain a large amount of data.

  4. Hover over the card and click .

  5. Select a different time period.

  6. Compare the two cards with the Devices by event count filter applied.

    In this example, both the total alarm count and the devices with the most alarms in each time period are unchanged. You could go back further in time to see if this changes or investigate the current status of the largest offenders.

View All Events

You can view all events in the network either by clicking the Show All Events link under the table on the large Alarm Events card, or by opening the full screen Alarm Events card.

OR

To return to your workbench, click in the top right corner of the card.