Table of ContentsPreviousNextIndex

Fidelia Technology Logo

    End-User Web Guide

12.1 Introduction

Welcome to NetVigil! Fidelia NetVigil provides your organization with proactive monitoring, event detection, reporting and problem escalation for mission-critical components of IT infrastructures.

NetVigil provides Instant Business Visibility by monitoring your critical IT Infrastructure including but not limited to your network based services, Applications and Systems. Current monitoring services include availability, performance, Applications, E-Commerce, SNMP, port monitoring, and custom tests. NetVigil is designed to be a flexible system where you can set thresholds, test intervals, filters on status views, schedule notifications and generate standard or custom reports based on the privileges you have. NetVigil's Instant Business Visibility provides you access to your data through a web interface. It saves you time and reduces costs by enabling you to quickly identify and resolve downtime problems without any investment in additional infrastructure or technical expertise.

NetVigil offers two types of reporting, real-time status reports and periodic trend reports. Immediately upon login to the NetVigil system, by default you are able to see the real-time status of your monitored devices on the Device Summary page. Instantly you are able to see any current failures and performance losses. In a single click NetVigil provides you with test details on any monitored device, a 24 hour graphical snapshot of performance and event history, and test results for the last 30 days.

12.2 NetVigil Terms

NetVigil monitors the availability and performance of your network and application systems, and their underlying components. These systems and components may be routers, switches, servers, databases, networks, or applications.

A test is the measure of device functioning. Tests are used to monitor your devices. NetVigil reports the status of each test. Test status (shown on the Status | Tests page) is the current status category (ok, warning, critical, unknown, unreachable, suspended, unknown or not configured) for a test. Device status (shown on the Status | Device page) is the worst current test status for a device.

NetVigil uses boundaries called thresholds to determine a test's status. An event occurs whenever a test result crosses a threshold.

An action is an activity that is automatically triggered by an event. Actions can be designed to take place immediately when a single event occurs or after the same event occurs repeatedly. For instance, an E-mail notification can be sent whenever a test crosses the warning threshold, or it can be sent after a test has crossed the warning threshold five consecutive times.

12.3 NetVigil Basics

12.3.1 The NetVigil Interface

Figure 12.1 below shows the top three rows of the NetVigil interface. All NetVigil pages have the components shown, including:

The NetVigil Interface

Figure 12.2 displays all the NetVigil icons (except TRANSIENT) used to communicate device and test status. okay, warning, and critical test statuses are determined by the thresholds for a specific test. Test status is unknown if the test cannot be performed (for example, if the device is down) or if the database cannot be reached. Although not represented by a particular icon, a test can have status = FAIL, which means that the device was reached but the test failed to be performed. An example is when a POP3 port test is performed and the supplied login/password combination fails.

A test status is not configured if a device has been added to the application for monitoring but no tests have been created for it. An unreachable test status will be shown if a parent/child dependency has been created between two or more devices and a failure in the parent device causes the child to be physically unreachable via the network. Please refer to Section 8.3, "Device Dependency" on page 8-3 for detailed information on NetVigil's device dependency feature.

NetVigil Symbols Used to Report Status

12.3.2 Site Map

The diagrams that follow show the top levels of the hierarchy of pages in the NetVigil User Interface. Each diagram shows the hierarchy for one of the main menus. Some items are only available to administrators and do not appear if you are logged in as an end-user.

Site Map, Account Menu

Site Map, Reports Menu

Site Map, Manage Menu

Site Map, Help Menu

12.4 Getting Started

Your administrator will set up your account and provide you with an account name, login and password.

12.5 Viewing Status

12.5.1 Device Summary View

The Device Summary View is the default view after the STATUS tab is selected. There is one row for each device on the account being monitored. Each row gives the device name and the status for each of three categories of tests: Network, System, and Application.

The modify icon links you to a page for modifying a device's settings.

If the device status for one group of tests is warning, at least one current test result for that test category is in warning range. Similarly, if the device status for one category of tests is critical, at least one current test result for that group is in critical range. The worst test status of all tests in the category determines the icon displayed. The rule for displaying the icons (from most to least severe) is:

A sample Device Summary page is shown below.

Device Summary Page

12.5.2 Test Summary View

The Test Summary page contains one row for each test being conducted. Each row contains test status, test name, current test value, the warning and critical thresholds, the time the last test was conducted, and the time the test has remained in the current state.

Test Summary Page

Click on the STATUS tab on the main navigation bar to go to the Device Summary page. Click on the device name link for the device of interest and you will be taken to the Device Status Details page.

12.5.3 Test Details View

The Test Details page graphically displays performance and event history for a single test over the last 6-24 hours. Figure 5 below illustrates the four graphs on the Test Details page:

Test Details Page

From the Test Details page, users also have access to the following information for that test:

12.5.4 Service Container Summary View

The Service Container Summary View, available via STATUS | containers displays custom groupings of tests and/or devices to create a consolidated view of a logical system or application. Please see Section 12.6.4, "Managing Service Containers" on page 12-19 for more detailed information.

12.5.5 Device Display Filters

Via NetVigil's device summary views (i.e. Device Summary and Device Groups Summary pages), users can set default filters in order to only view devices in specific states. For example, users may elect to filter out devices that are in an 'OK' status. Additionally, users can specify how many devices are displayed on a single page. Especially for large deployments, these two features can dramatically cut down on the number of entries a user must scroll through to get a quick snapshot of system health. A toggle switch on the Device Summary & Device Groups Summary pages quickly disables or enables the filter(s).

These changes will become part of your user profile and will serve as defaults each time you log in to NetVigil.

You can also choose the starting page when you logon to the NetVigil.

12.5.6 Device Comment Field

A user can enter a comment that will display on the Device Summary page. This could be used in any way by the user to communicate device-specific information, such as to identify why a device is being suspended or as general information on the current state of the device.

12.5.7 Context-sensitive Help or Action

NetVigil's Test Summary view displays a HELP link used to provide context-sensitive help to users. Selecting the link displays a pop-up window with information configured by your administrator or operations personnel to address device or test help topics. Although completely customizable, one suggested use of this functionality is to provide online help documentation for a specific device or test in the absence of senior administration personnel (e.g. nighttime operations).

An alternative to providing text based help, is to enable an action (e.g. server re-start) via the HELP link. This is a powerful option, as an administrator can configure any number of files to work in this fashion, enabling a large number of background processes via the web app. Please contact your NetVigil administrator for details of how the functionality is being deployed in your organization.

12.5.8 Event Logs

An Event Log lists every time a test status has changed in the past 24 hours. Each entry gives the device name, time the event occurred, test name, type of test, low (warning) and high (critical) thresholds, and the actual test value. The Event Log can be viewed in aggregate for all devices and tests on an account, or may be viewed in a filtered manner for a specific device or test. The various levels of viewing event logs are explained below.

Please wait for the information to load, as the databases for all the data gathering elements (DGEs) are being queried.

12.5.9 Messages

Messages are the alerts generated by the system for each event of importance including the Administrator or user actions that result in changes in what needs to be monitored and how. Examples include Suspension, Deletion, Modification of Devices

12.6 Managing NetVigil

12.6.1 User Groups

Each account belongs to only one User Group. Your administrator will assign privileges and limits to each User Group which will in turn govern your account's ability to perform create/delete, suspend/resume and update activities on devices, tests, testing intervals, thresholds, actions, reports, and any applicable limits to the same. For example, you may or may not have the ability to delete devices, tests, and actions if your User Group has not been given these privileges. As well, the default minimum test interval may vary with the User Group, giving higher order User Groups the ability to configure more frequent test intervals than lower order User Groups.

All accounts belonging to a User Group will have identical privileges and limits, although some exceptions may exist for users who have been given read-only access to an account.

12.6.2 Managing Devices

The Manage Devices page displays all the account's devices and links to perform various administrative functions on the devices. Each row contains the device name and address, type of device, whether monitoring is currently active or suspended, a link for suspending or resuming monitoring, and the physical device location. Additionally, there are links for updating or deleting the device, and for managing the tests for the device.

Manage Devices Page
note Test discovery may take up to 1 minute, depending on the number of test types you chose. Please follow the on-screen instructions as the device is queried.

Create Device Page

The suspend/resume feature allows you to temporarily turn off all the tests for a device and turn them on again. This feature is useful if you are performing maintenance task on a device and do not want to receive alerts while the device is offline. Once a device is suspended, the polling and data collection for all the tests on the device is suspended and thus any associated actions to the tests will not generate notifications. The suspend/resume feature is available at both the device and the individual test level.

12.6.3 Auto-Update for Device Capacity Change

NetVigil provides a mechanism for refreshing maximum values or SNMP object identifiers (SNMP OID) when an SNMP test has changed. For example, when memory or disk capacity has changed, tests that return percentage-based values would be incorrect unless the max value (for determining 100%) is refreshed. As well, in the case of a device rebuild, it is possible that the SNMP OIDs may change, thus creating a mismatch between the current SNMP OIDs and the once which NetVigil discovered during initial provisioning.

If one of the previous situations occurs, the user need only repeat the test provisioning process in the web application for a changed device. NetVigil will discover whether any material changes on the device have occurred and highlight those changes on the Configure Tests page, giving the user the option to also change thresholds and/or actions that apply to the test.

12.6.4 Managing Service Containers

Service containers allow you to group tests and devices to create logical, business-oriented views of your network in addition to your hardware-oriented views. An Administrator can create a service that includes tests and/or devices from multiple Departments (e.g., any Departments associated with a User-Class that the Administrator controls).

There are two types of service containers:

Virtual Devices contain tests only. A real NetVigil device has a collection of tests associated with it. A Virtual Device is a collection of tests that are logically related, but not associated with the same physical device. For example, you can create a Virtual Device that includes ping tests for all devices on your network. This allows you to see at a glance which devices are unreachable without looking at test results for individual devices.

Device Containers can contain real devices and Virtual Devices (described above). They can also include other Device Containers, creating a nested hierarchy. For example, you can create a Device Container called Payroll that includes the web server, router, and backend database used by the Payroll division. This allows you to quickly spot and troubleshoot problems that affect the Payroll group's ability to provide service.

You can generate reports on virtual devices and device containers, get uptime, and get real-time status if any of the underlying components fail or cross any threshold.

NOTE Service Container is a generic term, and can refer to either a Virtual Device or a Device Container.

Figure 12.12 illustrates a Device Container that contains real devices, a Service Container, and a nested Device Container.

Example: Device Containers and Virtual Devices

NOTE Updating or deleting a Service Container has no impact on devices or tests. If you delete a Virtual Device, for example, the tests included in the Virtual Device still exist in NetVigil.

12.6.5 Managing Actions

NetVigil supports actions in the form of E-mail, Compact E-Mail, SNMP Trap or executing an external script. Typically, actions are some form of notification that a test result has crossed a defined threshold into OK, WARNING, CRITICAL or UNKNOWN status. An action consists of a group of 1 to 5 sub-actions, allowing the user to define multiple notification recipients and specific notification rules for each recipient. Figure 12.13 below illustrates how a test result can trigger multiple actions.

Example of Test Results Triggering Actions

In this latest release of Fidelia's NetVigil, actions can be created by either end-users or an administrator to notify up to five (5) separate recipients when a test status changes, or when a test status has been in a particular state for a predetermined number of test cycles. In the above example, the action has been configured with three (3) separate sub-actions in a typical escalation scenario. When the Ping RTT test on the NT Server reaches a WARNING status, the NOC Admin user receives an E-mail notification of the problem. If the test crosses the upper threshold to CRITICAL status, the NOC Manager is notified. Once the test has remained in CRITICAL status for three (3) test cycles, the CTO is notified.

Note Actions and their sub-actions must first be defined before they are available to be assigned to one or more tests.
Note When you receive an E-mail notification that a test result for one of your devices has crossed into WARNING or CRITICAL status, that E-mail has only been sent to you or to the distribution list of which you are a member. The thresholds your administrator uses to trigger notifications may be different from your personal thresholds. Please do not create actions to alert the administrator or IT staff.

12.6.6 Managing Your Account


Fidelia Technology, Inc.
Contact Us
Table of ContentsPreviousNextIndex