Skip to main content
  • Place orders quickly and easily
  • View orders and track your shipping status
  • Enjoy members-only rewards and discounts
  • Create and access a list of your products
  • Manage your Dell EMC sites, products, and product-level contacts using Company Administration.

PowerProtect Data Manager 19.16 Administrator Guide

Monitor system health

You can monitor system health information from the Health window of the PowerProtect Data Manager UI.

To view a summary of any issues affecting the health of PowerProtect Data Manager, select Health from the navigation pane or View All from the Dashboard health widget.

PowerProtect Data Manager automatically performs a health check every two minutes. If an issue is detected, it is assigned a category and a deduction value based on its severity. All issues are displayed on the Health window. Resolved issues are automatically removed the next time a health check is performed.

Health details and status are provided for the following categories:

  • Components identifies the state of hardware and software services, such as Running or Failed.
  • Configuration identifies whether any aspects of the PowerProtect Data Manager configuration are incomplete, such as System Support configuration.
  • Capacity identifies the provisioned and currently allocated size of the associated storage system.
  • Performance identifies key performance indicators, such as memory use.
  • Data Protection identifies key protection indicators, such as service-level agreements not being met and disaster-recovery backup copies not being present.

Each category starts with a score of 100. If there is an outstanding health check issue in one of these categories, its score is reduced by the deduction value assigned to the issue. If there is more than one outstanding issue in the category, its score is only reduced by the deduction value of the most severe issue.

NOTE: Earlier versions of PowerProtect Data Manager did not protect DDMC virtual assets. Now that these assets can be protected, the system health score includes them in its calculation. This change to the calculation can result in a decrease in the health score that is displayed when updating from an earlier version of PowerProtect Data Manager. However, the lower score does not mean that assets that were previously protected are no longer protected. Instead, the lower score means that DDMC virtual assets that could not be protected before but now can be are currently not being protected. To restore the score to the expected value, add DDMC virtual assets to a protection policy.

Click Details next to an entry to see the details of the issue.

In the Health window, you can export health data by using the Export All functionality.

The overall health score of the system is represented by the most severe issue and the category with the lowest score.

NOTE:After changing the hostname or IP address of the PowerProtect Data Manager server, the overall health score of the system can be reported as lower than normal for up to two hours.
Table 1. Overall health scoreHealth score
Health score Indicates
95–100 System is in good health.
71–94 System is in fair health.
0–70 System is in poor health.
Table 2. Health check descriptionsHealth check descriptions
Category Health Check Maximum Deduction Description
Configuration Asset source configuration -30

Deduction occurs when no asset sources have been added and enabled in PowerProtect Data Manager. When at least one asset source is added, the health score returns to normal.

Storage configuration -30

Deduction occurs when there are no storage targets configured in the system. When at least one storage target is set, the health score returns to normal.

Support configuration -10

Deduction occurs when there are no support options configured in the system. The support options include:

  • Email setup
  • Support assist
  • Auto support

When a support option is configured, the health score returns to normal. If a support option is configured but initialization is still in progress, the health score reduction is set to -5 until the initialization is complete.

Policies defined for all assets -2

Deduction occurs if any of the assets from the asset sources enabled in PowerProtect Data Manager are not protected (for example, Protected/Exclude). The deduction is -2 when unprotected assets total is greater than 0.

Once all assets have been moved to a protected state, the health score returns to normal.

System disaster recovery (DR) backup schedule -10

Deduction occurs when there is no scheduled system DR backup.

Once the system DR schedules have been set, the health score returns to normal.

License -30

Deduction occurs when the license status is not valid or close to its expiration date:

  • When the license is invalid or has expired: -30
  • When the license expires in less than 7 days: -20

The health returns to normal upon application of a valid PowerProtect Data Manager license.

Operating system account health check -60

Deduction occurs if any of the operating system account passwords are about to expire or already expired:

  • Before operating system account password expiry: -15
  • Upon password expiry: -60

Once the operating system account expiry error is fixed, the health score returns to normal.

Search cluster configuration

Search cluster is disabled: -5

Search node parent vCenter Servers are removed: -5

Deduction occurs when the Search cluster is disabled or the parent vCenter Servers are removed.

The health score returns to normal once the Search cluster is properly configured.

Reporting cluster configuration When reporting node parent vCenter Servers are removed: -5

Deduction occurs when the Reporting node parent vCenter Servers are removed.

Once the reporting cluster error is fixed, the health score returns to normal.

ES configuration -5

Deduction occurs when undefined ES settings have been added.

Once the error is fixed on the ES side, the health score returns to normal.

Components PowerProtect Data Manager core infrastructure services status

Business services: 30

Core services: 30

Infrastructure services: 60

Management services: 40

Protection services: 20

Deduction occurs when one or more of the PowerProtect Data Manager services is not running or is disabled.

The health score returns to normal when all services are up and running.

Protection engines status -10

Deduction occurs when the protection engine requires attention.

The health score returns to normal when the protection engine status is in operational state.

Reporting -10

Deduction occurs when one or more of the report nodes cannot be detected.

Once the health check error is fixed, the health score returns to normal.

Search cluster -25

Deduction occurs when one or more Search clusters or nodes are disabled or cannot be detected.

The health score returns to normal once all the Search cluster issues are resolved.

Cloud Disaster Recovery -25

Deduction occurs when the Cloud DR Server in PowerProtect Data Manager cannot be detected or the password is invalid.

The health status/score returns to normal once the DD Cloud Disaster Recovery server issues have been resolved.

Heap dump -2

Deduction occurs when java heap dump files are detected in the java service log folder.

The health score returns to normal when there are no java heap dump files detected.

DNS -60

Deduction occurs when all the DNS servers are unreachable.

The health score returns to normal when at least one of the DNS servers can be reached.

NTP -10

Deduction occurs when all the NTP servers are unavailable.

The health score returns to normal when at least one of the configured NTP servers can be reached.

ES Shards Health Check

-50 (replica shards unassigned)

-70 (primary shards unassigned)

Deduction occurs when the Replica or Primary shards are unassigned.

Once the ES Shards errors are fixed, the health score returns to normal.

Data Protection Service Level Agreement (SLA) compliance -50 Deduction occurs when SLA compliance is defined but has not been met, for example, asset compliance ratio is defined as: Out Of Compliance Asset Count/In Compliance Asset Count + Out Of Compliance Asset Count
  • Low ratio: Compliance ratio <= 1/3
  • High ratio: 1/3< Compliance ratio <=2/3
  • Critical ratio: Compliance ratio > 2/3

    When more than 2/3 of protection policies are out of compliance with the defined SLAs, the score deduction is -50.

The health score returns to normal when the SLA compliance has been met, for example, complianceRatio= 0.
System DR backup copy present -40 Deduction occurs when the System DR backup copy is not present. When the DR backup copy exists, the health score returns to normal.
Discovery status -20 (for PowerProtect Data Manager)

-5 (for Cloud Snapshot Manager)

Deduction occurs when the PowerProtect Data Manager or Cloud Snapshot Manager (CSM) discovery job completes with an error. The health score returns to normal once the Discovery jobs errors are fixed.
Capacity PowerProtect Data Manager disk space -60

Deduction occurs when there is heavy disk partition space use. When disk space usage is 75-90%, the score deduction is -15. When the disk space usage exceeds 90%, the score deduction is -60.

The health score returns to normal when disk space usage falls below the 75% threshold.

Performance Memory usage -40

Deduction occurs when there is heavy operating system memory usage. When memory usage is 80-9%, the score deduction is -15. When the memory usage exceeds 95%, the score deduction is -40.

The health score returns to normal when disk space usage falls below the 80% threshold.

The following health checks provide grace periods, allowing you a period of time after deployment to configure your system without a significant reduction in the overall health score. An informational alert notification appears up to 24 hours before the score deduction occurs.

Table 3. Deductions with grace periodDeductions with grace period
Health check component Deduction by grace period
Asset source configuration
  • Not configured for up to 48 hours: -5
  • Not configured for more than 48 hours but less than one week: -20
  • Not configured after more than one week: -30
Storage configuration
  • Not configured for up to 24 hours: -5
  • Not configured after 24 hours: -30
System Support configuration
  • Not configured for up to one week: -5
  • Not configured after more than one week: -10
System Disaster Recovery (DR) backup schedule
  • Not configured for up to 48 hours: -5
  • Not configured for more than 48 hours: -10

Rate this content

Accurate
Useful
Easy to understand
Was this article helpful?
0/3000 characters
  Please provide ratings (1-5 stars).
  Please provide ratings (1-5 stars).
  Please provide ratings (1-5 stars).
  Please select whether the article was helpful or not.
  Comments cannot contain these special characters: <>()\