View Advanced Monitoring Dashboards

To view the advanced monitoring dashboards in the ECS Portal, select Advanced Monitoring. Data Access Performance - Overview dashboard is the default.

Table 1. Advanced monitoring dashboards
Dashboard Description
Data Access Performance - Overview You can use the Data Access Performance - Overview dashboard to monitor VDC data.
Data Access Performance - by Namespaces You can use the Data Access Performance - by Namespaces dashboard to monitor performance data for individual namespace or group of Namespaces.
Data Access Performance - by Nodes You can use the Data Access Performance - by Nodes dashboard to see performance data for individual node or group of nodes in a VDC.
Data Access Performance - by Protocols You can use the Data Access Performance - by Protocols dashboard to see performance data for each supported protocol (S3, ATMOS, SWIFT) or set of protocols.
Disk Bandwidth - by Nodes You can use the Disk Bandwidth - by Nodes dashboard to monitor the disk usage metrics by read or write operations at the node level. The dashboard displays the latest values.
NOTE: For Disk Bandwidth - by Nodes dashboard, consistency checker metric shows data only for read but not write as it is irrelevant.
Disk Bandwidth - Overview You can use the Disk Bandwidth - Overview dashboard to monitor the disk usage metrics by read or write operations at the VDC level.
NOTE: For Disk Bandwidth - Overview dashboard, consistency checker metric shows data only for read but not write as it is irrelevant.
Node Rebalancing You can use the Node Rebalancing dashboard to monitor the status of data rebalancing operations when nodes are added to, or removed from, a cluster. Node rebalancing is enabled by default at installation. Contact your technical support representative to disable or reenable this feature.
Process Health - by Nodes You can use the Process Health - by Nodes dashboard to monitor for each node of the VDC use of network interface, CPU, and available memory. The dashboard displays the latest values, and the history graphs display values in the selected range.
Process Health - Overview You can use the Process Health - Overview dashboard to monitor the VDC use of network interface, CPU, and available memory. The dashboard displays the latest average values, and the history graphs display values in the selected time range.
Process Health - Process List by Node You can use the Process Health - Process List by Node dashboard to monitor processes use of CPU, memory, average thread number and last restart time in the selected time range. The dashboard displays the latest values in the selected time range.
Recovery Status You can use the Recovery Status dashboard to monitor the data recovered by the system.
SSD Read Cache You can use the SSD Read Cache dashboard to monitor total SSD disk capacity and disk space that is used by SSD read cache.
Tech Refresh: Data Migration You can use the Tech Refresh: Data Migration dashboard to monitor the data migration off and on a node or cluster.
Top Buckets You can use the Top Buckets dashboard to monitor the number of buckets with top utilization that is based on total object size and count.
Table 2. Advanced monitoring dashboard fields
Dashboard Field Description
  • Data Access Performance - Overview
  • Data Access Performance - by Namespaces
  • Data Access Performance - by Nodes
  • Data Access Performance - by Protocols
Related dashboards Allows you to switch to other dashboards in access performance group, with the selected time.
  • Data Access Performance - Overview
  • Data Access Performance - by Namespaces
  • Data Access Performance - by Nodes
  • Data Access Performance - by Protocols
Transaction Summary Lists the total Successful requests, System Failures, User Failures, and Failure % Rate for the selected VDCs, namespaces, nodes, or protocols.
  • Data Access Performance - Overview
  • Data Access Performance - by Nodes
Performance Summary Lists the latest values of data access bandwidth and latency of read/write requests for selected range.
  • Data Access Performance - Overview
  • Data Access Performance - by Namespaces
  • Data Access Performance - by Nodes
  • Data Access Performance - by Protocols
Successful requests The number of data requests that were successfully completed.
  • Data Access Performance - Overview
  • Data Access Performance - by Namespaces
  • Data Access Performance - by Nodes
  • Data Access Performance - by Protocols
System Failures The number of data requests that failed due to hardware or service errors. System failures are failed requests that are associated with hardware or service errors (typically an HTTP error code of 5xx).
  • Data Access Performance - Overview
  • Data Access Performance - by Namespaces
  • Data Access Performance - by Nodes
  • Data Access Performance - by Protocols
User Failures The number of data requests from all object heads are classified as user failures. User failures are known error types originating from the object heads (typically an HTTP error code of 4xx).
  • Data Access Performance - Overview
  • Data Access Performance - by Namespaces
  • Data Access Performance - by Nodes
  • Data Access Performance - by Protocols
Failure % Rate The percentage of failures for the VDC, namespace, nodes, or protocols.
  • Data Access Performance - Overview
  • Data Access Performance - by Namespaces
  • Data Access Performance - by Nodes
  • Data Access Performance - by Protocols
TPS (success/failure) Rate of successful requests and failures per second.
  • Data Access Performance - Overview
  • Data Access Performance - by Nodes
  • Data Access Performance - by Protocols
Bandwidth (read/write) Data access bandwidth of successful requests per second.
  • Data Access Performance - Overview
  • Data Access Performance - by Namespaces
  • Data Access Performance - by Nodes
  • Data Access Performance - by Protocols
Failed Requests/s by error type (user/system) Rate of failed requests per second, split by error type (user/system).
  • Data Access Performance - Overview
  • Data Access Performance - by Nodes
  • Data Access Performance - by Protocols
  • SSD Read Cache
Latency Latency of read/write requests.
  • Data Access Performance - Overview
  • Data Access Performance - by Nodes
Successful request drill down Displays the rate of successful requests per second, by method, node, and protocol.
  • Data Access Performance - Overview
  • Data Access Performance - by Nodes
Successful Requests/s by Method Rate of successful requests per second, by method.
  • Data Access Performance - by Namespaces
  • Data Access Performance - by Nodes
  • Data Access Performance - by Protocols
Successful Requests/s by Node Rate of successful requests per second, by node.
  • Data Access Performance - Overview
  • Data Access Performance - by Nodes
Successful Requests/s by Protocol Rate of successful requests per second, by protocol.
  • Data Access Performance - Overview
  • Data Access Performance - by Nodes
Failures drill down Displays the rate of failed requests per second, by method, node, and protocol.
  • Data Access Performance - Overview
  • Data Access Performance - by Nodes
Failed Requests/s by Method Rate of failed requests per second, by method.
  • Data Access Performance - by Namespaces
  • Data Access Performance - by Nodes
  • Data Access Performance - by Protocols
Failed Requests/s by Node Rate of failed requests per second, by node.
  • Data Access Performance - Overview
  • Data Access Performance - by Nodes
Failed Requests/s by Protocol Rate of failed requests per second, by protocol.
  • Data Access Performance - Overview
  • Data Access Performance - by Nodes
Failed Requests/s by error code Rate of failed requests per second, by error code.
  • Data Access Performance - by Nodes
  • Data Access Performance - by Namespaces
  • Data Access Performance - by Protocols
Compare TPS of successful requests Select multiple nodes and compare rates of successful requests per second.
Data Access Performance - by Namespaces Compare TPS of failed requests Select multiple nodes and compare rates of failed requests per second, by error type (user/system).
  • Data Access Performance - by Nodes
  • Data Access Performance - by Protocols
Compare read bandwidth Select multiple nodes and compare data access bandwidth (read) of successful requests per second.
  • Data Access Performance - by Nodes
  • Data Access Performance - by Protocols
Compare write bandwidth Select multiple nodes and compare data access bandwidth (write) of successful requests per second.
  • Data Access Performance - by Nodes
  • Data Access Performance - by Protocols
Compare read latency Select multiple nodes and compare latency of read requests.
  • Data Access Performance - by Nodes
  • Data Access Performance - by Protocols
Compare write latency Select multiple nodes and compare latency of write requests.
  • Data Access Performance - by Nodes
  • Data Access Performance - by Protocols
Compare rate of failed requests/s Select multiple nodes and compare rates of failed requests per second, split by error type (user/system).
Data Access Performance - by Namespaces Request drill down by nodes Rate of requests per second, split by node.
  • Disk Bandwidth - by Nodes
  • Disk Bandwidth - Overview
Read or Write Indicates whether the row describes read data or write data.
  • Disk Bandwidth - by Nodes
  • Disk Bandwidth - Overview
Nodes The number of nodes in the VDC. You can click the nodes number to see the disk bandwidth metrics for each node. There is no Nodes column when you have drilled down into the Nodes display for a VDC.
  • Disk Bandwidth - by Nodes
  • Disk Bandwidth - Overview
Total Total disk bandwidth that is used for either read or write operations.
  • Disk Bandwidth - by Nodes
  • Disk Bandwidth - Overview
Hardware Recovery Rate at which disk bandwidth is used to recover data after a hardware failure.
  • Disk Bandwidth - by Nodes
  • Disk Bandwidth - Overview
Erasure Encoding Rate at which disk bandwidth is used in system erasure coding operations.
  • Disk Bandwidth - by Nodes
  • Disk Bandwidth - Overview
XOR Rate at which disk bandwidth is used in the XOR data protection operations of the system. XOR operations occur for systems with three or more sites (VDCs).
  • Disk Bandwidth - by Nodes
  • Disk Bandwidth - Overview
Consistency Checker Rate at which disk bandwidth is used to check for inconsistencies between protected data and its replicas.
  • Disk Bandwidth - by Nodes
  • Disk Bandwidth - Overview
Geo Rate at which disk bandwidth is used to support geo replication operations.
  • Disk Bandwidth - by Nodes
  • Disk Bandwidth - Overview
User Traffic Rate at which disk bandwidth is used by object users.
Node Rebalancing Data Rebalanced Amount of data that has been rebalanced.
Node Rebalancing Pending Rebalancing Amount of data that is in the rebalance queue but has not been rebalanced yet.
Node Rebalancing Rate of Rebalance (per day) The incremental amount of data that was rebalanced during a specific time period. The default time period is one day.
Process Health - Process List by Node Process Restarts The last time the process restarted on the node in the selected time range. The maximum time range could be 5 days because it is limited by the retention policy.
Process Health - Overview Avg. NIC Bandwidth Average bandwidth of the network interface controller hardware that is used by the selected VDC or node.
Process Health - Process List by Node NIC Bandwidth Bandwidth of the network interface controller hardware that is used by the selected VDC or node.
Process Health - Overview Avg. CPU Usage Average percentage of the CPU hardware that is used by the selected VDC or node.
Process Health - Overview Avg. Memory Usage Average usage of the aggregate memory available to the VDC or node.
  • Process Health - by Nodes
  • Process Health - Overview
Relative NIC (%) Percentage of the available bandwidth of the network interface controller hardware that is used by the selected VDC or node.
  • Process Health - by Nodes
  • Process Health - Overview
  • Process Health - Process List by Node
Relative Memory (%) Percentage of the memory used relative to the memory available to the selected VDC or node.
  • Process Health - by Nodes
  • Process Health - Process List by Node
CPU Usage Percentage of the node's CPU used by the process. The list of processes that are tracked is not the complete list of processes running on the node. The sum of the CPU used by the processes is not equal to the CPU usage shown for the node.
Process Health - by Nodes Memory Usage The memory used by the process.
  • Process Health - by Nodes
  • Process Health - Overview
  • Process Health - Process List by Node
Relative Memory (%) Percentage of the memory used relative to the memory available to the process.
Process Health - Process List by Node Avg. # Thread Average number of threads used by the process.
Process Health - Process List by Node Last Restart The last time the process restarted on the node.
Process Health - by Nodes Host -
Process Health - Process List by Node Process -
Recovery Status Amount of Data to be Recovered With the Current filter selected, this is the logical size of the data yet to be recovered.
  • When a historical period is selected as the filter, the meaning of Total Amount Data to be Recovered is the average amount of data pending recovery during the selected time.
  • For example, if the first hourly snapshot of the data showed 400 GB of data to be recovered in a historical time period and every other snapshot showed 0 GB waiting to be recovered, the value of this field would be 400 GB divided by the total number of hourly snapshots in the period.
SSD Read Cache Disk Usage Used SSD space by Read Cache
SSD Read Cache Disk Capacity Total SSD disk capacity
Tech Refresh: Data Migration Remaining Volume to Migrate This panel shows graph of remaining volume on source nodes.
Tech Refresh: Data Migration Migration Speed This panel shows graph of remaining volume on source nodes.
Tech Refresh: Data Migration Data Migration Status Detailed status of migration on source nodes. Migration speed and predictions are calculated based on last 1 hour of currently selected time interval.
Top buckets Top Buckets by Size Top used buckets by size.
Top buckets Top Buckets by Object Count Top used buckets by object count.
Top buckets Time of Calculation The time at which the displayed metrics of Top Buckets dashboard were calculated.