Longhorn Metrics for Monitoring

Volume

NameDescriptionExample
longhorn_volume_actual_size_bytesActual space used by each replica of the volume on the corresponding nodelonghorn_volume_actual_size_bytes{pvc_namespace="default”,node="worker-2”,pvc="testvol”,volume="testvol”} 1.1917312e+08
longhorn_volume_capacity_bytesConfigured size in bytes for this volumelonghorn_volume_capacity_bytes{pvc_namespace="default”,node="worker-2”,pvc="testvol”,volume="testvol”} 6.442450944e+09
longhorn_volume_stateState of this volume: 1=creating, 2=attached, 3=Detached, 4=Attaching, 5=Detaching, 6=Deletinglonghorn_volume_state{pvc_namespace="default”,node="worker-2”,pvc="testvol”,volume="testvol”} 2
longhorn_volume_robustnessRobustness of this volume: 0=unknown, 1=healthy, 2=degraded, 3=faultedlonghorn_volume_robustness{pvc_namespace="default”,node="worker-2”,pvc="testvol”,volume="testvol”} 1
longhorn_volume_read_throughputRead throughput of this volume (Bytes/s)longhorn_volume_read_throughput{pvc_namespace="default”,node="worker-2”,pvc="testvol”,volume="testvol”} 5120000
longhorn_volume_write_throughputWrite throughput of this volume (Bytes/s)longhorn_volume_write_throughput{pvc_namespace="default”,node="worker-2”,pvc="testvol”,volume="testvol”} 512000
longhorn_volume_read_iopsRead IOPS of this volumelonghorn_volume_read_iops{pvc_namespace="default”,node="worker-2”,pvc="testvol”,volume="testvol”} 100
longhorn_volume_write_iopsWrite IOPS of this volumelonghorn_volume_write_iops{pvc_namespace="default”,node="worker-2”,pvc="testvol”,volume="testvol”} 100
longhorn_volume_read_latencyRead latency of this volume (ns)longhorn_volume_read_latency{pvc_namespace="default”,node="worker-2”,pvc="testvol”,volume="testvol”} 100000
longhorn_volume_write_latencyWrite latency of this volume (ns)longhorn_volume_write_latency{pvc_namespace="default”,node="worker-2”,pvc="testvol”,volume="testvol”} 100000
longhorn_volume_file_system_read_onlyThis metric indicates that the volume is now in read-only mode. The metric is either 1 or no record for each volumelonghorn_volume_file_system_read_only{node="worker-2”,pvc="testvol”,pvc_namespace="default”,volume="testvol”} 1

Node

NameDescriptionExample
longhorn_node_statusStatus of this node: 1=true, 0=falselonghorn_node_status{condition="ready”,condition_reason=”",node="worker-2”} 1
longhorn_node_count_totalTotal number of nodes in the Longhorn systemlonghorn_node_count_total 4
longhorn_node_cpu_capacity_millicpuThe maximum allocatable CPU on this nodelonghorn_node_cpu_capacity_millicpu{node="worker-2”} 2000
longhorn_node_cpu_usage_millicpuThe CPU usage on this nodelonghorn_node_cpu_usage_millicpu{node="pworker-2”} 186
longhorn_node_memory_capacity_bytesThe maximum allocatable memory on this nodelonghorn_node_memory_capacity_bytes{node="worker-2”} 4.031229952e+09
longhorn_node_memory_usage_bytesThe memory usage on this nodelonghorn_node_memory_usage_bytes{node="worker-2”} 1.833582592e+09
longhorn_node_storage_capacity_bytesThe storage capacity of this nodelonghorn_node_storage_capacity_bytes{node="worker-3”} 8.3987283968e+10
longhorn_node_storage_usage_bytesThe used storage of this nodelonghorn_node_storage_usage_bytes{node="worker-3”} 9.060941824e+09
longhorn_node_storage_reservation_bytesThe reserved storage for other applications and system on this nodelonghorn_node_storage_reservation_bytes{node="worker-3”} 2.519618519e+10

Disk

NameDescriptionExample
longhorn_disk_capacity_bytesThe storage capacity of this disklonghorn_disk_capacity_bytes{disk="default-disk-8b28ee3134628183”,node="worker-3”} 8.3987283968e+10
longhorn_disk_usage_bytesThe used storage of this disklonghorn_disk_usage_bytes{disk="default-disk-8b28ee3134628183”,node="worker-3”} 9.060941824e+09
longhorn_disk_reservation_bytesThe reserved storage for other applications and system on this disklonghorn_disk_reservation_bytes{disk="default-disk-8b28ee3134628183”,node="worker-3”} 2.519618519e+10
longhorn_disk_statusThe status of this disklonghorn_disk_status{condition="ready”,condition_reason=”",disk="default-disk-ca0300000000”,node="worker-3”}

Instance Manager

NameDescriptionExample
longhorn_instance_manager_cpu_usage_millicpuThe cpu usage of this longhorn instance managerlonghorn_instance_manager_cpu_usage_millicpu{instance_manager="instance-manager-e-2189ed13”,instance_manager_type="engine”,node="worker-2”} 80
longhorn_instance_manager_cpu_requests_millicpuRequested CPU resources in kubernetes of this Longhorn instance managerlonghorn_instance_manager_cpu_requests_millicpu{instance_manager="instance-manager-e-2189ed13”,instance_manager_type="engine”,node="worker-2”} 250
longhorn_instance_manager_memory_usage_bytesThe memory usage of this longhorn instance managerlonghorn_instance_manager_memory_usage_bytes{instance_manager="instance-manager-e-2189ed13”,instance_manager_type="engine”,node="worker-2”} 2.4072192e+07
longhorn_instance_manager_memory_requests_bytesRequested memory in Kubernetes of this longhorn instance managerlonghorn_instance_manager_memory_requests_bytes{instance_manager="instance-manager-e-2189ed13”,instance_manager_type="engine”,node="worker-2”} 0
longhorn_instance_manager_proxy_grpc_connectionThe number of proxy gRPC connection of this longhorn instance managerlonghorn_instance_manager_proxy_grpc_connection{instance_manager="instance-manager-e-814dfd05”, instance_manager_type="engine”, node="worker-2”} 0

Manager

NameDescriptionExample
longhorn_manager_cpu_usage_millicpuThe CPU usage of this Longhorn Managerlonghorn_manager_cpu_usage_millicpu{manager="longhorn-manager-5rx2n”,node="worker-2”} 27
longhorn_manager_memory_usage_bytesThe memory usage of this Longhorn Managerlonghorn_manager_memory_usage_bytes{manager="longhorn-manager-5rx2n”,node="worker-2”} 2.6144768e+07

Backup

NameDescriptionExample
longhorn_backup_actual_size_bytesActual size of this backuplonghorn_backup_actual_size_bytes{backup="backup-4ab66eca0d60473e”,volume="testvol”, recurring_job="backup”} 6.291456e+07
longhorn_backup_stateState of this backup: 0=New, 1=Pending, 2=InProgress, 3=Completed, 4=Error, 5=Unknownlonghorn_backup_state{backup="backup-4ab66eca0d60473e”,volume="testvol”, recurring_job=”"} 3

Snapshot

NameDescriptionExample
longhorn_snapshot_actual_size_bytesActual size of this snapshotlonghorn_snapshot_actual_size_bytes{snapshot="f4468111-2efa-45f5-aef6-63109e30d92c”,user_created="false”,volume="testvol”} 1.048576e+07

BackingImage

NameDescriptionExample
longhorn_backing_image_actual_size_bytesActual size of this backing imagelonghorn_backing_image_actual_size_bytes{backing_image="parrot”,disk="ca203ce8-2cad-4cd1-92a7-542851f50518”,node="kworker1”} 3.3554432e+07
longhorn_backing_image_stateState of this backing image: 0=Pending, 1=Starting, 2=InProgress, 3=ReadyForTransfer, 4=Ready, 5=Failed, 6=FailedAndCleanUp, 7=Unknownlonghorn_backing_image_state{backing_image="parrot”,disk="ca203ce8-2cad-4cd1-92a7-542851f50518”,node="kworker1”} 4

BackupBackingImage

NameDescriptionExample
longhorn_backup_backing_image_actual_size_bytesActual size of this backup backing imagelonghorn_backup_backing_image_actual_size_bytes{backup_backing_image="parrot”} 3.3554432e+07
longhorn_backup_backing_image_stateState of this backup backing image: 0=New, 1=Pending, 2=InProgress, 3=Completed, 4=Error, 5=Unknownlonghorn_backup_backing_image_state{backup_backing_image="parrot”} 3

CSI

The CSI sidecar component has built-in metrics for users to get insights into CSI operations. The CSI operations metrics cover total count, error count, and call latency. Longhorn enables the metrics by adding the flag --http-endpoint for each CSI sidecar component. You can use Prometheus’s PodMonitor to collect these metrics.

NamePort
longhorn-csi-attacher8000
longhorn-csi-provisioner8000
longhorn-csi-resizer8000
longhorn-csi-snapshotter8000

The metrics provided by the CSI sidecar component are provided in a histogram format. For example, you can obtain metrics observing the time it takes to create a Longhorn Volume for the PVC.

csi_sidecar_operations_seconds_bucket{driver_name="driver.longhorn.io",grpc_status_code="OK",method_name="/csi.v1.Controller/ControllerPublishVolume",le="0.1"} 0
csi_sidecar_operations_seconds_bucket{driver_name="driver.longhorn.io",grpc_status_code="OK",method_name="/csi.v1.Controller/ControllerPublishVolume",le="0.25"} 0
csi_sidecar_operations_seconds_bucket{driver_name="driver.longhorn.io",grpc_status_code="OK",method_name="/csi.v1.Controller/ControllerPublishVolume",le="0.5"} 0
csi_sidecar_operations_seconds_bucket{driver_name="driver.longhorn.io",grpc_status_code="OK",method_name="/csi.v1.Controller/ControllerPublishVolume",le="1"} 0
csi_sidecar_operations_seconds_bucket{driver_name="driver.longhorn.io",grpc_status_code="OK",method_name="/csi.v1.Controller/ControllerPublishVolume",le="2.5"} 3
csi_sidecar_operations_seconds_bucket{driver_name="driver.longhorn.io",grpc_status_code="OK",method_name="/csi.v1.Controller/ControllerPublishVolume",le="5"} 3
csi_sidecar_operations_seconds_bucket{driver_name="driver.longhorn.io",grpc_status_code="OK",method_name="/csi.v1.Controller/ControllerPublishVolume",le="10"} 3
csi_sidecar_operations_seconds_bucket{driver_name="driver.longhorn.io",grpc_status_code="OK",method_name="/csi.v1.Controller/ControllerPublishVolume",le="15"} 9
csi_sidecar_operations_seconds_bucket{driver_name="driver.longhorn.io",grpc_status_code="OK",method_name="/csi.v1.Controller/ControllerPublishVolume",le="25"} 9
csi_sidecar_operations_seconds_bucket{driver_name="driver.longhorn.io",grpc_status_code="OK",method_name="/csi.v1.Controller/ControllerPublishVolume",le="50"} 9
csi_sidecar_operations_seconds_bucket{driver_name="driver.longhorn.io",grpc_status_code="OK",method_name="/csi.v1.Controller/ControllerPublishVolume",le="120"} 9
csi_sidecar_operations_seconds_bucket{driver_name="driver.longhorn.io",grpc_status_code="OK",method_name="/csi.v1.Controller/ControllerPublishVolume",le="300"} 9
csi_sidecar_operations_seconds_bucket{driver_name="driver.longhorn.io",grpc_status_code="OK",method_name="/csi.v1.Controller/ControllerPublishVolume",le="600"} 9
csi_sidecar_operations_seconds_bucket{driver_name="driver.longhorn.io",grpc_status_code="OK",method_name="/csi.v1.Controller/ControllerPublishVolume",le="+Inf"} 9
csi_sidecar_operations_seconds_sum{driver_name="driver.longhorn.io",grpc_status_code="OK",method_name="/csi.v1.Controller/ControllerPublishVolume"} 66.816478825
csi_sidecar_operations_seconds_count{driver_name="driver.longhorn.io",grpc_status_code="OK",method_name="/csi.v1.Controller/ControllerPublishVolume"} 9

© 2019-2024 Longhorn Authors | Documentation Distributed under CC-BY-4.0


© 2024 The Linux Foundation. All rights reserved. The Linux Foundation has registered trademarks and uses trademarks. For a list of trademarks of The Linux Foundation, please see our Trademark Usage page.