Scale Replica Rebuilding
Starting with v1.11.0, Longhorn supports scale replica rebuilding, allowing a rebuilding replica to fetch snapshot data from multiple healthy replicas concurrently to improve rebuild performance.
Traditionally, Longhorn rebuilds a failed replica by fetching all snapshot data from a single healthy replica. With scale replica rebuilding, the rebuilding replica can fetch snapshot data from multiple source replicas simultaneously, potentially reducing rebuild time.
This feature is particularly beneficial when volumes contain scattered small data chunks and holes in their snapshots, as it allows better utilization of available network bandwidth and parallel processing.
When scale replica rebuilding is enabled, its concurrency is governed by the replica-rebuild-concurrent-sync-limit setting.
replica-rebuild-concurrent-sync-limit
This setting controls the maximum number of healthy replicas that can sync snapshot data to a single rebuilding replica concurrently.
Default: 1 (scale rebuilding disabled)
Range: 1 to 5
When set to 1, only one source replica syncs to the rebuilding replica at a time (traditional behavior). Values greater than 1 enable scale replica rebuilding, allowing multiple source replicas (up to the configured limit) to sync snapshot data to the rebuilding replica simultaneously.
For more information, see Settings Reference.
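The effect of the limit can be illustrated with a small sketch. This is not Longhorn's implementation: the function name `plan_sources` and the round-robin assignment of snapshots to sources are assumptions made purely for illustration. The sketch only shows that at most `limit` healthy replicas act as sources, and that a limit of 1 reduces to the traditional single-source rebuild.

```python
def plan_sources(healthy_replicas, snapshots, limit):
    """Assign each snapshot to one of at most `limit` source replicas.

    Hypothetical illustration only; Longhorn's real scheduling may differ.
    """
    # The documented range for the setting is 1 to 5.
    if not 1 <= limit <= 5:
        raise ValueError("limit must be between 1 and 5")
    if not healthy_replicas:
        raise ValueError("at least one healthy replica is required")

    # Never use more sources than the limit allows (or than exist).
    sources = healthy_replicas[:limit]
    plan = {source: [] for source in sources}

    # Assumed distribution strategy: spread snapshots round-robin.
    for index, snapshot in enumerate(snapshots):
        plan[sources[index % len(sources)]].append(snapshot)
    return plan
```

With three healthy replicas and a limit of 2, only the first two replicas appear in the returned plan. With a limit of 1, every snapshot is assigned to a single source.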
You can override the global replica-rebuild-concurrent-sync-limit setting for individual volumes:
In the Longhorn UI: set the volume's Rebuild Concurrent Sync Limit field.
With kubectl: run kubectl -n longhorn-system edit volume [volume-name] and modify the spec.rebuildConcurrentSyncLimit field.
When the per-volume setting is 0, the volume uses the global setting. Otherwise, the per-volume setting takes precedence.
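The precedence rule above can be written as a short function. This is a minimal sketch of the documented rule, not code taken from Longhorn; the function and parameter names are hypothetical.

```python
def effective_sync_limit(volume_limit, global_limit):
    """Return the concurrent sync limit a volume actually uses.

    volume_limit: value of spec.rebuildConcurrentSyncLimit on the volume
    global_limit: value of the global replica-rebuild-concurrent-sync-limit
    """
    # A per-volume value of 0 means "fall back to the global setting".
    if volume_limit == 0:
        return global_limit
    # Any other per-volume value takes precedence over the global one.
    return volume_limit
```

For example, a volume with a per-volume value of 0 and a global setting of 3 rebuilds with a limit of 3, while a volume with a per-volume value of 2 uses 2 regardless of the global setting.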
Scale replica rebuilding provides the largest performance improvements for the workloads described above: volumes whose snapshots contain scattered small data chunks and holes.
Start with the default: The default value of 1 (scale replica rebuilding disabled) is conservative and suitable for most environments.
Test before increasing: Before increasing the limit, test in a non-production environment to understand the resource impact on your specific workload.
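Consider your workload: volumes whose snapshots contain scattered small data chunks and holes are the most likely to benefit from enabling scale replica rebuilding (a limit of 2 or higher).
Monitor resource usage: When scale replica rebuilding is enabled, monitor CPU usage on nodes hosting source and destination replicas to ensure sufficient resources are available.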
Balance performance and resources: Higher concurrent sync limits can improve rebuild speed but consume more CPU resources. Consider the trade-off based on your cluster’s resource availability and rebuild urgency.
The maximum value of the limit is 5.
For more information on related rebuilding features, see Longhorn #11331.
© 2019-2026 Longhorn Authors | Documentation Distributed under CC-BY-4.0