Summary

The balancer does not make progress in certain scenarios where the most loaded shard belongs to a balanced zone: it keeps selecting that shard as donor even when all shards in its zone are already balanced, and then fails to find a suitable recipient because the remaining underloaded shards belong to different zones.

Details

When the cluster has zones configured and the most overloaded shard (by data size) is in a zone that is already internally balanced, the balancer repeatedly tries to move chunks from that shard. However, since the other shards in the same zone are already balanced, there are no valid chunk candidates that can be donated while still honoring the existing zone configuration. As a result:

- The balancer keeps choosing the most loaded shard as the donor.
- No migrations are actually performed, so the overall balancing does not make progress.
- Zones themselves are respected at all times; the issue is with donor selection and lack of progress when the top candidate shard cannot actually donate any chunks.

Impact

Balancer rounds can appear to be "stuck" or not making progress, even though the system is correctly enforcing the configured zones. This mainly affects situations where:

- One shard is globally the most loaded shard.
- That shard is in a zone that is already locally balanced.
- Other zones may remain unbalanced.

Expected Behavior

If the most loaded shard in a zone cannot donate any further chunks without violating zone constraints, the balancer should:

- Skip it as a donor candidate for that round, and
- Consider other shards/zones where valid migrations would still respect the zone configuration and effectively reduce imbalance.
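A minimal sketch of the per-zone donor selection described above, assuming a simplified model where each shard carries a single zone tag and a total data size. The function name `select_migration`, the dict layout, and the greedy largest-vs-smallest pairing are illustrative assumptions, not MongoDB's actual balancer code (which also applies per-collection migration thresholds and chunk-level checks).

```python
from collections import defaultdict


def select_migration(shards):
    """Pick one (donor, recipient, zone) triple, considering each zone independently.

    `shards` is a list of dicts: {"name": str, "zone": str, "size_gb": float}.
    Returns None when every zone is already balanced.
    """
    by_zone = defaultdict(list)
    for shard in shards:
        by_zone[shard["zone"]].append(shard)

    best = None
    for zone, members in by_zone.items():
        if len(members) < 2:
            # A single-shard zone can never donate within the zone, so it is
            # skipped instead of being selected as a dead-end donor.
            continue
        donor = max(members, key=lambda s: s["size_gb"])
        recipient = min(members, key=lambda s: s["size_gb"])
        imbalance = donor["size_gb"] - recipient["size_gb"]
        if imbalance <= 0:
            # Zone already balanced; do not get stuck on its most loaded shard.
            continue
        if best is None or imbalance > best[0]:
            best = (imbalance, donor["name"], recipient["name"], zone)

    if best is None:
        return None
    _, donor_name, recipient_name, zone = best
    return donor_name, recipient_name, zone
```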
Example:

- Shard1 [Zone_US]: 500 GB
- Shard2 [Zone_EU]: 300 GB
- Shard3 [Zone_EU]: 100 GB

In this scenario, the balancer fails to balance Zone_EU and does not move any chunks. Instead it should move 100 GB from Shard2 to Shard3.
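Running the illustrative sketch above on this scenario picks the intra-zone move the report expects, even though Shard1 is globally the most loaded shard:

```python
shards = [
    {"name": "Shard1", "zone": "Zone_US", "size_gb": 500},
    {"name": "Shard2", "zone": "Zone_EU", "size_gb": 300},
    {"name": "Shard3", "zone": "Zone_EU", "size_gb": 100},
]

print(select_migration(shards))
# ('Shard2', 'Shard3', 'Zone_EU') -- Zone_US has only one shard and is skipped;
# the imbalance inside Zone_EU drives the chosen migration.
```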