BugZero | Dell BugID 196966 - VxRail: Nodes May Experience High LSOM Congestion

Dell - Defect ID: 196966

VxRail: Nodes May Experience High LSOM Congestion

Dell - Defect ID: 196966

VxRail: Nodes May Experience High LSOM Congestion

Last updated on December 17th, 2025

BugZero Risk Score
0.0 Coming soon

Overall: N/A

Severity: N/A

Community: N/A

Lifecycle: N/A

What is the BugZero Risk Score?

Dell Integration

Learn more about where this data comes from

Dell Integration

Learn more

Bug Scrub Advisor

Streamline upgrades with automated vendor bug scrubs

Bug Scrub Advisor

Learn more

BugZero Enterprise

Wish you caught this bug sooner? Get proactive today.

BugZero Enterprise

Learn more

Bug Details

Support Case Count: 62
Article View Count: 934
Impact Category: N/A

Description

Symptoms

Note: the info provided based off VMware KB 82619 (External Link). Review the article for any potential newer updates.When running VxRail versions 4.7.511-526 and 7.0.130-132, you may experience the following issues: "Number of elements in the commit tables" are more than 100k and do not decrease over a period of hours.Loss of ability to see files and folders on the vSAN datastoreSevere performance degradationOne or more nodes presenting high Local Log Structured Object Management (LSOM) memory congestion (see command 1)."Number of elements in the commit tables" are more than 100k (see command 2).Memory Congestion that has propagated to all the nodes in the cluster.Logs messages in vmkernel.log: LSOM: LSOM_ThrowCongestionVOB:3429: Throttled: Virtual SAN node "HOSTNAME" maximum Memory congestion reached. Logs messages in vobd.log and vmkernel.log LSOM_ThrowAsyncCongestionVOB:1669: LSOM Memory Congestion State: Exceeded. Congestion Threshold: 200 Current Congestion: 204. The following scripted commands can be used to determine if the host may be experiencing this issue.Script 1 while true; do echo "================================================"; date; for ssd in $(localcli vsan storage list |grep "Group UUID"|awk '{print $5}'|sort -u);do echo $ssd;vsish -e get /vmkModules/lsom/disks/$ssd/info|grep Congestion;done; for ssd in $(localcli vsan storage list |grep "Group UUID"|awk '{print $5}'|sort -u);do llogTotal=$(vsish -e get /vmkModules/lsom/disks/$ssd/info|grep "Log space consumed by LLOG"|awk -F : '{print $2}');plogTotal=$(vsish -e get /vmkModules/lsom/disks/$ssd/info|grep "Log space consumed by PLOG"|awk -F : '{print $2}');llogGib=$(echo $llogTotal |awk '{print $1 / 1073741824}');plogGib=$(echo $plogTotal |awk '{print $1 / 1073741824}');allGibTotal=$(expr $llogTotal + $plogTotal|awk '{print $1 / 1073741824}');echo $ssd;echo " LLOG consumption: $llogGib";echo " PLOG consumption: $plogGib";echo " Total log consumption: $allGibTotal";done;sleep 30; done ; Sample output Fri Feb 12 06:40:51 UTC 2021 529dd4dc--xxxx-xxxx-xxxx-xxxxxxxxxxxx memCongestion:0 >> This value is higher than 0 ( ranger 0-250 ) slabCongestion:0 ssdCongestion:0 iopsCongestion:0 logCongestion:0 compCongestion:0 memCongestionLocalMax:0 slabCongestionLocalMax:0 ssdCongestionLocalMax:0 iopsCongestionLocalMax:0 logCongestionLocalMax:0 compCongestionLocalMax:0 529dd4dc-xxxx-xxxx-xxxx-xxxxxxxxxxxxxx LLOG consumption: 0.270882 PLOG consumption: 0.632553 Total log consumption: 0.903435 Script 2 vsish -e ls /vmkModules/lsom/disks/ 2>/dev/null | while read d ; do echo -n ${d/\//} ; vsish -e get /vmkModules/lsom/disks/${d}WBQStats | grep "Number of elements in commit tables" ; done | grep -v ":0$" Sample output(This is on cache disks only; you can ignore any results of capacity disks) 52f395f3-03fd-f005-bf02-40287362403b/ Number of elements in commit tables:300891 526709f4-8790-8a91-2151-a491e2d3aec5/ Number of elements in commit tables:289371

Cause

Scrubber configuration values were modified in vSAN 6.7 P04, and vSAN 7.0 U1 P02 releases to scrub objects at a higher frequency. This change results in persisting scrubber progress of each object more frequently than before. If there are idle objects in the cluster, then the scrubber accumulates commit table entries for these objects at LSOM. Eventually, the accumulation leads to LSOM memory congestion.Idle objects in this context refer to objects which are unassociated, powered off VMs, replicated objects, and so forth.

Resolution

Long-term resolution: update to 4.7.530 or later or 7.0.200 or laterIf a host has a high number of elements in the commit tables, as determined in script 2, one of the two following steps is recommended to clear the congestion. Put the problem host in maintenance mode with Ensure Accessibility, then reboot the host.Unmount and remount each host's disk groups using Ensure Accessibility. You may have to do this on multiple nodes one at a time in the cluster.Workaround:If unable to upgrade, for now still implement the following advanced settings changes to mitigate against this issue occurring. Change scrubber frequency to once per year: esxcfg-advcfg -s 1 /VSAN/ObjectScrubsPerYear Disable scrubber persist timer: esxcfg-advcfg -s 0 /VSAN/ObjectScrubPersistMin

Support Cases

Relevant Products

Click on a version to see all relevant bugs

Affected versions:3

Fixed versions: No known fixed versions

Relevant Products

Click on a version to see all relevant bugs

Affected versions:3

Fixed versions: No known fixed versions

Top Dell Defects

Defect ID: 396062
Precision 7875 Tower with Certain CPU Configuration Does Not Detect SD Cards
Defect ID: 56957
Dell Unity: Sync Replication status Lost Sync Communication (0x840e)
Defect ID: 196966
VxRail: Nodes May Experience High LSOM Congestion
Defect ID: 222345
VxRail: Recover Plug-in When the Remote Plug-in Primary Server Is Down
Defect ID: 203863
VxRail: Physical View Shows Node Health as Critical

Ready to prevent the next vendor outage?

Get a demo

Dell - Defect ID: 196966

VxRail: Nodes May Experience High LSOM Congestion

Dell - Defect ID: 196966

VxRail: Nodes May Experience High LSOM Congestion

Last updated on December 17th, 2025

BugZero Risk Score0.0 Coming soon

Bug Details

Symptoms

Cause

Resolution

Support Cases

Top Dell Defects

Ready to prevent the next vendor outage?

Links

BugZero Risk Score
0.0 Coming soon