OOBM Issue during KVM HA #12353
Replies: 12 comments 12 replies
-
|
@muthukrishnang1100 Your error message from the log does not directly belong to the KVM HA setup. The CloudStack management server does not get a connection to the database. Please check your database server and connection limits. |
Beta Was this translation helpful? Give feedback.
-
|
@muthukrishnang1100 you may check this:
under regular circumstances check:
What latency do you have between management and your hypervisor? |
Beta Was this translation helpful? Give feedback.
-
|
@muthukrishnang1100 Okay, then let's have a look at your DB. Please check the following values on your DB. Before and after you enable OOBM. SHOW VARIABLES LIKE 'max_connections'; |
Beta Was this translation helpful? Give feedback.
-
|
@muthukrishnang1100 you're having connection issues to your DB. OOBM is just the trigger. Check for "abandoned connection" messages in your management log. Also check and paste this while you having the issue: You also may want to try these in your /etc/cloudstack/management/db.properties |
Beta Was this translation helpful? Give feedback.
-
Beta Was this translation helpful? Give feedback.
-
|
@muthukrishnang1100 Thank you for the details. Please check this:
|
Beta Was this translation helpful? Give feedback.
-
|
So it's a question of the right Galera + HAProxy + Keepalived VIP setup.
If you still having issues try without the VIP. |
Beta Was this translation helpful? Give feedback.
-
|
Can you confirm you still facing the issue without VIP/HAProxy layer? Please paste your HAProxy config. Especially DB listener/backend section. Provide the logs around the event. |
Beta Was this translation helpful? Give feedback.
-
|
The root cause is HAProxy TCP timeouts (50 seconds) applied to the MariaDB VIP. CloudStack keeps long-lived pooled DB connections; HAProxy closes idle TCP sessions after 50 seconds, causing stale/broken pooled connections. Eventually CloudStack’s DB pool becomes exhausted and throws Timeout waiting for idle object. Update your proxy config and reload the haproxy: backend galera_backend Update your /etc/cloudstack/management/db.properties to: |
Beta Was this translation helpful? Give feedback.
-
|
Your VMs are not restarted due to insufficient capacity: Unable to restart VM ... due to Unable to create a deployment Check resources and tagging. |
Beta Was this translation helpful? Give feedback.
-
|
Can you manually start the VM from the "failed" host on one of the others? What servers are your system VMs are running on, especially the virtual router? |
Beta Was this translation helpful? Give feedback.
-
|
@muthukrishnang1100 Share full log files from management servers ( /var/log/cloudstack/management/management-server.log) to understand why the manual start is failing. The snippet you shared is not very useful to identify the issue |
Beta Was this translation helpful? Give feedback.












Uh oh!
There was an error while loading. Please reload this page.
-
Hi, I have an issue. I am trying to Test KVM HA test on my 3 host cloudstack. When I enable the OOBM on any one of the host 5 to minutes later the error comes. and after that node not able to login dashboard. Please anyone face this isuue if please what i do for this issue? and also after i disable and restart the Cloudstack management it will works fine no error.
Caused by: java.sql.SQLException: Cannot get a connection, pool error Timeout waiting for idle object
at org.apache.commons.dbcp2.PoolingDataSource.getConnection(PoolingDataSource.java:147)
at com.cloud.utils.db.TransactionLegacy.getConnection(TransactionLegacy.java:557)
at com.cloud.utils.db.TransactionLegacy.prepareStatement(TransactionLegacy.java:468)
... 30 more
Caused by: java.util.NoSuchElementException: Timeout waiting for idle object
at org.apache.commons.pool2.impl.GenericObjectPool.borrowObject(GenericObjectPool.java:446)
at org.apache.commons.pool2.impl.GenericObjectPool.borrowObject(GenericObjectPool.java:356)
at org.apache.commons.dbcp2.PoolingDataSource.getConnection(PoolingDataSource.java:141)
... 32 more
2025-12-30 12:51:49,231 ERROR [c.c.s.StatsCollector] (StatsCollector-1:ctx-2c2139dd) (logid:e41447a0) Error trying to monitor autoscaling
com.cloud.utils.exception.CloudRuntimeException: DB Exception on: null
at com.cloud.utils.db.GenericDaoBase.searchIncludingRemoved(GenericDaoBase.java:433)
at com.cloud.utils.db.GenericDaoBase.searchIncludingRemoved(GenericDaoBase.java:362)
at com.cloud.utils.db.GenericDaoBase.listIncludingRemovedBy(GenericDaoBase.java:960)
at com.cloud.utils.db.GenericDaoBase.listBy(GenericDaoBase.java:939)
at com.cloud.cluster.dao.ManagementServerHostDaoImpl.findOneInUpState(ManagementServerHostDaoImpl.java:270)
at jdk.internal.reflect.GeneratedMethodAccessor284.invoke(Unknown Source)
at java.base/jdk.internal.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
at java.base/java.lang.reflect.Method.invoke(Method.java:566)
at org.springframework.aop.support.AopUtils.invokeJoinpointUsingReflection(AopUtils.java:344)
at org.springframework.aop.framework.ReflectiveMethodInvocation.invokeJoinpoint(ReflectiveMethodInvocation.java:198)
at org.springframework.aop.framework.ReflectiveMethodInvocation.proceed(ReflectiveMethodInvocation.java:163)
at com.cloud.utils.db.TransactionContextInterceptor.invoke(TransactionContextInterceptor.java:34)
at org.springframework.aop.framework.ReflectiveMethodInvocation.proceed(ReflectiveMethodInvocation.java:175)
at org.springframework.aop.interceptor.ExposeInvocationInterceptor.invoke(ExposeInvocationInterceptor.java:97)
at org.springframework.aop.framework.ReflectiveMethodInvocation.proceed(ReflectiveMethodInvocation.java:186)
at org.springframework.aop.framework.JdkDynamicAopProxy.invoke(JdkDynamicAopProxy.java:215)
at com.sun.proxy.$Proxy161.findOneInUpState(Unknown Source)
at com.cloud.server.StatsCollector$AutoScaleMonitor.runInContext(StatsCollector.java:1766)
at org.apache.cloudstack.managed.context.ManagedContextRunnable$1.run(ManagedContextRunnable.java:48)
at org.apache.cloudstack.managed.context.impl.DefaultManagedContext$1.call(DefaultManagedContext.java:55)
at org.apache.cloudstack.managed.context.impl.DefaultManagedContext.callWithContext(DefaultManagedContext.java:102)
at org.apache.cloudstack.managed.context.impl.DefaultManagedContext.runWithContext(DefaultManagedContext.java:52)
at org.apache.cloudstack.managed.context.ManagedContextRunnable.run(ManagedContextRunnable.java:45)
at java.base/java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:515)
at java.base/java.util.concurrent.FutureTask.runAndReset(FutureTask.java:305)
at java.base/java.util.concurrent.ScheduledThreadPoolExecutor$ScheduledFutureTask.run(ScheduledThreadPoolExecutor.java:305)
at java.base/java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1128)
at java.base/java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:628)
at java.base/java.lang.Thread.run(Thread.java:829)
Caused by: java.sql.SQLException: Cannot get a connection, pool error Timeout waiting for idle object
at org.apache.commons.dbcp2.PoolingDataSource.getConnection(PoolingDataSource.java:147)
at com.cloud.utils.db.TransactionLegacy.getConnection(TransactionLegacy.java:557)
at com.cloud.utils.db.TransactionLegacy.prepareStatement(TransactionLegacy.java:468)
... 30 more
Caused by: java.util.NoSuchElementException: Timeout waiting for idle object
at org.apache.commons.pool2.impl.GenericObjectPool.borrowObject(GenericObjectPool.java:446)
at org.apache.commons.pool2.impl.GenericObjectPool.borrowObject(GenericObjectPool.java:356)
at org.apache.commons.dbcp2.PoolingDataSource.getConnection(PoolingDataSource.java:141)
... 32 more
2025-12-30 12:51:49,240 ERROR [c.c.n.r.VirtualNetworkApplianceManagerImpl] (RouterStatusMonitor-1:ctx-8d4322d8) (logid:dcafa96b) Fail to complete the CheckRouterTask!
com.cloud.utils.exception.CloudRuntimeException: DB Exception on: null
at com.cloud.utils.db.GenericDaoBase.searchIncludingRemoved(GenericDaoBase.java:433)
at com.cloud.utils.db.GenericDaoBase.searchIncludingRemoved(GenericDaoBase.java:362)
Beta Was this translation helpful? Give feedback.
All reactions