I have a single server running PSU v3 with a SQL backend on MSSQL Server 2019, where the database is in an availability group. With just one job in PSU, my transaction log is growing out of control at around 25 GB per hour. Is this expected behavior? Is there any tuning I can do to MSSQL and/or PSU to trim down the size of the transaction log?
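For context, this is roughly what I have been running to watch the log and to see what is preventing it from truncating; the database name is just a placeholder for our PSU database:

-- Current log size and percentage used for every database.
DBCC SQLPERF(LOGSPACE);

-- What is blocking log truncation for the PSU database
-- (e.g. LOG_BACKUP, ACTIVE_TRANSACTION, or AVAILABILITY_REPLICA when the
-- secondary in the availability group has not caught up yet).
SELECT name, recovery_model_desc, log_reuse_wait_desc
FROM sys.databases
WHERE name = 'UniversalAutomation';  -- placeholder PSU database name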
This is not expected behavior. Another user mentioned this same problem, but they resolved it by shrinking the database log (I believe with DBCC SHRINKFILE), so we never got to the bottom of why it happened.
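If you need to reclaim the space in the meantime, the shrink would have looked roughly like this; the database and logical file names are placeholders, and in an availability group the log will only shrink after it has been truncated, i.e. once the secondary has caught up and a log backup has run:

-- Find the logical name of the log file ('UniversalAutomation' is a placeholder).
USE UniversalAutomation;
SELECT name, type_desc, size * 8 / 1024 AS size_mb
FROM sys.database_files;

-- Shrink the log file back down; the target size (in MB) is an example value.
DBCC SHRINKFILE (N'UniversalAutomation_log', 1024);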
Some more info would be good:
How many jobs are you running per hour?
Are you storing a lot of pipeline data with your jobs?
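If you want to see where the space is actually going, a per-table size breakdown like the one below will usually show whether it is job/pipeline output or the Hangfire tables that are growing (run it in the PSU database; nothing in it is PSU-specific):

-- Reserved space and row counts per table, largest first.
SELECT OBJECT_SCHEMA_NAME(ps.object_id) AS schema_name,
       OBJECT_NAME(ps.object_id)        AS table_name,
       SUM(CASE WHEN ps.index_id IN (0, 1) THEN ps.row_count ELSE 0 END) AS row_count,
       SUM(ps.reserved_page_count) * 8 / 1024 AS reserved_mb
FROM sys.dm_db_partition_stats AS ps
JOIN sys.objects AS o
  ON o.object_id = ps.object_id
 AND o.is_ms_shipped = 0
GROUP BY ps.object_id
ORDER BY reserved_mb DESC;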
I’m running a 3-node PSU cluster with a SQL Server backend. It’s not super busy, but it grabs all the running processes every minute just to inflate the database.
My job queues are healthy and nothing is backed up.
This is our first PSU v3 server, and I’m still getting it set up. We have one job that runs every 5 minutes; the job is set to discard the pipeline (though there is very little output anyway). We also have two or three dashboards running as I test.
I generally have no jobs in the queue; however, I am seeing MANY GroomService.Groom jobs scheduled, and over 2k failed Groom jobs.
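For what it’s worth, the failed job counts can also be pulled straight from the Hangfire tables with something like this; I’m assuming PSU uses Hangfire’s default HangFire schema, so adjust if yours differs:

-- Jobs by state across the whole Hangfire store.
SELECT StateName, COUNT(*) AS job_count
FROM [HangFire].[Job]
GROUP BY StateName
ORDER BY job_count DESC;

-- Just the groom jobs.
SELECT StateName, COUNT(*) AS groom_job_count
FROM [HangFire].[Job]
WHERE InvocationData LIKE '%GroomService%'
GROUP BY StateName;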
I just emailed a log over, but it looks like a pretty generic error.
2022-09-21 00:03:00.614 -05:00 [ERR] Failed to process the job '23464': an exception occurred.
Hangfire.Storage.DistributedLockTimeoutException: Timeout expired. The timeout elapsed prior to obtaining a distributed lock on the 'HangFire:GroomService.Groom' resource.
at Hangfire.SqlServer.SqlServerDistributedLock.Acquire(IDbConnection connection, String resource, TimeSpan timeout)
at Hangfire.SqlServer.SqlServerConnection.AcquireLock(String resource, TimeSpan timeout)
at Hangfire.SqlServer.SqlServerConnection.AcquireDistributedLock(String resource, TimeSpan timeout)
at Hangfire.DisableConcurrentExecutionAttribute.OnPerforming(PerformingContext filterContext)
at Hangfire.Profiling.ProfilerExtensions.InvokeAction[TInstance](InstanceAction`1 tuple)
at Hangfire.Profiling.SlowLogProfiler.InvokeMeasured[TInstance,TResult](TInstance instance, Func`2 action, String message)
at Hangfire.Profiling.ProfilerExtensions.InvokeMeasured[TInstance](IProfiler profiler, TInstance instance, Action`1 action, String message)
at Hangfire.Server.BackgroundJobPerformer.InvokePerformFilter(IServerFilter filter, PerformingContext preContext, Func`1 continuation)
I’m able to reproduce a similar problem when forcing the groom job to hang. Additional groom jobs will be queued up, wait for the distributed lock, fail to receive the lock and then reschedule.
This is not the correct behavior. The incoming groom jobs should be cancelled and not requeued if they cannot access the lock.
There is still an underlying issue causing the groom job to hang, which may be the root cause here, but I will have to try to reproduce that myself to see if we can get to the bottom of it. I’ll let you know if I need any more information.