fix(core): Resolve multi-main startup race condition in AuthRolesService by afitzek · Pull Request #26176 · n8n-io/n8n

afitzek · 2026-02-24T09:59:36Z

Summary

Fixes a multi-main startup race condition where concurrent n8n instances running AuthRolesService.syncScopes() against the same Postgres database hit duplicate key constraint violations.

Introduces a centralized DbLockService in @n8n/db that wraps Postgres advisory locks (pg_advisory_xact_lock) inside transactions, with a DbLock enum to prevent lock ID collisions
Supports an optional timeoutMs on withLock (via SET LOCAL lock_timeout) and a non-blocking tryWithLock (via pg_try_advisory_xact_lock) that fails fast if the lock is already held
On SQLite, advisory locks are skipped — the transaction alone provides serialization
Refactors AuthRolesService.init() to use DbLockService.withLock(DbLock.AUTH_ROLES_SYNC, ...) instead of repository-level calls
Removes the leader-only guard in start.ts — all main instances now call AuthRolesService.init(), and the advisory lock serializes them safely
Lock contention throws OperationalError (timeout exceeded or try-lock busy)
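
A minimal sketch of how a lock helper along these lines could be structured. It is not the actual DbLockService code. The `Tx` and `Db` interfaces, the `LockBusyError` class, the free-function shape, and the enum value `1001` are assumptions made for illustration; only the SQL statements (`pg_advisory_xact_lock`, `pg_try_advisory_xact_lock`, `SET LOCAL lock_timeout`) are taken from the summary above.

```typescript
// Hypothetical minimal transaction surface; n8n's real types differ.
interface Tx {
  query(sql: string, params?: unknown[]): Promise<Array<Record<string, unknown>>>;
}

// Hypothetical stand-in for a connection that runs a callback inside a transaction.
interface Db {
  type: 'postgres' | 'sqlite';
  transaction<T>(fn: (tx: Tx) => Promise<T>): Promise<T>;
}

// All lock IDs live in one enum so two features cannot pick the same number.
enum DbLock {
  AUTH_ROLES_SYNC = 1001, // illustrative value, not the real ID
}

// Assumed error type for the "lock already held" case.
class LockBusyError extends Error {}

// Blocking variant: waits for the lock, optionally bounded by timeoutMs.
async function withLock<T>(
  db: Db,
  lock: DbLock,
  fn: (tx: Tx) => Promise<T>,
  timeoutMs?: number,
): Promise<T> {
  if (timeoutMs !== undefined && (!Number.isFinite(timeoutMs) || timeoutMs < 0)) {
    throw new RangeError('timeoutMs must be a non-negative finite number');
  }
  return db.transaction<T>(async (tx) => {
    if (db.type === 'postgres') {
      if (timeoutMs !== undefined) {
        // SET takes no bind parameters, so interpolate a validated integer (milliseconds).
        await tx.query(`SET LOCAL lock_timeout = ${Math.floor(timeoutMs)}`);
      }
      // Blocks until the lock is free; if lock_timeout expires, Postgres raises
      // an error (SQLSTATE 55P03), which this sketch simply lets propagate.
      await tx.query('SELECT pg_advisory_xact_lock($1)', [lock]);
    }
    // On SQLite the lock step is skipped; the transaction alone serializes.
    return fn(tx);
  });
}

// Non-blocking variant: fails fast if another session holds the lock.
async function tryWithLock<T>(db: Db, lock: DbLock, fn: (tx: Tx) => Promise<T>): Promise<T> {
  return db.transaction<T>(async (tx) => {
    if (db.type === 'postgres') {
      const rows = await tx.query('SELECT pg_try_advisory_xact_lock($1) AS acquired', [lock]);
      if (rows[0]?.acquired !== true) {
        throw new LockBusyError(`lock ${lock} is held by another session`);
      }
    }
    return fn(tx);
  });
}
```

Because the `_xact_` variants of the advisory lock functions are released automatically when the transaction ends, the sketch needs no explicit unlock step: a commit or rollback releases the lock.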

Test plan

10 unit tests for DbLockService (withLock + tryWithLock, Postgres/SQLite, timeout, error propagation)
8 integration tests against live database (transaction behavior, advisory lock serialization, timeout, try-lock contention)
27 existing AuthRolesService unit tests updated and passing
4 existing start.ts unit tests updated (non-leader now calls init)
Typecheck and lint clean on both @n8n/db and cli packages

Related Linear tickets, Github issues, and Community forum posts

closes https://linear.app/n8n/issue/IAM-298

Review / Merge checklist

PR title and summary are descriptive. (conventions)
Docs updated or follow-up ticket created.
Tests included.
PR Labeled with release/backport (if the PR is an urgent fix that needs to be backported)

codecov · 2026-02-24T10:03:01Z

Codecov Report

❌ Patch coverage is 57.89474% with 24 lines in your changes missing coverage. Please review.

Files with missing lines	Patch %	Lines
...ackages/@n8n/db/src/services/auth.roles.service.ts	8.00%	23 Missing ⚠️
packages/@n8n/db/src/services/index.ts	0.00%	1 Missing ⚠️

guillaumejacquart

Just a few questions

guillaumejacquart · 2026-02-24T13:41:51Z

packages/@n8n/db/src/services/auth.roles.service.ts

 		this.logger.debug('Initializing AuthRolesService...');
-		await this.syncScopes();
-		await this.syncRoles();
+		await this.dbLockService.withLock(DbLock.AUTH_ROLES_SYNC, async (tx) => {


Why not use tryWithLock? I feel like it would be faster, and it would fail only if another main instance is already doing the job.
Also, why not catch the OperationalError in case of a timeout? Do we want this to prevent the instance from starting?

With withLock, the OperationalError is only thrown when the timeout parameter is set, which it is not in this call. The reason for not using tryWithLock is that I want all instances to sync their changes, to avoid any potential misbehavior. The downside is that instances might wait for each other if they reach this point at the same time, but since this takes only a few ms, it should be fine during the bootstrap process.

phyllis-noester · 2026-02-24T13:48:10Z

packages/@n8n/db/src/services/auth.roles.service.ts

-		await this.syncScopes();
-		await this.syncRoles();
+		await this.dbLockService.withLock(DbLock.AUTH_ROLES_SYNC, async (tx) => {
+			await this.syncScopes(tx);


just to make sure I understand: the reason why we want to do this whenever a new instance starts is that the instance start could be related to a deployment that either added or removed scopes. correct?

Yes, correct. This approach saves us from adding a DB migration for every scope that we want to add.

given that we don't do rolling updates, there is no scenario where there could be a race condition and the scopes remain in the old state right?
I was trying to think of edge cases e.g.:

a container restarts for non deployment related reason

we deploy (milliseconds after)

container restart locks the table

deploy does not sync roles
=> scopes remain in old state

but I assume that would only be possible if we did rolling updates and even then it would be super unlikely

If the container restarts, it acquires the advisory lock and syncs its roles. The deployment (the new version) reaches this point and waits for the advisory lock to be released. One of the new instances then acquires the lock and syncs its roles while the others still wait. So the instances pass this point one after another.

ah i see, that's essentially the answer to guillaume's question :)
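
The "one after another" behavior described in this thread can be illustrated with an in-process simulation. This is only a sketch: the `FifoLock` class stands in for the Postgres advisory lock and assumes waiters are served in arrival order, and the instance names and scope strings (`example:read`, `example:write`) are hypothetical.

```typescript
// In-process stand-in for the advisory lock: callers queue up and run one at a time.
class FifoLock {
  private tail: Promise<void> = Promise.resolve();

  run<T>(fn: () => Promise<T>): Promise<T> {
    const result = this.tail.then(fn);
    // Keep the queue moving even if fn rejects.
    this.tail = result.then(
      () => undefined,
      () => undefined,
    );
    return result;
  }
}

const sleep = (ms: number) => new Promise<void>((resolve) => setTimeout(resolve, ms));

async function simulateStartup(): Promise<{ order: string[]; scopes: string[] }> {
  const lock = new FifoLock();
  const order: string[] = [];
  let scopes: string[] = []; // shared "database" state

  // Each instance syncs the scopes defined in its own code while holding the lock.
  const start = (name: string, codeScopes: string[]) =>
    lock.run(async () => {
      order.push(`${name}:begin`);
      await sleep(5); // pretend the sync takes a few ms
      scopes = [...codeScopes];
      order.push(`${name}:end`);
    });

  // The restarted old container arrives first, followed by two new-version mains.
  await Promise.all([
    start('restarted-old', ['example:read']),
    start('new-main-1', ['example:read', 'example:write']),
    start('new-main-2', ['example:read', 'example:write']),
  ]);

  return { order, scopes };
}
```

In this model no two syncs overlap, and the final state is whatever the last instance through the lock wrote, which here is the new version's scope list.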

n8n-assistant · 2026-03-03T08:15:44Z

Got released with n8n@2.11.0

Add db lock service, and use it for auth role synchronization

8450fec

n8n-assistant bot added the core (Enhancement outside /nodes-base and /editor-ui) and n8n team (Authored by the n8n team) labels Feb 24, 2026

afitzek added 2 commits February 24, 2026 12:04

Fix tests for race condition

4eae67f

Fix tests in CI

581c8dc

afitzek marked this pull request as ready for review February 24, 2026 12:44

afitzek requested review from a team, BGZStephen, cstuncsik, guillaumejacquart and phyllis-noester and removed request for a team February 24, 2026 12:45

guillaumejacquart reviewed Feb 24, 2026

phyllis-noester reviewed Feb 24, 2026

guillaumejacquart approved these changes Feb 24, 2026

afitzek added this pull request to the merge queue Feb 24, 2026

Merged via the queue into master with commit 5a85a4f Feb 24, 2026
79 checks passed

afitzek deleted the iam-298-multi-main-startup-race-condition-in branch February 24, 2026 18:29

n8n-assistant bot mentioned this pull request Mar 2, 2026

🚀 Release 2.11.0 #26426

Merged

Tuukkaa pushed a commit that referenced this pull request Mar 2, 2026

fix(core): Resolve multi-main startup race condition in AuthRolesService (#26176)

58897b2

This was referenced Mar 3, 2026

🚀 Release 2.11.0 #26455

Closed

🚀 Release 2.11.0 #26456

Merged

n8n-assistant bot added the Released label Mar 3, 2026

afitzek merged 3 commits into master from iam-298-multi-main-startup-race-condition-in
