
schemachanger: fix race in distributed merge checkpoint updates with lazy SST cleanup#163776

Merged
trunk-io[bot] merged 1 commit into cockroachdb:master from
spilchen:gh-162330/260217/1441/merge/early-sst-removal-bugfix
Feb 18, 2026

Conversation

@spilchen (Contributor) commented on Feb 17, 2026

Previously, distributed merge checkpoint progress updates had a race with SST cleanup. SetBackfillProgress only updates in-memory state; a background goroutine persists the checkpoint to disk at regular intervals. However, SST cleanup was triggered immediately after calling SetBackfillProgress, before the updated checkpoint was durably written. This could result in temporary SSTs being removed too early if a distributed merge phase transition occurred before the checkpoint hit disk.

This change moves SST cleanup into the backfill progress tracker and performs it lazily. Cleanup now runs only after the checkpoint has been persisted and a distributed merge phase transition has completed. This ensures that old temporary SSTs are removed only once the corresponding progress is durable, eliminating the race.

Fixes #162330
Fixes #163264
Epic: CRDB-48845

Release note: None

@spilchen self-assigned this on Feb 17, 2026

trunk-io bot commented Feb 17, 2026

😎 Merged successfully - details.

@cockroach-teamcity (Member)

This change is Reviewable

@spilchen force-pushed the gh-162330/260217/1441/merge/early-sst-removal-bugfix branch from 8bce049 to 0974e8d on February 18, 2026 13:07
@spilchen changed the title from "schemachanger: fix race in checkpoint updates with lazy SST cleanup" to "schemachanger: fix race in distributed merge checkpoint updates with lazy SST cleanup" on Feb 18, 2026
@spilchen marked this pull request as ready for review on February 18, 2026 13:09
@spilchen requested a review from a team as a code owner on February 18, 2026 13:09
@fqazi (Collaborator) left a comment

:lgtm:

One question related to the flushing behaviour

@fqazi reviewed 10 files and all commit messages, and made 2 comments.
Reviewable status: :shipit: complete! 1 of 0 LGTMs obtained (waiting on spilchen).


pkg/sql/schemachanger/scexec/backfiller/tracker.go line 284 at r1 (raw file):

	// After successful write, detect phase transitions and handle cleanup.
	if b.cleaner != nil {

Do we immediately flush on these transitions? Or do we rely on the default timer based behavior?

@spilchen (Contributor, Author) left a comment

@spilchen made 1 comment.
Reviewable status: :shipit: complete! 1 of 0 LGTMs obtained (waiting on fqazi).


pkg/sql/schemachanger/scexec/backfiller/tracker.go line 284 at r1 (raw file):

Previously, fqazi (Faizan Qazi) wrote…

Do we immediately flush on these transitions? Or do we rely on the default timer based behavior?

We rely on the timer-based behaviour. In an earlier attempt at fixing this, I added a FlushCheckpoint() call during phase transitions, but that introduced new timing scenarios when two goroutines call FlushCheckpoint() at the same time. So I opted for the lazy cleanup once the checkpoint has been persisted. This ensures only one goroutine will ever be calling FlushCheckpoint().

@spilchen (Contributor, Author)

TFTR!

@spilchen (Contributor, Author)

/trunk merge

trunk-io bot merged commit 13fafdc into cockroachdb:master on Feb 18, 2026
28 checks passed

Development

Successfully merging this pull request may close these issues.

sql: TestDistributedMergeStoragePrefixPreservedAcrossPauseResume failed
sql: TestDistributedMergeResumePreservesProgress failed

3 participants
