
Begin backend scheduler/worker for consistent compaction #4770

Open · wants to merge 95 commits into main
Conversation

@zalegrala (Contributor) commented Feb 28, 2025

What this PR does:

Here we add two new target modules, BackendScheduler and BackendWorker. At present, these modules work together to replace the function of the current Compactor module. They are not included in any of the default targets and must be explicitly enabled in order to function.

The BackendScheduler is responsible for scheduling compaction jobs, tracking their status, and persisting that status to the backend so it can be reloaded at startup. The BackendWorker is responsible for picking up these jobs and executing them, and also for writing the tenant index to object storage. Sharding of which tenants are owned by which worker is handled by the ring, which is copied from the current Compactor module. A small amount of duplicated work on the tenant index during scaling events is a non-issue.

Tenant fairness is achieved with a new tenantselector package, which aims to ensure that all tenants get compacted and that no tenant is neglected for too long. This is a simple approach and not perfect, but it should work for our use case. With the current implementation, the blocklist, outstanding blocks, and the last compaction time are taken into account to determine which tenants need to have jobs scheduled. Eventually, if a tenant has not been compacted for a long time, it will become the priority no matter the length of its blocklist.

Why we need it:

During scaling events of the compactors, the ring may not be fully propagated, and there is a race between compactors that may already be executing a compaction job for a given block and new compactors just joining the ring. This can lead to a small amount of duplicated data in the backend until the ring is fully propagated and stable. For small environments this may never surface as an issue, but in large environments that wish to autoscale their compactors with load, it can be problematic. Additionally, we want to rely on RF1 data in the backend for future work.

Completed and failed jobs are dropped from the state after 1 hour.

Known issues to follow up:

The output blocks are not idempotent, since the destination block ID is not known until the compaction is complete. Including the target block ID in the output block is a smallish change to the encoding package. It is not included in this PR, but we can follow up.

The worker does not wait for in-flight jobs to complete before shutting down. This is something I would like to resolve, but it is not included in this PR.

Tenant retention jobs are not included in this PR, but I expect this to be a fast follow with the current pattern.

Which issue(s) this PR fixes:
Fixes #

Checklist

  • Tests updated
  • Documentation added
  • CHANGELOG.md updated - the order of entries should be [CHANGE], [FEATURE], [ENHANCEMENT], [BUGFIX]

@zalegrala zalegrala force-pushed the beingBackendScheduler branch from 5f0f32f to a8a7ca5 Compare February 28, 2025 15:57
@zalegrala zalegrala force-pushed the beingBackendScheduler branch from 007fc0c to 4614192 Compare March 5, 2025 16:54
var reader backend.RawReader
var writer backend.RawWriter

switch t.cfg.StorageConfig.Trace.Backend {
@zalegrala (Contributor, Author) commented:
I don't love this. I copied the pattern we have in the usagestats module, but perhaps a storage.Store interface extension makes sense here. I'm open to thoughts.
