Add Litestream replica stress test #340

LudovicoRighi · 2024-07-29T17:25:15Z

As suggested here: https://github.com/alexhsamuel/apsis/pull/338#issuecomment-2241202464, I'm adding a stress test.
In particular, in the test 500 runs are started immediately and other 500 are scheduled with a small delay.

@alexhsamuel let me know if this is a reasonable load to run the test with.

The test takes ~20 sec to pass.

alexhsamuel · 2024-08-01T19:10:18Z

test/int/litestream/test_replica.py

+
+            client = inst.client
+            # populate apsis db with a large number of runs
+            with ThreadPoolExecutor() as exe:


Have you tried this without the executor? The API handler is single-threaded async, so it's not clear to me that concurrent access will be significantly faster than sequential.

I was in fact getting very similar results, that would explain why

OK, so I suggest removing the executor then, to keep things simple.

Ok sounds good

alexhsamuel · 2024-08-01T19:10:56Z

test/int/litestream/test_replica.py

+            # populate apsis db with a large number of runs
+            with ThreadPoolExecutor() as exe:
+                runs_fut = [exe.submit(client.schedule, "sleep", {"duration": "9"}) for _ in range(num_jobs)]
+                sched_runs_fut = [exe.submit(client.schedule, "sleep", {"duration": "1"}, time=ora.now() + 14) for _ in range(num_jobs)]


Where does the 9 and 14 come from? Did you determine this empirically while running it yourself? Seems like the right numbers to use will depend both on num_jobs as well as on the environment the test is running in.

Did you determine this empirically while running it yourself?

Correct.

Seems like the right numbers to use will depend both on num_jobs as well as on the environment the test is running in.

I could easily make these durations dependent on num_jobs; as per the environment, which factors would you take into account?

per the environment, which factors would you take into account?

Nothing useful to suggest. Just keep in mind that the VM that runs the GH Agent CI might be somewhat slower than your hosts.

Ultimately it would be good to build an integration test setup for Apsis with deterministic time, i.e. the test can roll application time forward explicitly, to avoid these kind of timing issues.

Understood, thank you

alexhsamuel · 2024-08-09T12:55:36Z

Great!

Add litestream replica stress test

caa9324

LudovicoRighi force-pushed the litestream-stress-test branch from 39afc5c to caa9324 Compare July 29, 2024 17:30

Unify final assertions

075ea6d

alexhsamuel reviewed Aug 1, 2024

View reviewed changes

Remove ThreadPoolExecutor

8a971a0

LudovicoRighi requested a review from alexhsamuel August 9, 2024 11:27

alexhsamuel merged commit 2916e79 into apsis-scheduler:master Aug 9, 2024
1 check passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add Litestream replica stress test #340

Add Litestream replica stress test #340

LudovicoRighi commented Jul 29, 2024 •

edited

Loading

alexhsamuel Aug 1, 2024

LudovicoRighi Aug 2, 2024

alexhsamuel Aug 2, 2024

LudovicoRighi Aug 9, 2024

alexhsamuel Aug 1, 2024

LudovicoRighi Aug 2, 2024 •

edited

Loading

alexhsamuel Aug 2, 2024

LudovicoRighi Aug 9, 2024

alexhsamuel commented Aug 9, 2024

Add Litestream replica stress test #340

Add Litestream replica stress test #340

Conversation

LudovicoRighi commented Jul 29, 2024 • edited Loading

alexhsamuel Aug 1, 2024

Choose a reason for hiding this comment

LudovicoRighi Aug 2, 2024

Choose a reason for hiding this comment

alexhsamuel Aug 2, 2024

Choose a reason for hiding this comment

LudovicoRighi Aug 9, 2024

Choose a reason for hiding this comment

alexhsamuel Aug 1, 2024

Choose a reason for hiding this comment

LudovicoRighi Aug 2, 2024 • edited Loading

Choose a reason for hiding this comment

alexhsamuel Aug 2, 2024

Choose a reason for hiding this comment

LudovicoRighi Aug 9, 2024

Choose a reason for hiding this comment

alexhsamuel commented Aug 9, 2024

LudovicoRighi commented Jul 29, 2024 •

edited

Loading

LudovicoRighi Aug 2, 2024 •

edited

Loading