
feat(infra): concurrent materializer tests #1243

Open

wants to merge 27 commits into main

Conversation

@LePremierHomme (Contributor) commented Dec 28, 2024

This PR introduces concurrent tests in the materializer, enabling the generation of traceable workloads without significantly altering how test scenarios are written.

Existing tests have been refactored to use make_testnet instead of with_testnet.

Progress

  • scenario 1: native coin transfer (docker_tests::benches::test_native_coin_transfer)
  • scenario 2: contract deployment (docker_tests::benches::test_contract_deployment)
  • scenario 3: contract call (docker_tests::benches::test_contract_call)
  • framework: assert that block gas limit isn't the TPS bottleneck
  • framework: TPS reporting
  • framework: latency reporting
  • framework: reach 1,000 max concurrency
  • framework: reduce repeated scenario test code
  • framework: standardize report publication and regression checks
  • framework: NonceManager was introduced because get_transaction_count proved unreliable; this needs to be double-checked

For follow-up PRs

  • Create manifest files dynamically
  • Introduce profiling features
  • Introduce tracing / statistical analysis features


@LePremierHomme LePremierHomme requested a review from a team as a code owner December 28, 2024 20:52
@LePremierHomme LePremierHomme marked this pull request as draft December 28, 2024 20:56
@LePremierHomme LePremierHomme changed the title feat(tests): concurrent materializer tests [WIP] feat(tests): concurrent materializer tests Dec 28, 2024
@LePremierHomme LePremierHomme changed the title [WIP] feat(tests): concurrent materializer tests feat(test): concurrent materializer tests Jan 7, 2025
@LePremierHomme LePremierHomme marked this pull request as ready for review January 7, 2025 12:23
@LePremierHomme LePremierHomme changed the title feat(test): concurrent materializer tests feat(infra): concurrent materializer tests Jan 7, 2025
}

pub async fn record(&mut self, label: String) {
let duration = self.start_time.unwrap().elapsed();
Contributor

I feel expect here is better, if you assume the caller knows that calling "start" should happen first.

Contributor Author

I'll revise this API once the reporting summary is more solid.


#[derive(Default)]
pub struct NonceManager {
nonces: Arc<Mutex<HashMap<H160, U256>>>,
Contributor

I think this is a bottleneck as well: every address is waiting on the same lock. Maybe this might help: https://github.com/xacrimon/dashmap

Contributor Author

Yes, this is just a temporary solution, I was hoping to remove it entirely. If not, I'll optimize it.
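For illustration, a std-only sketch of the shared-lock pattern being discussed (simplified types: String and u64 stand in for H160 and U256; this is not the actual PR code). Every caller contends on the same Mutex, which is the contention point the review raises:

```rust
use std::collections::HashMap;
use std::sync::{Arc, Mutex};

// Simplified stand-in for the NonceManager under review: one global lock
// guards the whole address -> nonce map, so all senders serialize on it.
#[derive(Default, Clone)]
pub struct NonceManager {
    nonces: Arc<Mutex<HashMap<String, u64>>>,
}

impl NonceManager {
    // Returns the next nonce for `addr`, incrementing the stored value
    // under the single shared lock.
    pub fn next(&self, addr: &str) -> u64 {
        let mut nonces = self.nonces.lock().unwrap();
        let entry = nonces.entry(addr.to_string()).or_insert(0);
        let nonce = *entry;
        *entry += 1;
        nonce
    }
}
```

Sharding the map per address (or removing the manager entirely, as suggested above) would avoid serializing unrelated senders on one lock.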

Contributor

I think it would be good to try the built-in NonceManager from Ethers. Was there any problem with that?

Contributor

Please do not use dashmap. Last time I checked ~6 months ago, it still had soundness issues for async code.

@karlem karlem self-requested a review January 20, 2025 17:48
@karlem (Contributor) commented Jan 21, 2025

I have a suggestion about how to improve the framework design to make it cleaner and more intuitive. While the current implementation works, there are some areas where terminology and structure can be refined to improve clarity and usability. Consider the following approach:

  1. BenchmarkRunner

The BenchmarkRunner should:

Orchestrate the entire benchmarking process.

Execute each BenchmarkStep sequentially within the specified duration limit.

struct BenchmarkRunner {
    steps: Vec<BenchmarkStep>,
    max_duration: Duration,
}

impl BenchmarkRunner {
    fn new(steps: Vec<BenchmarkStep>, max_duration: Duration) -> Self;
    fn run(&self) -> BenchmarkResult;
}
  2. BenchmarkStep

The BenchmarkStep would be similar to the current ExecutionStep, but it would encapsulate a specific test function, making it more modular and allowing different functions to run within a single test.

struct BenchmarkStep<F>
where
    F: Fn(TestInput) -> TestResult + Send + Sync + 'static {
    concurrency: usize,      // Number of concurrent test executions (N)
    run_duration: Duration,  // Execution time duration (in seconds)
    test_fn: Arc<F>,
}

impl<F> BenchmarkStep<F>
where
    F: Fn(TestInput) -> TestResult + Send + Sync + 'static
{
    fn execute(&self, stop_flag: Arc<AtomicBool>) -> StepResult;
}

The stop_flag (using AtomicBool) is used to stop execution gracefully if the overall test time has expired or in case of any other issue. This is just a suggestion—other mechanisms, such as a Signal abstraction, could also be considered.
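A minimal sketch of that stop_flag pattern, assuming a synchronous worker loop (names and shape are illustrative, not the proposed final API): the worker checks the shared flag between iterations and exits gracefully once the runner sets it.

```rust
use std::sync::atomic::{AtomicBool, Ordering};

// Run `test_fn` repeatedly until the shared stop flag is set, returning
// how many executions completed. In the real runner the flag would be set
// by a timer once the overall benchmark duration expires.
pub fn run_until_stopped(stop_flag: &AtomicBool, mut test_fn: impl FnMut()) -> usize {
    let mut completed = 0;
    while !stop_flag.load(Ordering::Relaxed) {
        test_fn(); // one test execution
        completed += 1;
    }
    completed
}
```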

  3. Test Input and Result

The TestInput structure can remain as it is, without the current Bencher, simplifying the design.

struct TestResult {
    pub test_id: usize,
    pub step_id: usize,
    pub tx_hash: Option<H256>,
    pub tx_tracker: TransactionTracker,
    pub err: Option<anyhow::Error>,
}
  4. TransactionTracker

Instead of the existing Bencher, a TransactionTracker can be introduced to provide a clearer API. The current API has the potential for errors if start is forgotten, leading to incorrect results.

The new method should automatically set the submission time to ensure correct tracking without requiring manual intervention.

struct TransactionTracker {
    submission_time: Instant,
    mempool_time: Option<Instant>,
    block_time: Option<Instant>,
}

impl TransactionTracker {
    fn new() -> Self;
    fn mark_mempool(&mut self);
    fn mark_block(&mut self);
    fn get_mempool_latency(&self) -> Option<Duration>;
    fn get_block_latency(&self) -> Option<Duration>;
}
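A std-only sketch of how those signatures might be filled in (illustrative semantics: new records the submission time automatically, as described above, so there is no start call to forget):

```rust
use std::time::{Duration, Instant};

pub struct TransactionTracker {
    submission_time: Instant,
    mempool_time: Option<Instant>,
    block_time: Option<Instant>,
}

impl TransactionTracker {
    // Submission time is captured on construction, eliminating the
    // forgotten-`start` failure mode of the current Bencher.
    pub fn new() -> Self {
        Self {
            submission_time: Instant::now(),
            mempool_time: None,
            block_time: None,
        }
    }

    pub fn mark_mempool(&mut self) {
        self.mempool_time = Some(Instant::now());
    }

    pub fn mark_block(&mut self) {
        self.block_time = Some(Instant::now());
    }

    pub fn get_mempool_latency(&self) -> Option<Duration> {
        self.mempool_time
            .map(|t| t.duration_since(self.submission_time))
    }

    pub fn get_block_latency(&self) -> Option<Duration> {
        self.block_time
            .map(|t| t.duration_since(self.submission_time))
    }
}
```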
  5. StepResult

The StepResult should pre-calculate average latencies and other useful statistics for each step, making it equivalent to the current StepSummary.

struct StepResult {
    step_id: usize,
    tests: Vec<TestResult>,
    avg_mempool_latency: Duration,
    avg_block_latency: Duration,
    // Additional useful statistics for the step
}

impl StepResult {
    fn new(results: Vec<TestResult>) -> Self;
}
  6. Execution Engine

The execution engine should support concurrent execution of the test function for a specified duration, allowing precise control over execution time.

fn run_concurrent<F>(concurrency: usize, run_duration: Duration, test_fn: F, stop_flag: Arc<AtomicBool>)
where
    F: Fn(TestInput) -> TestResult + Send + Sync + 'static;

The stop_flag ensures the execution stops when the total benchmark duration is reached or when other termination conditions occur.

  7. BenchmarkResult

The BenchmarkResult serves as the overall execution summary, aggregating results from all benchmark steps.

struct BenchmarkResult {
    steps: Vec<StepResult>,
}

Conclusion

This revised design primarily improves terminology and clarity, making the framework more cohesive and intuitive. The key benefits of the proposed approach include:

Encapsulation: Each BenchmarkStep holds its own test function, making it easier to run varied tests within a single benchmark.

Clarity: Replacing Bencher with TransactionTracker simplifies the API and eliminates potential misuses.

Intuitive Structure: The separation of responsibilities across BenchmarkRunner, BenchmarkStep, and TransactionTracker makes the design easier to understand and maintain.

Overall, this proposal aligns closely with the current design but improves cohesion, intuitiveness, and robustness.

@karlem (Contributor) left a comment

This is the first major review batch (1/2). Tomorrow, a smaller set of reviews will follow.

Outstanding reviews:

  • The tests in benches.rs
  • Thoroughly review summary.rs



let step_results = Arc::new(tokio::sync::Mutex::new(Vec::new()));
let execution_start = Instant::now();
loop {
if execution_start.elapsed() > step.duration {
Contributor

maybe?

while execution_start.elapsed() < step.duration {

.await
.unwrap();
tx = tx.gas(gas_estimation);
assert!(gas_estimation <= max_tx_gas_limit);
Contributor

Should this fail the whole test run?

Contributor Author

I don't see a reason for allowing this to pass and then having to deal with partial failures. However, this shouldn't normally fail, and if it does, it should fail consistently.

}
input.bencher.mempool();

let receipt = pending
Contributor

Maybe we should add a timeout here in case it isn't included?

Contributor Author

Yes, will be added.
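One possible shape for such a timeout, sketched here as a std-only polling loop with a deadline (the real code would more likely wrap the pending-transaction await in tokio::time::timeout; the poll closure and the u64 receipt are hypothetical stand-ins):

```rust
use std::time::{Duration, Instant};

// Poll for the receipt until it appears or the deadline passes, instead of
// waiting forever on a transaction that is never included.
pub fn wait_for_receipt<F>(mut poll: F, timeout: Duration) -> Option<u64>
where
    F: FnMut() -> Option<u64>, // stand-in for querying the receipt
{
    let deadline = Instant::now() + timeout;
    loop {
        if let Some(receipt) = poll() {
            return Some(receipt);
        }
        if Instant::now() >= deadline {
            return None; // transaction not included in time
        }
    }
}
```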

@karlem (Contributor) commented Jan 22, 2025

Both reviews (2/2) are now complete. That should be all for now :)

// Copyright 2022-2024 Protocol Labs
// SPDX-License-Identifier: Apache-2.0, MIT

pub struct Signal(tokio::sync::Semaphore);
Contributor

Uses a loom::sync::Mutex internally and is hence not signal safe, since pthread_mutex_lock is not signal safe. Please correct me if I am wrong.


I was just browsing the code base, but I'm curious what your concern is here, i.e. what signal safety means in this context.

It is a generally working pattern to hold blocking locks that guard short sections executed in an async context. Blocking primitives are used consistently in the tokio codebase, one example being https://github.com/tokio-rs/tokio/blob/4b3da20c9847b202cf110f7b7772fd4674edaecf/tokio/src/sync/barrier.rs#L142-L148, and there is some info here https://tokio.rs/tokio/tutorial/shared-state under the "Tasks, threads, and contention" paragraph.

Specifically, in the semaphore the lock guards a section that doesn't yield by itself and should be very fast to complete (I expect sub-1us), so preemption by the OS is very unlikely.

That said, there is actually no waiting in this wrapper.


I guess you meant that if this is used directly in the signal interrupt handler, then it doesn't protect from re-entrancy.

Contributor

I think Signal needs some more context. My assumption from a single pass over the code was that it handles UNIX signals.

Contributor Author

As @dshulyak mentioned, there's no actual scheduler waiting here (as initially planned), so introducing this primitive to wrap Semaphore turned out to be confusing and overkill. I downgraded it to a simple AtomicBool wrapper here: d00f2e1
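A sketch of what such an AtomicBool wrapper could look like (assumed shape, not necessarily the code in d00f2e1):

```rust
use std::sync::atomic::{AtomicBool, Ordering};

// A one-shot flag: `send` marks the signal as fired, `received` checks it.
// No scheduler waiting is involved, unlike the earlier Semaphore wrapper.
pub struct Signal(AtomicBool);

impl Signal {
    pub fn new() -> Self {
        Signal(AtomicBool::new(false))
    }

    pub fn send(&self) {
        self.0.store(true, Ordering::SeqCst);
    }

    pub fn received(&self) -> bool {
        self.0.load(Ordering::SeqCst)
    }
}
```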

@LePremierHomme LePremierHomme linked an issue Feb 10, 2025 that may be closed by this pull request
@drahnr (Contributor) left a comment

Reviewable status: 0 of 27 files reviewed, 34 unresolved discussions (waiting on @cryptoAtwill, @dshulyak, @karlem, @LePremierHomme, and @raulk)


fendermint/testing/materializer/src/concurrency/reporting/dataset.rs line 43 at r6 (raw file):

    let median = if count % 2 == 0 {
        (sorted_data[count / 2 - 1] + sorted_data[count / 2]) / 2.0

Nit: that's not a median, pick one, I'd argue the else branch is all we need
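For reference, both conventions under discussion, sketched over an already-sorted slice: the even-count branch in the PR interpolates the two middle elements, while the else branch alone always picks a single existing element (the lower median for even counts).

```rust
// Interpolated median: averages the two middle elements for an even count.
fn interpolated_median(sorted: &[f64]) -> f64 {
    let n = sorted.len();
    if n % 2 == 0 {
        (sorted[n / 2 - 1] + sorted[n / 2]) / 2.0
    } else {
        sorted[n / 2]
    }
}

// Lower median: always returns an element actually present in the data.
fn lower_median(sorted: &[f64]) -> f64 {
    sorted[(sorted.len() - 1) / 2]
}
```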


fendermint/testing/materializer/src/concurrency/signal.rs line 4 at r6 (raw file):

// SPDX-License-Identifier: Apache-2.0, MIT

pub struct Signal(tokio::sync::Semaphore);

A better name and some documentation would be great


fendermint/testing/materializer/tests/docker_tests/benches.rs line 248 at r6 (raw file):

            deploy_tx.set_gas(gas_estimation);
            assert!(gas_estimation <= max_tx_gas_limit);

The setup code deserves a few comments


fendermint/testing/materializer/src/concurrency/reporting/summary.rs line 59 at r6 (raw file):

    }

    pub fn print(&self) {

Nit: move to std::fmt::Display implementation

@drahnr (Contributor) left a comment

Reviewable status: 0 of 28 files reviewed, 34 unresolved discussions (waiting on @cryptoAtwill, @karlem, @LePremierHomme, and @raulk)



Development

Successfully merging this pull request may close these issues.

Single-node benchmarking
6 participants