Allow Custom Data in Replay Files #78

Closed
marcelotrevisani opened this issue Feb 3, 2025 · 8 comments · Fixed by #82

Comments

@marcelotrevisani
Contributor

marcelotrevisani commented Feb 3, 2025

Description

Hey folks! 👋

I have a use case where I need to add extra data to the replay file so I can properly reproduce my tests. One specific example is when a test involves random number generation—I need to store the seed that was used. Since each test might have a different seed, this would be super useful to ensure reproducibility.

Proposal

It would be great if pytest-replay allowed users to store custom data in the replay file. This could be done via a fixture or some other mechanism that lets users define extra metadata when a test runs. Then, when replaying, this data would be available to ensure the test environment is correctly reconstructed.

Implementation Idea

  • Introduce a way to let users attach custom metadata, maybe via a fixture, a custom hook, or some way to store certain fixture values (see the sketch after this list).
  • Store this data alongside the standard replay info.
  • When replaying, load the metadata so tests can use it.
  • This could be useful for things beyond random seeds too!
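
To make the hook idea above a bit more concrete, here is a purely illustrative sketch; the hook name and signature below are invented for this example and do not exist in pytest-replay:

# conftest.py -- illustrative only; neither this hook name nor its signature
# exists in pytest-replay, it just sketches the "custom hook" idea above.
import random


def pytest_replay_collect_metadata(item):
    # Hypothetical hook: return a JSON-serializable dict that pytest-replay
    # would store alongside the standard replay info for this test item.
    return {"seed": random.randint(0, 9_999_999_999)}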

Limitations

I know there are some edge cases, like when a core dump happens before the data is written. But even with that limitation, I think this would be a really nice addition for a lot of users (at least for me! 😄).

Happy to Contribute 🚀

If this sounds like a useful feature, I’d be happy to create a PR and implement it! Let me know what you think. 😊


cc: @tadeu @nicoddemus @prusse-martin

@nicoddemus
Member

Hi @marcelotrevisani!

Thanks for the detailed description.

I want to understand the requirement better though. From my point of view, the test environment should always be reproducible, regardless of whether pytest-replay is used, so I'm a bit skeptical that pytest-replay itself needs to support storing extra metadata.

In the example you gave, seeds in tests are usually fixed so we always get the same result, so I don't quite see how this would fit here. Can you elaborate on your example a bit?

@marcelotrevisani
Contributor Author

marcelotrevisani commented Feb 3, 2025

Hi @nicoddemus, thanks for your reply! Sure, I'm happy to explain a bit more.

We use a mix of stochastic testing, Monte Carlo testing, and property-based testing. Since it's virtually impossible to cover all possible scenarios, we use a random seed so that each run generates a new set of test inputs. This helps us explore a wider input space over multiple runs while still maintaining some level of control. The key point is that our tests are not designed to be fully reproducible in the traditional sense (always running with the same fixed inputs). Instead, we prioritize broader coverage by allowing different runs to explore different edge cases. That said, if reproducibility is needed for debugging, we can reuse a specific seed value, ensuring the same test case can be re-executed when necessary.

If you need more explanation, please let me know :)

I know it is a bit of a different scenario, but especially for property-based testing (like Hypothesis) this might help other users as well.

@nicoddemus
Member

Thanks! Things are a bit clearer now.

One thing that is not yet clear to me is how you would use this functionality. Can you write a simple example of your use case, assuming we have this feature? For the sake of this exercise, assume there is a pytest_replay_metadata function-scoped fixture with a .metadata attribute: a dict that is loaded/dumped as JSON for each test. How would you use that in your scenario (including example code if you can)?
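
(A rough plugin-side sketch of what such a fixture could look like, assuming the metadata is kept as a JSON-serializable dict keyed by node id; this is not pytest-replay's actual implementation, and the storage details are invented for illustration.)

# Rough plugin-side sketch of the hypothetical pytest_replay_metadata fixture
# described above; storage details are invented for illustration only.
import dataclasses
import json

import pytest


@dataclasses.dataclass
class ReplayMetadata:
    metadata: dict  # free-form, JSON-serializable data for the current test


@pytest.fixture
def pytest_replay_metadata(request):
    # When replaying, pre-load whatever was recorded for this test; here the
    # previously recorded data is assumed to live on the config object.
    recorded = getattr(request.config, "_replay_recorded_metadata", {})
    meta = ReplayMetadata(metadata=dict(recorded.get(request.node.nodeid, {})))
    yield meta
    # After the test, round-trip through JSON so whatever the test stored can
    # be written alongside the standard replay info.
    recorded[request.node.nodeid] = json.loads(json.dumps(meta.metadata))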

@marcelotrevisani
Contributor Author

marcelotrevisani commented Feb 3, 2025

Yes, sure, no worries!

I thought of two ways to implement/use this; either one is fine for me:

  • Using a hook (or custom hook) that runs after the tests and fixtures are collected, so I can use it to control my tests and fixtures and set up the seed.

  • A function-scoped fixture to add and recover metadata

I believe the fixture solution might be more generic.

As an example with what you described, I would use it something like this: let's say I have the following fixture that generates the random state for me (I am using NumPy here just as an example):

import random

import pytest
import numpy as np


@pytest.fixture
def rng(pytest_replay_metadata, request):
    current_test = request.node.nodeid
    current_seed = pytest_replay_metadata.metadata.get(current_test, {}).get("seed", None)

    if current_seed is None:
        current_seed = random.randint(0, 9_999_999_999)
        # Store the seed under the same key used for the lookup above.
        pytest_replay_metadata.metadata[current_test] = {"seed": current_seed}

    return np.random.default_rng(seed=current_seed)


def test_rand_val(rng):
    assert run_model_foo(rng.standard_normal((1_000_000, 1_000_000))) < 0.3

That is just a toy example; run_model_foo would be a function that receives my input, does some calculations, and returns a value.
That is quite simple, but if you need something more elaborate, please let me know.

@nicoddemus
Member

Hmm yeah OK, I think that clarifies things.

If you would like to contribute that, please go ahead.

One change from the above example though: pytest_replay_metadata.metadata would already contain only the metadata for that test, so there's no need to access it via the current test's node id:

import random

import pytest
import numpy as np


@pytest.fixture
def rng(pytest_replay_metadata):
    current_seed = pytest_replay_metadata.metadata.get("seed", None)
    
    if current_seed is None:
        current_seed = random.randint(0, 9_999_999_999)
        pytest_replay_metadata.metadata["seed"] = current_seed
    
    return np.random.default_rng(seed=current_seed)


def test_rand_val(rng):
    assert run_model_foo(rng.standard_normal((1_000_000, 1_000_000))) < 0.3

@marcelotrevisani
Contributor Author

Perfect! That works even better for me. I'll try to implement this feature in the coming days.

Thanks for your input!

@marcelotrevisani
Contributor Author

PR created!
#82
😃

nicoddemus added a commit that referenced this issue Feb 5, 2025
New `replay_metadata` fixture letting the user associate extra metadata with each test.

This allows users to attach extra information for reproducibility in some cases, for example when RNG is involved.

Fixes #78

---------

Co-authored-by: Bruno Oliveira <bruno@soliv.dev>
@nicoddemus
Member

1.6.0 released, thanks again @marcelotrevisani!
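
For anyone landing here later, usage with the released fixture looks roughly like the example above, just with the final fixture name from the merge commit (assuming it keeps the dict-style .metadata attribute discussed in this thread):

import random

import numpy as np
import pytest


@pytest.fixture
def rng(replay_metadata):
    # Reuse the seed recorded in the replay file if present; otherwise
    # generate a new one and record it for future replays.
    seed = replay_metadata.metadata.get("seed")
    if seed is None:
        seed = random.randint(0, 9_999_999_999)
        replay_metadata.metadata["seed"] = seed
    return np.random.default_rng(seed=seed)

Recording a run with --replay-record-dir and then re-running it with --replay should then reuse the stored seed.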
