Transformation that converts design matrix into records #2847

xjules · 2022-02-04T11:27:16Z

When working on doe (design of experiment) it is currently quite cumbersome to read the parameters from the design matrix file (eg. by means of an external job) and then create parameter json representation (one by one) in order to load them later on explicitly as records. Therefore it would be meaningful to have a transformation that reads such a design matrix csv file and does the parameter records automatically.

berland · 2022-02-04T11:50:30Z

For reference, this is the corresponding code for ERT2: https://github.com/equinor/semeio/blob/main/semeio/jobs/design2params/design2params.py

jondequinor · 2022-03-16T08:42:10Z

Transformation multiplexing/singleplexing

There's clearly a need for multiplexing transformations, i.e. transformations that create multiple records from one file, or vice-versa—or both. Transformations already do singleplexing with from_record and to_record. Multiplexing would introduce to_records and from_records.

This increases the complexity of the transformation API. So, to make this livable, we need very strict rules for how *plexing is dealt with.

E.g.

SerializationTransformation does not currently allow any multiplexing.
CopyTransformation is singleplexing only.
DesignMatrixTransformation allows multiplexing and singleplexing.
EclSumTransformation will only allow one-directional singleplexing, because it produces a "single" record tree from file to record only.

After configuration and creation on the DesignMatrixTransformation instance, it should be decided what kind of *plexing the instance will do. So some form of rule or heuristic would exist in the instance factory. The point is, consumers of the transformation shouldn't have to guess, and it should fail immediately if something is unclear/unsupported.

Further, I initially thought that the interface would be something like this:

async def to_record(self, root_path = Path()) -> Record:

async def to_records(self, root_path = Path()) -> RecordCollection:

but a significant usage of design matrices is to create only one group (which we can call design_matrix). This group is only part of the parameters that is to be defined—other parameters might come from a stochastic source.

So RecordCollection is a too generic, too dumb data model for this. A RecordCollection should be able to support

selecting a subset based on grouping (in the parameter dimension)
be built iteratively, meaning multiple sources can constitute a record

For DOE there's also not vectors, but scalars, so #2934 blocks this.

TBC…

eivindjahren · 2022-09-15T13:23:17Z

Closing as related to ert3, which is no longer the direction taken by the project. Feel free to reopen if still relevant.

sondreso · 2022-09-16T08:31:02Z

This one relates to code in the ert.data package and is still relevant

sondreso · 2023-02-07T09:02:42Z

Closing in favor of #4656

xjules added enhancement ert3 labels Feb 4, 2022

xjules self-assigned this Feb 4, 2022

xjules mentioned this issue Feb 4, 2022

Add TabularDataTransformation #2850

Closed

1 task

jondequinor self-assigned this Mar 16, 2022

sondreso unassigned xjules and jondequinor Mar 17, 2022

eivindjahren closed this as completed Sep 15, 2022

sondreso reopened this Sep 16, 2022

eivindjahren removed the ert3 label Sep 16, 2022

sondreso closed this as completed Feb 7, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Transformation that converts design matrix into records #2847

Transformation that converts design matrix into records #2847

xjules commented Feb 4, 2022

berland commented Feb 4, 2022

jondequinor commented Mar 16, 2022 •

edited

Loading

eivindjahren commented Sep 15, 2022

sondreso commented Sep 16, 2022

sondreso commented Feb 7, 2023

Transformation that converts design matrix into records #2847

Transformation that converts design matrix into records #2847

Comments

xjules commented Feb 4, 2022

berland commented Feb 4, 2022

jondequinor commented Mar 16, 2022 • edited Loading

eivindjahren commented Sep 15, 2022

sondreso commented Sep 16, 2022

sondreso commented Feb 7, 2023

jondequinor commented Mar 16, 2022 •

edited

Loading