feat: GraphTransformerProcessor chunking #66
base: main
Conversation
Hi Jan, thank you for adding this, very nice. Looking at the code, it seems that some parts do the same thing but look slightly different. I was wondering if this would also be a good opportunity to reduce code duplication between GraphTransformerProcessorBlock and GraphTransformerMapperBlock, maybe encapsulated in a common routine? I think the differences are the dimensions of x_skip.
I moved the common "attention" part to the GraphTransformerBaseBlock.
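For reference, a minimal sketch of what such a shared routine could look like. GraphTransformerBaseBlock is the class named in the comment above; everything else (the attend method, its signature, and the use of plain nn.MultiheadAttention as a stand-in for the edge-aware graph attention) is illustrative, not the actual anemoi-models code.

```python
import torch
from torch import nn


class GraphTransformerBaseBlock(nn.Module):
    """Sketch of a base block holding the shared "attention" part.

    A plain nn.MultiheadAttention stands in for the real edge-aware
    multi-head graph attention to keep the sketch short.
    """

    def __init__(self, num_channels: int, num_heads: int) -> None:
        super().__init__()
        self.attention = nn.MultiheadAttention(num_channels, num_heads, batch_first=True)

    def attend(self, query: torch.Tensor, key_value: torch.Tensor) -> torch.Tensor:
        # Shared routine: both subclasses call this instead of
        # duplicating the attention computation.
        out, _ = self.attention(query, key_value, key_value)
        return out


class GraphTransformerProcessorBlock(GraphTransformerBaseBlock):
    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # Processor: self-attention, so x_skip is the single input tensor.
        x_skip = x
        return x_skip + self.attend(x, x)


class GraphTransformerMapperBlock(GraphTransformerBaseBlock):
    def forward(self, x: tuple[torch.Tensor, torch.Tensor]) -> torch.Tensor:
        # Mapper: x is a (source, destination) pair, so x_skip has a
        # different dimension -- the difference the comment above notes.
        x_src, x_dst = x
        x_skip = x_dst
        return x_skip + self.attend(x_dst, x_src)
```

Keeping only the x_skip handling in the subclasses is one way to make the dimensionality difference explicit while the attention itself lives in one place.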
What's the connection between this PR and the "shard everything" one?
There is none; I can make it ready for merge.
Describe your changes
This PR adds chunking for the GraphTransformerProcessorBlock to reduce memory usage during inference. The functionality is equivalent to the GraphTransformerMapperBlock chunking and uses the same environment variable, ANEMOI_INFERENCE_NUM_CHUNKS, to control the chunking behaviour; see the sketch below.
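A minimal sketch of the chunking behaviour under these assumptions: ANEMOI_INFERENCE_NUM_CHUNKS comes from the PR description, while run_chunked and block_fn are hypothetical names used only for illustration, not the PR's actual API.

```python
import os

import torch


def run_chunked(block_fn, x: torch.Tensor) -> torch.Tensor:
    """Apply `block_fn` to `x` in chunks along the node dimension.

    Hypothetical helper: reading ANEMOI_INFERENCE_NUM_CHUNKS mirrors the
    mapper-block behaviour described in the PR. One chunk (the default)
    means no chunking; larger values lower peak memory at inference time
    by keeping only one chunk's intermediate activations alive at once.
    """
    num_chunks = int(os.environ.get("ANEMOI_INFERENCE_NUM_CHUNKS", "1"))
    if num_chunks <= 1:
        return block_fn(x)
    # Process each slice of nodes independently and re-concatenate.
    chunks = torch.tensor_split(x, num_chunks, dim=0)
    return torch.cat([block_fn(chunk) for chunk in chunks], dim=0)
```

Setting, e.g., ANEMOI_INFERENCE_NUM_CHUNKS=4 in the environment before running inference would then split the computation into four sequential chunks, bounding the peak activation memory at roughly a quarter of the unchunked case.

Type of change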
Checklist before requesting a review
Tag possible reviewers
@ssmmnn11 @gabrieloks