
Convert aten.sum.dim_IntList to ttnn.sum #264

Open · jdh8 wants to merge 15 commits into main from feature/sum

Conversation

jdh8 (Contributor) commented Sep 25, 2024

Ticket

Problem description

To support aten.sum.dim_IntList, we need ttnn.sum to support a tuple of dims. I'm investigating how to patch the kernel.
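As a first cut, the lowering can call ttnn.sum once per requested dim. A minimal sketch of that idea (the helper name is mine, not code from this PR), using the same g.call_function style as the graph-rewrite code below:

def lower_sum_dim_intlist(g, tensor, dims):
    # Reduce the highest dim first so the remaining dim indices stay
    # valid even if each ttnn.sum collapses the dim it reduces.
    for dim in sorted(dims, reverse=True):
        tensor = g.call_function(ttnn.sum, (tensor, dim))
    return tensor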

What's changed

@jdh8 jdh8 force-pushed the feature/sum branch 2 times, most recently from 680876a to 79ffb55 Compare November 8, 2024 18:00
@jdh8 jdh8 self-assigned this Nov 9, 2024
@jdh8 jdh8 added the conversion label Nov 9, 2024
@jdh8 jdh8 marked this pull request as ready for review November 9, 2024 09:05
@jdh8 jdh8 added this pull request to the merge queue Nov 9, 2024
@github-merge-queue github-merge-queue bot removed this pull request from the merge queue due to failed status checks Nov 9, 2024
jdh8 (Contributor, Author) commented Nov 9, 2024

I have no idea about the failing test case.

@jdh8 jdh8 added the help wanted label Nov 9, 2024
((1024, 640), (0,)),
((14, 2048), (0,)),
((14, 512), (0,)),
((16384, 128), (0,)),
jdh8 (Contributor, Author) commented Nov 10, 2024

Found a mild PCC failure locally for ((16384, 128), (0,)):

expected_pytorch_result = tensor([[  53.5000,  -96.0000, -113.5000, -139.0000,  -72.5000,   39.2500,
         -159.0000,  -13.6875,  -87.0000,  ...187.0000, -130.0000,  -23.7500,  -18.7500,  -25.2500, -132.0000,
          -22.0000,   87.5000]], dtype=torch.bfloat16)
actual_pytorch_result = TorchTensor([[  63.0000, -111.5000, -129.0000, -142.0000,  -74.0000,   59.2500,
              -153.0000,  -14.1875,  -...000, -137.0000,   -5.2188,  -18.7500,  -34.0000, -153.0000,
               -15.6250,  102.5000]], dtype=torch.bfloat16)
pcc = 0.999

    def assert_with_pcc(expected_pytorch_result, actual_pytorch_result, pcc=0.999):
        assert list(expected_pytorch_result.shape) == list(
            actual_pytorch_result.shape
        ), f"list(expected_pytorch_result.shape)={list(expected_pytorch_result.shape)} vs list(actual_pytorch_result.shape)={list(actual_pytorch_result.shape)}"
        pcc_passed, pcc_message = comp_pcc(expected_pytorch_result, actual_pytorch_result, pcc)
>       assert pcc_passed, construct_pcc_assert_message(pcc_message, expected_pytorch_result, actual_pytorch_result)
E       AssertionError: 0.9923741703805607

tests/utils.py:219: AssertionError


PCC accuracy declines as tensors grow because rounding differences accumulate over the reduction. Would it make sense to lower the threshold for tensors where a dimension exceeds 10k?
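For context, PCC here is the Pearson correlation coefficient between the expected and actual outputs. A minimal sketch of the metric, assuming comp_pcc computes something equivalent:

import torch

def pearson_cc(expected: torch.Tensor, actual: torch.Tensor) -> float:
    # Flatten and upcast so the metric itself adds no rounding error.
    x = expected.flatten().to(torch.float64)
    y = actual.flatten().to(torch.float64)
    x, y = x - x.mean(), y - y.mean()
    return float((x @ y) / (x.norm() * y.norm()))

With bfloat16's roughly 8-bit mantissa, naive summation error grows with the reduction length, so values just below 0.999 on 16384-element reductions are plausibly precision loss rather than a logic bug.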

((14, 2048), (0,)),
((14, 512), (0,)),
((16384, 128), (0,)),
((16384, 32), (0,)),
jdh8 (Contributor, Author) commented Nov 10, 2024

Found a mild PCC failure locally for ((16384, 32), (0,)):

expected_pytorch_result = tensor([[-156.0000, -105.0000, -136.0000,   71.0000,  -59.5000, -163.0000,
          -26.1250,  -26.2500,  -67.0000,  ... 10.8750, -237.0000, -108.0000,  -38.2500, -107.5000,   47.2500,
          -41.2500,  -85.0000]], dtype=torch.bfloat16)
actual_pytorch_result = TorchTensor([[-185.0000, -127.5000, -161.0000,   93.5000,  -63.2500, -199.0000,
               -40.0000,  -44.2500,  -...000, -280.0000, -116.5000,  -36.0000, -147.0000,   76.0000,
               -37.7500, -104.0000]], dtype=torch.bfloat16)
pcc = 0.999

    def assert_with_pcc(expected_pytorch_result, actual_pytorch_result, pcc=0.999):
        assert list(expected_pytorch_result.shape) == list(
            actual_pytorch_result.shape
        ), f"list(expected_pytorch_result.shape)={list(expected_pytorch_result.shape)} vs list(actual_pytorch_result.shape)={list(actual_pytorch_result.shape)}"
        pcc_passed, pcc_message = comp_pcc(expected_pytorch_result, actual_pytorch_result, pcc)
>       assert pcc_passed, construct_pcc_assert_message(pcc_message, expected_pytorch_result, actual_pytorch_result)
E       AssertionError: 0.993693118840562

tests/utils.py:219: AssertionError

# Workaround: ttnn.sum cannot reduce dimension 0 directly, so insert a
# leading dimension, reduce dimension 1 instead, then squeeze it away.
tensor = g.call_function(ttnn.to_layout, (tensor, TtnnRowMajorLayout()))
tensor = g.call_function(ttnn.unsqueeze, (tensor, 0))
tensor = g.call_function(ttnn.sum, (tensor, 1))
tensor = g.call_function(ttnn.squeeze, (tensor, 0))
jdh8 (Contributor, Author) commented:

Summing along dimension 0 seems to be unsupported, so I added this workaround.

@jdh8 jdh8 added this pull request to the merge queue Nov 12, 2024
@github-merge-queue github-merge-queue bot removed this pull request from the merge queue due to failed status checks Nov 12, 2024
jdh8 added 11 commits December 27, 2024 19:43
Our current use cases reduce at most 2 dimensions, so I'm using this naive
algorithm. If the loop becomes a bottleneck, consider the following algorithm,
which produces O(1) ops (see the sketch after this list):

1. Transpose the tensor to group together dimensions to reduce
2. Reshape the tensor to merge dimensions to reduce
3. Reduce the only dimension
4. Unsqueeze/reshape if keepdims=True
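
For illustration, a sketch of that O(1)-op algorithm in plain PyTorch (sum_dims_merged is a hypothetical helper, not code from this PR; a ttnn lowering would emit the analogous transpose/reshape/sum graph calls):

import torch

def sum_dims_merged(t: torch.Tensor, dims, keepdim=False):
    dims = sorted(d % t.ndim for d in dims)
    kept = [d for d in range(t.ndim) if d not in dims]
    t = t.permute(kept + dims)        # 1. group the dims to reduce at the end
    t = t.reshape([t.shape[i] for i in range(len(kept))] + [-1])  # 2. merge them
    t = t.sum(-1)                     # 3. reduce the single merged dimension
    if keepdim:                       # 4. reinsert size-1 dims if keepdims=True
        for d in dims:
            t = t.unsqueeze(d)
    return t

x = torch.randn(2, 3, 4)
assert torch.allclose(sum_dims_merged(x, (0, 2)), x.sum(dim=(0, 2)))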
…eezing at arbitrary axis is unsupported yet
I doubt this makes a difference in Python, though
@jdh8 jdh8 enabled auto-merge December 27, 2024 20:08
@jdh8 jdh8 added this pull request to the merge queue Dec 27, 2024
@github-merge-queue github-merge-queue bot removed this pull request from the merge queue due to failed status checks Dec 27, 2024
@jdh8 jdh8 removed the help wanted label Dec 27, 2024
@jdh8 jdh8 added this pull request to the merge queue Dec 27, 2024
@github-merge-queue github-merge-queue bot removed this pull request from the merge queue due to failed status checks Dec 27, 2024
@jdh8 jdh8 added this pull request to the merge queue Dec 27, 2024
@github-merge-queue github-merge-queue bot removed this pull request from the merge queue due to failed status checks Dec 27, 2024
Successfully merging this pull request may close these issues: aten.sum.dim_IntList
3 participants