Fix the execution phase and stage recording issue and replace compare_with_golden with verify function #1358

chandrasekaranpradeep · 2025-03-04T07:39:15Z

Problem Description

The execution phase and stage are recorded incorrectly whenever test cases contain post-processing code that reruns the compiled model.

Example:

Test Cases	Execution Phase	Execution Stage
forge/test/models/pytorch/vision/resnet/test_resnet.py::test_resnet_hf[microsoft/resnet-50]	EXECUTED_TTNN	EXECUTED_TTNN
forge/test/models/pytorch/vision/resnet/test_resnet.py::test_resnet_timm	PASSED	VERIFICATION

In the table above, both test cases passed verification, but the execution phase and stage for the forge/test/models/pytorch/vision/resnet/test_resnet.py::test_resnet_hf[microsoft/resnet-50] test case are incorrect. The execution phase should be PASSED, and the execution stage should be VERIFICATION.

This occurs because the test case contains a post-processing function (run_and_print_results) that reruns the compiled model, which updates the execution phase from PASSED to EXECUTED_TTNN and the execution stage from VERIFICATION to EXECUTED_TTNN.

On the other hand, another ResNet test case, forge/test/models/pytorch/vision/resnet/test_resnet.py::test_resnet_timm, does not contain post-processing, so the execution phase and stage remain unchanged.

Issue Origin

Execution depth (EXECUTED_TTNN) is recorded inside the CompiledModel class in forge/forge/compiled_graph_state.py, specifically in the call method, at this line.

Whenever the compiled model is rerun, it updates the execution depth again.

Reason for Current Implementation

Some test cases still use the compare_with_golden function for verification instead of verify. Test cases using compare_with_golden may run the compiled model anywhere inside the test function. To ensure execution depth tracking, EXECUTED_TTNN was recorded inside the call method of the CompiledModel class.

Proposed Fix

To prevent incorrect updates of execution phase and stage due to post-processing:

Record execution depth (EXECUTED_TTNN) inside the verify function rather than inside the call method in the CompiledModel class.
Replace compare_with_golden with verify in test cases.

Benefits of This Change

Prevents Depth Changes During Post-Processing
- Currently, execution depth can change from PASSED → EXECUTED_TTNN if post-processing code calls the compiled model.
- By recording EXECUTED_TTNN within verify, execution depth remains correct.
Ensures Proper Depth Handling Across Loops (e.g., Epochs)
- If compare_with_golden is replaced with verify in forge/test/mlir/mnist/training/test_training.py, consider this scenario:
  - Epoch 1: Verification passes, setting execution depth to PASSED.
  - Epoch 2: Data mismatch occurs. By recording EXECUTED_TTNN inside verify, execution depth is tracked correctly per epoch, allowing proper rollback if mismatches occur later.

Note:

@vkovinicTT replaced the compare_with_golden with verify function for the forge/test/mlir/mnist/training/test_training.py in this PR. So I haven't replaced it for forge/test/mlir/mnist/training/test_training.py file

Llama Decode with cache:

The to_pt_tensors function was modified to accept the List/tuple of tensor in this commit but in the llama decode with cache test will pass the List[input_ids, attention_mask, postion_ids, past_key_values] as input, here the past_key_values is list of tuple of tensor, so modified the llama decode with cache test to accept as list of tensor and update the llama decode implementation to process past key values as list of tensor, not as list of tuple of tensors.

github-actions · 2025-03-04T08:15:20Z

	Tests	Passed ✅	Skipped ⚠️	Failed
TT-Forge-FE Tests	624 ran	483 passed	141 skipped	0 failed

Test	Result
No test annotations available

github-actions · 2025-03-04T08:18:30Z

	Tests	Passed ✅	Skipped ⚠️	Failed
TT-Forge-FE Tests	683 ran	538 passed	145 skipped	0 failed

Test	Result
No test annotations available

github-actions · 2025-03-04T08:19:21Z

	Tests	Passed ✅	Skipped ⚠️	Failed
TT-Forge-FE Tests	683 ran	538 passed	145 skipped	0 failed

Test	Result
No test annotations available

github-actions · 2025-03-04T08:21:02Z

	Tests	Passed ✅	Skipped ⚠️	Failed
TT-Forge-FE Tests	624 ran	483 passed	141 skipped	0 failed

Test	Result
No test annotations available

forge/test/mlir/llama/tests/test_llama_decode.py

github-actions · 2025-03-04T13:27:06Z

	Tests	Passed ✅	Skipped ⚠️	Failed
TT-Forge-FE Tests	624 ran	483 passed	141 skipped	0 failed

Test	Result
No test annotations available

github-actions · 2025-03-04T13:32:09Z

	Tests	Passed ✅	Skipped ⚠️	Failed
TT-Forge-FE Tests	624 ran	483 passed	141 skipped	0 failed

Test	Result
No test annotations available

github-actions · 2025-03-04T13:32:21Z

	Tests	Passed ✅	Skipped ⚠️	Failed
TT-Forge-FE Tests	683 ran	538 passed	145 skipped	0 failed

Test	Result
No test annotations available

github-actions · 2025-03-04T13:44:05Z

	Tests	Passed ✅	Skipped ⚠️	Failed
TT-Forge-FE Tests	683 ran	538 passed	145 skipped	0 failed

Test	Result
No test annotations available

codecov-commenter · 2025-03-06T09:13:32Z

Codecov Report

All modified and coverable lines are covered by tests ✅

Project coverage is 43.40%. Comparing base (754b717) to head (d2b1f0d).

Additional details and impacted files

@@           Coverage Diff           @@
##             main    #1358   +/-   ##
=======================================
  Coverage   43.40%   43.40%           
=======================================
  Files          48       48           
  Lines        7860     7860           
=======================================
  Hits         3412     3412           
  Misses       4448     4448

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

github-actions · 2025-03-06T09:45:35Z

	Tests	Passed ✅	Skipped ⚠️	Failed
TT-Forge-FE Tests	684 ran	541 passed	143 skipped	0 failed

Test	Result
No test annotations available

github-actions · 2025-03-06T09:48:33Z

	Tests	Passed ✅	Skipped ⚠️	Failed
TT-Forge-FE Tests	626 ran	483 passed	143 skipped	0 failed

Test	Result
No test annotations available

github-actions · 2025-03-06T09:50:35Z

	Tests	Passed ✅	Skipped ⚠️	Failed
TT-Forge-FE Tests	684 ran	541 passed	143 skipped	0 failed

Test	Result
No test annotations available

github-actions · 2025-03-06T09:52:55Z

	Tests	Passed ✅	Skipped ⚠️	Failed
TT-Forge-FE Tests	626 ran	483 passed	143 skipped	0 failed

Test	Result
No test annotations available

forge/test/mlir/test_training.py

forge/test/mlir/llama/tests/test_llama_decode.py

github-actions · 2025-03-06T18:00:08Z

	Tests	Passed ✅	Skipped ⚠️	Failed
TT-Forge-FE Tests	634 ran	489 passed	145 skipped	0 failed

Test	Result
No test annotations available

github-actions · 2025-03-06T18:04:35Z

	Tests	Passed ✅	Skipped ⚠️	Failed
TT-Forge-FE Tests	693 ran	553 passed	140 skipped	0 failed

Test	Result
No test annotations available

github-actions · 2025-03-06T18:07:55Z

	Tests	Passed ✅	Skipped ⚠️	Failed
TT-Forge-FE Tests	634 ran	489 passed	145 skipped	0 failed

Test	Result
No test annotations available

github-actions · 2025-03-06T18:31:39Z

	Tests	Passed ✅	Skipped ⚠️	Failed
TT-Forge-FE Tests	693 ran	553 passed	140 skipped	0 failed

Test	Result
No test annotations available

ashokkumarkannan1

LGTM!!!

forge/test/mlir/test_training.py

…_with_golden with verify function

github-actions · 2025-03-07T14:30:41Z

	Tests	Passed ❌️	Skipped	Failed
TT-Forge-FE Tests	0 ran	0 passed	0 skipped	0 failed

Test	Result
No test annotations available

github-actions · 2025-03-07T14:39:26Z

	Tests	Passed ✅	Skipped ⚠️	Failed
TT-Forge-FE Tests	634 ran	490 passed	144 skipped	0 failed

Test	Result
No test annotations available

github-actions · 2025-03-07T14:44:40Z

	Tests	Passed ✅	Skipped ⚠️	Failed
TT-Forge-FE Tests	693 ran	554 passed	139 skipped	0 failed

Test	Result
No test annotations available

github-actions · 2025-03-07T14:51:27Z

	Tests	Passed ☑️	Skipped ⚠️	Failed ❌️
TT-Forge-FE Tests	693 ran	552 passed	139 skipped	2 failed

Test	Result
TT-Forge-FE Tests
pytest
test_mobilenet_v2.test_mobilenetv2_basic	❌ failure
test_resnext.test_resnext_101_torchhub_pytorch[resnext101_32x8d]	❌ failure

github-actions · 2025-03-07T15:20:32Z

	Tests	Passed ✅	Skipped ⚠️	Failed
TT-Forge-FE Tests	634 ran	490 passed	144 skipped	0 failed

Test	Result
No test annotations available

github-actions · 2025-03-07T15:35:25Z

	Tests	Passed ✅	Skipped ⚠️	Failed
TT-Forge-FE Tests	693 ran	554 passed	139 skipped	0 failed

Test	Result
No test annotations available

chandrasekaranpradeep requested a review from ashokkumarkannan1 March 4, 2025 07:47

chandrasekaranpradeep marked this pull request as ready for review March 4, 2025 09:15

chandrasekaranpradeep requested review from vkovinicTT, mstojkovicTT, nvukobratTT, pilkicTT and dgolubovicTT as code owners March 4, 2025 09:15

vkovinicTT reviewed Mar 4, 2025

View reviewed changes

forge/test/mlir/llama/tests/test_llama_decode.py Outdated Show resolved Hide resolved

forge/test/mlir/llama/tests/test_llama_decode.py Outdated Show resolved Hide resolved

chandrasekaranpradeep force-pushed the pchandrasekaran/fix_execution_depth_recording branch from 9a885b6 to 86c3e5b Compare March 4, 2025 12:48

chandrasekaranpradeep force-pushed the pchandrasekaran/fix_execution_depth_recording branch from 86c3e5b to fda39e1 Compare March 6, 2025 07:14

chandrasekaranpradeep requested a review from vkovinicTT March 6, 2025 07:20

vkovinicTT requested changes Mar 6, 2025

View reviewed changes

forge/test/mlir/test_training.py Show resolved Hide resolved

nvukobratTT approved these changes Mar 6, 2025

View reviewed changes

chandrasekaranpradeep force-pushed the pchandrasekaran/fix_execution_depth_recording branch from fda39e1 to 6288878 Compare March 6, 2025 15:26

chandrasekaranpradeep requested a review from vkovinicTT March 6, 2025 15:27

chandrasekaranpradeep force-pushed the pchandrasekaran/fix_execution_depth_recording branch from 6288878 to e771e4a Compare March 6, 2025 15:32

ashokkumarkannan1 requested changes Mar 6, 2025

View reviewed changes

forge/test/mlir/llama/tests/test_llama_decode.py Outdated Show resolved Hide resolved

chandrasekaranpradeep force-pushed the pchandrasekaran/fix_execution_depth_recording branch from e771e4a to 1a23cc9 Compare March 7, 2025 12:35

chandrasekaranpradeep requested a review from ashokkumarkannan1 March 7, 2025 12:36

ashokkumarkannan1 approved these changes Mar 7, 2025

View reviewed changes

chandrasekaranpradeep force-pushed the pchandrasekaran/fix_execution_depth_recording branch from 1a23cc9 to 898dd24 Compare March 7, 2025 12:46

vkovinicTT requested changes Mar 7, 2025

View reviewed changes

forge/test/mlir/test_training.py Outdated Show resolved Hide resolved

chandrasekaranpradeep force-pushed the pchandrasekaran/fix_execution_depth_recording branch from 898dd24 to 6ae6015 Compare March 7, 2025 13:26

chandrasekaranpradeep requested a review from vkovinicTT March 7, 2025 13:30

vkovinicTT approved these changes Mar 7, 2025

View reviewed changes

Fix the execution phase and stage recording issue and replace compare…

d2b1f0d

…_with_golden with verify function

chandrasekaranpradeep force-pushed the pchandrasekaran/fix_execution_depth_recording branch from 6ae6015 to d2b1f0d Compare March 7, 2025 13:34

chandrasekaranpradeep merged commit c80672e into main Mar 7, 2025
11 checks passed

chandrasekaranpradeep deleted the pchandrasekaran/fix_execution_depth_recording branch March 7, 2025 15:35

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix the execution phase and stage recording issue and replace compare_with_golden with verify function #1358

Fix the execution phase and stage recording issue and replace compare_with_golden with verify function #1358

chandrasekaranpradeep commented Mar 4, 2025 •

edited

Loading

github-actions bot commented Mar 4, 2025

github-actions bot commented Mar 4, 2025

github-actions bot commented Mar 4, 2025

github-actions bot commented Mar 4, 2025

github-actions bot commented Mar 4, 2025

github-actions bot commented Mar 4, 2025

github-actions bot commented Mar 4, 2025

github-actions bot commented Mar 4, 2025

codecov-commenter commented Mar 6, 2025 •

edited

Loading

github-actions bot commented Mar 6, 2025

github-actions bot commented Mar 6, 2025

github-actions bot commented Mar 6, 2025

github-actions bot commented Mar 6, 2025

github-actions bot commented Mar 6, 2025

github-actions bot commented Mar 6, 2025

github-actions bot commented Mar 6, 2025

github-actions bot commented Mar 6, 2025

ashokkumarkannan1 left a comment

github-actions bot commented Mar 7, 2025

github-actions bot commented Mar 7, 2025

github-actions bot commented Mar 7, 2025

github-actions bot commented Mar 7, 2025

github-actions bot commented Mar 7, 2025

github-actions bot commented Mar 7, 2025

Fix the execution phase and stage recording issue and replace compare_with_golden with verify function #1358

Fix the execution phase and stage recording issue and replace compare_with_golden with verify function #1358

Conversation

chandrasekaranpradeep commented Mar 4, 2025 • edited Loading

Problem Description

Example:

Issue Origin

Reason for Current Implementation

Proposed Fix

Benefits of This Change

Note:

Llama Decode with cache:

github-actions bot commented Mar 4, 2025

github-actions bot commented Mar 4, 2025

github-actions bot commented Mar 4, 2025

github-actions bot commented Mar 4, 2025

github-actions bot commented Mar 4, 2025

github-actions bot commented Mar 4, 2025

github-actions bot commented Mar 4, 2025

github-actions bot commented Mar 4, 2025

codecov-commenter commented Mar 6, 2025 • edited Loading

Codecov Report

github-actions bot commented Mar 6, 2025

github-actions bot commented Mar 6, 2025

github-actions bot commented Mar 6, 2025

github-actions bot commented Mar 6, 2025

github-actions bot commented Mar 6, 2025

github-actions bot commented Mar 6, 2025

github-actions bot commented Mar 6, 2025

github-actions bot commented Mar 6, 2025

ashokkumarkannan1 left a comment

Choose a reason for hiding this comment

github-actions bot commented Mar 7, 2025

github-actions bot commented Mar 7, 2025

github-actions bot commented Mar 7, 2025

github-actions bot commented Mar 7, 2025

github-actions bot commented Mar 7, 2025

github-actions bot commented Mar 7, 2025

chandrasekaranpradeep commented Mar 4, 2025 •

edited

Loading

codecov-commenter commented Mar 6, 2025 •

edited

Loading