Add resnet and llama to benchmark models #1386
base: main
Conversation
Codecov Report: All modified and coverable lines are covered by tests ✅

```
@@           Coverage Diff           @@
##             main    #1386   +/-   ##
=======================================
  Coverage   43.40%   43.40%
=======================================
  Files          48       48
  Lines        7860     7860
=======================================
  Hits         3412     3412
  Misses       4448     4448
=======================================
```

View full report in Codecov by Sentry.
@dgolubovicTT @pilkicTT @vmilosevic Can you take a look?
@vcanicTT I see that you have another PR open (for resnet)... should that one be closed?
```diff
@@ -8,4 +8,10 @@
 # MNIST Linear
-python forge/test/benchmark/benchmark.py -m mnist_linear -bs 1 -lp 32 -o forge-benchmark-e2e-mnist.json
+PYTHONPATH=./forge python forge/test/benchmark/benchmark.py -m mnist_linear -bs 1 -lp 32 -o forge-benchmark-e2e-mnist.json
```
Why is the `PYTHONPATH` set? We should avoid this...
I had problems with Python packages and imports, and this was the best way I found to solve them. When I run the tests via the benchmark script, it doesn't see the tests module. Do you have any suggestions on how to resolve this in a better way?
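One alternative worth considering is having the script fix up `sys.path` itself, so callers never need to set `PYTHONPATH`. A minimal sketch, assuming `benchmark.py` sits at `forge/test/benchmark/benchmark.py` and the package that fails to import lives under the `forge/` directory:

```python
# Hypothetical sketch for the top of forge/test/benchmark/benchmark.py:
# make packages under forge/ importable without callers setting PYTHONPATH.
import sys
from pathlib import Path

# benchmark.py -> benchmark/ -> test/ -> forge/
FORGE_DIR = Path(__file__).resolve().parents[2]
if str(FORGE_DIR) not in sys.path:
    sys.path.insert(0, str(FORGE_DIR))
```

With this at the top of the script, `python forge/test/benchmark/benchmark.py ...` should work from the repo root without the `PYTHONPATH=./forge` prefix. A more durable option, if the repo's packaging metadata supports it, would be an editable install (`pip install -e .`).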
.github/workflows/perf-benchmark.yml (outdated)

```diff
@@ -82,7 +82,9 @@ jobs:
       shell: bash
       run: |
         source env/activate
-        python forge/test/benchmark/benchmark.py -m mnist_linear -bs 1 -lp 32 -o ${{ steps.strings.outputs.perf_report_path }}
+        PYTHONPATH=./forge python forge/test/benchmark/benchmark.py -m mnist_linear -bs 1 -lp 32 -o ${{ steps.strings.outputs.perf_report_path }}
```
It would be good to run all the models with a higher batch size.
The tests can be run with batch sizes larger than one. For now, we keep it at one to ensure the models work correctly; we can increase it later, once we start tracking results and are confident they are preserved accurately.
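For reference, raising the batch size is just a flag change on the existing command; the value 32 below is an arbitrary illustration, not a value used in this PR:

```sh
# Hypothetical example: same benchmark invocation with a larger batch size.
PYTHONPATH=./forge python forge/test/benchmark/benchmark.py -m mnist_linear -bs 32 -lp 32 -o forge-benchmark-e2e-mnist.json
```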
Yes, I sent a PR only for ResNet. Then, I rebased on the ResNet branch and continued working on LLaMA. If we do everything in this PR, I will close the first one.
Force-pushed from 24b220f to c505238.
```diff
@@ -44,7 +44,10 @@ jobs:
       run: |
         echo "work-dir=$(pwd)" >> "$GITHUB_OUTPUT"
         echo "build-output-dir=$(pwd)/build" >> "$GITHUB_OUTPUT"
+        echo "perf_report_path=forge-benchmark-e2e-mnist_$JOB_ID.json" >> "$GITHUB_OUTPUT"
```
@vmilosevic Can you look at this part of the file and tell me whether you're OK with this change? Each model expects a .json file as output, so I made a separate path for each of the three models, plus one path for the folder from which we upload the files, i.e. the benchmark reports.
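A minimal sketch of what that outputs step might look like; the resnet/llama path names and the reports folder name are illustrative assumptions, not taken verbatim from the diff:

```sh
# Hypothetical sketch of the outputs step: one report path per model,
# plus a shared folder that the upload step reads from.
echo "perf_report_dir=perf-reports" >> "$GITHUB_OUTPUT"
echo "perf_report_path_mnist=perf-reports/forge-benchmark-e2e-mnist_$JOB_ID.json" >> "$GITHUB_OUTPUT"
echo "perf_report_path_resnet=perf-reports/forge-benchmark-e2e-resnet_$JOB_ID.json" >> "$GITHUB_OUTPUT"
echo "perf_report_path_llama=perf-reports/forge-benchmark-e2e-llama_$JOB_ID.json" >> "$GITHUB_OUTPUT"
```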
We want to track the end-to-end performance of different models, so we have added ResNet HF and Llama Prefill to the benchmark models. This PR also includes a few changes to the existing models, so that all models are written in the same manner and their reports upload successfully.
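For illustration, the new models would be run the same way as the existing MNIST benchmark; the model identifiers below (`resnet`, `llama`) are assumptions based on the PR title, not confirmed flag values:

```sh
# Hypothetical invocations for the new benchmark models.
PYTHONPATH=./forge python forge/test/benchmark/benchmark.py -m resnet -bs 1 -lp 32 -o forge-benchmark-e2e-resnet.json
PYTHONPATH=./forge python forge/test/benchmark/benchmark.py -m llama -bs 1 -lp 32 -o forge-benchmark-e2e-llama.json
```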