Skip to content

test: L0_orca_trtllm fixed #8191

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Open
wants to merge 1 commit into
base: main
Choose a base branch
from
Open

test: L0_orca_trtllm fixed #8191

wants to merge 1 commit into from

Conversation

indrajit96
Copy link
Contributor

What does the PR do?

Fix L0_orca_trtllm which was broken due to new changes in the trtllm directory structure.

Checklist

  • PR title reflects the change and is of format <commit_type>: <Title>
  • Changes are described in the pull request.
  • Related issues are referenced.
  • Populated github labels field
  • Added test plan and verified test passes.
  • Verified that the PR passes existing CI.
  • Verified copyright is correct on all changed files.
  • Added succinct git squash message before merging ref.
  • All template sections are filled out.
  • Optional: Additional screenshots for behavior/output changes with before/after.

Commit Type:

Check the conventional commit type
box here and add the label to the github PR.

  • build
  • ci
  • docs
  • feat
  • fix
  • perf
  • refactor
  • revert
  • style
  • test

Where should the reviewer start?

test.sh

Test plan:

NA

  • CI Pipeline ID:
    27767229

Caveats:

NA

Background

Test broke due to

  1. CI runners were suddenly unable to find func definitions (RCA unknown)
  2. Change in TRTLLM directory structure which resulted in trtllm-build failing.

@indrajit96 indrajit96 requested review from rmccorm4 and yinggeh May 7, 2025 20:20

cd ${BASE_DIR}
}

clone_tensorrt_llm_backend_repo
build_gpt2_base_model
build_gpt2_tensorrt_engine
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

How did this call to build_gpt2_tensorrt_engine previously work if the function didn't exist until this PR?

Copy link
Contributor Author

@indrajit96 indrajit96 May 7, 2025

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I tried finding an answer to this, but was unable to
Checked with @mc-nv if anything changed in the way CI runs or mounts. No luck.
Double checked if the test did actually run as expected before it broke (It did)
Test started breaking when we merged 24.04 release branch into main.
They were originally ONLY defined in L0_perf_tensorrt_llm/test.sh and prev tests used to pick the defn from there

Copy link
Contributor

@rmccorm4 rmccorm4 May 7, 2025

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

They were originally ONLY defined in L0_perf_tensorrt_llm/test.sh and prev tests used to pick the defn from there

Do we do a source L0_perf_tensorrt_llm/test.sh somewhere when running this test? I didn't see it in this bash script. Maybe on gitlab job side?

Copy link
Contributor

@yinggeh yinggeh May 7, 2025

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Maybe because there is no set +e so bash just ignore the error.

@@ -138,6 +145,42 @@ function kill_server {
done
}

function clone_tensorrt_llm_backend_repo {
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

In the previous PR, I suggested moving these functions to qa/common/util.sh but somehow they were deleted instead of moved.
#8009 (comment)

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Development

Successfully merging this pull request may close these issues.

4 participants