fix: Fix logic in OnnxToOvNetworkBindings for stateful models #719

RyanMetcalfeInt8 · 2025-06-24T19:50:25Z

This resolves the issue on ovep-develop where stateful models encounter a runtime error when using EPCtx-wrapped models with ORT GenAI.

The issue that I encountered when using ORT GenAI with EPCtx IRs was as follows:

Both the .onnx and the encapsulated IR have a 'beam_idx' tensor defined, so matched_names is set to true (line 67).

The old logic was:

if (!matched_names && session_context.enable_causallm &&
    std::any_of(special_io_names_.begin(), special_io_names_.end(),
                [&onnx_name](const std::string& name) { return onnx_name.find(name) != std::string::npos; })) {
  // This case also requires dynamic shape inference, so we'll mark the bindings as dynamic.
  has_dynamic_io_ = true;
  continue;
}

So it immediately skips over this block because matched_names == true. ORT GenAI doesn't set beam_idx, and hence we hit a runtime error during BasicBackend::Infer.

I reworked the logic here to reflect the fact that both of these conditions should be true to hit this continue (skip io binding):

1. session_context.enable_causallm is true (i.e. stateful flow is enabled).
2. Either there was a name mismatch, or the tensor name is part of the special io names list.

I accomplished this with a nested if:

if (session_context.enable_causallm) {
  if (!matched_names ||
      std::any_of(special_io_names_.begin(), special_io_names_.end(),
                  [&onnx_name](const std::string& name) { return onnx_name.find(name) != std::string::npos; })) {
    // This case also requires dynamic shape inference, so we'll mark the bindings as dynamic.
    has_dynamic_io_ = true;
    continue;
  }
}

That seemed more readable to me than if (session_context.enable_causallm && ([... the other conditions ...])).
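To make the behavioral difference concrete, here is a minimal standalone sketch that isolates the old and new skip conditions as plain predicates. It is not the code in basic_backend.h: the function names SkipBindingOld, SkipBindingNew, and IsSpecialIoName, and the contents of kSpecialIoNames, are hypothetical and chosen for illustration, on the assumption that beam_idx is one of the special IO names.

```cpp
#include <algorithm>
#include <cassert>  // only needed if you run the checks from the note below
#include <string>
#include <vector>

// Hypothetical stand-in for special_io_names_; the real list is not shown in this PR.
static const std::vector<std::string> kSpecialIoNames = {"beam_idx"};

// True if onnx_name contains any of the special IO names as a substring.
static bool IsSpecialIoName(const std::string& onnx_name) {
  return std::any_of(kSpecialIoNames.begin(), kSpecialIoNames.end(),
                     [&onnx_name](const std::string& name) {
                       return onnx_name.find(name) != std::string::npos;
                     });
}

// Old condition: all three terms had to hold, so a matched name never skipped.
static bool SkipBindingOld(bool matched_names, bool enable_causallm,
                           const std::string& onnx_name) {
  return !matched_names && enable_causallm && IsSpecialIoName(onnx_name);
}

// New condition: in stateful mode, skip on a name mismatch OR a special IO name.
static bool SkipBindingNew(bool matched_names, bool enable_causallm,
                           const std::string& onnx_name) {
  if (enable_causallm) {
    if (!matched_names || IsSpecialIoName(onnx_name)) {
      return true;
    }
  }
  return false;
}
```

For the case described above (matched_names == true, enable_causallm == true, onnx_name == "beam_idx"), SkipBindingOld returns false and SkipBindingNew returns true. With enable_causallm == false, both return false regardless of the other inputs.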

Copilot

Pull Request Overview

This PR fixes the conditional logic in OnnxToOvNetworkBindings to correctly skip IO bindings for stateful (causal LM) models when encountering name mismatches or special IO names, preventing runtime errors.

- Restructured the enable_causallm check into a nested if so both stateful mode and name conditions are evaluated together.
- Clarified comments around handling of newly introduced tensors like beam_idx.

Comments suppressed due to low confidence (1)

onnxruntime/core/providers/openvino/backends/basic_backend.h:75

Add or update unit tests to cover the new stateful branch where session_context.enable_causallm is true and inputs have unmatched or special IO names, ensuring dynamic shape inference is validated.

        if (session_context.enable_causallm) {


gblong1

LGTM. Tested on LNL with Phi3.5, Qwen2.5, and Deepseek

RyanMetcalfeInt8 and others added 2 commits June 24, 2025 11:46

fix: Fix logic in OnnxToOvNetworkBindings for stateful models

5be7a41

Merge branch 'ovep-develop' into stateful_fixes

94fe7ea

MayureshV1 requested a review from Copilot June 25, 2025 05:02

Copilot AI reviewed Jun 25, 2025


gblong1 approved these changes Jun 25, 2025


ankitm3k self-requested a review June 25, 2025 05:38
