Calculate ReduceSum row by row in ONNX model from OneVsAllTrainer #4904
Conversation
Is it possible to add a test for this?
@@ -928,6 +928,8 @@ public override bool SaveAsOnnx(OnnxContext ctx, string[] outputNames, string fe
    var sumOutput = ctx.AddIntermediateVariable(NumberDataViewType.Single, "SumOutput");
    var sumNode = ctx.CreateNode(opType, expOutput, sumOutput, ctx.GetNodeName(opType), "");
    sumNode.AddAttribute("keepdims", 1);
    long[] list = { 1 };
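The `list = { 1 }` above supplies the `axes` attribute for the `ReduceSum` node. A minimal numpy sketch (not the actual ML.NET code) of why that axis matters:

```python
import numpy as np

# Two rows of class scores, as a batch of shape (2, 3).
batch = np.array([[1.0, 2.0, 3.0],
                  [4.0, 5.0, 6.0]])
exp = np.exp(batch)

# Without axes, ReduceSum collapses the whole batch into one scalar.
sum_all = exp.sum(keepdims=True)           # shape (1, 1)

# With axes = [1] (and keepdims = 1), each row is summed separately.
sum_rows = exp.sum(axis=1, keepdims=True)  # shape (2, 1)

# Dividing by the per-row sums gives a valid softmax per row.
probs = exp / sum_rows
print(probs.sum(axis=1))  # each row sums to 1.0
```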
What about the graph itself when run with SoftMax? Does it still look broken?
Yes, the sigmoid nodes (which are actually never needed for LightGBM multiclass with softmax) are still there, without their output connected to anything.
I did try to find a way to eliminate them, but it seems that various changes would be needed in the OneVsAll trainer and also in the calibrators' SaveAsOnnx methods.
I didn't look much into it, since I didn't know if it was worth it: solving that won't have any effect on the model's output, and it isn't actually related to the issue I am addressing in this PR (if anything, it makes the ONNX model architecture uglier and somewhat confusing, but nothing else).
Do you want me to fix that problem? In this PR?
Yes, please fix the problem. I will leave it to you whether you want to fix this in a separate PR or this PR.
Ok, I will address that in a separate PR.
@Lynx1820 I believe the OnnxTransformer only works with batches of one row at a time, so I don't see how I can test this issue in the ML.NET tests. If anything, there's already a test for multiclass trainers, which tests LightGBM with useSoftmax = true, and that test still passes even with my changes.
It is okay to leave this without a test for now, because the changes required for OnnxTransformer to support batch processing are much larger. We will address this later.
There's a bug with the ONNX models exported from OneVsAllTrainers that have `OutputFormula = OutputFormula.Softmax`. (Notice that, to the best of my knowledge, only a LightGBM multiclass trainer that had `useSoftmax = true` would have such an `OutputFormula`.)

The problem was that the SoftMax (particularly the `ReduceSum` part of it) would be applied by summing over the whole input batch, instead of computing a separate sum for each row. This PR fixes that.

Notice that this error wasn't surfaced by our tests, since the `OnnxTransformer`, which applies the ONNX model, actually processes one row at a time, so the batch always consists of a single row. The problem only appeared when using the model directly with the OnnxRuntime API (without the `OnnxTransformer`).
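The reason single-row batches mask the bug can be shown with a small numpy sketch (an illustration of the described behavior, not the actual exported graph): for one row, summing the whole batch and summing per row give the same denominator, so both variants agree; with two or more rows they diverge.

```python
import numpy as np

def softmax_whole_batch(x):
    # Buggy behavior: one ReduceSum over the entire batch.
    e = np.exp(x)
    return e / e.sum(keepdims=True)

def softmax_per_row(x):
    # Fixed behavior: ReduceSum with axes = [1], keepdims = 1.
    e = np.exp(x)
    return e / e.sum(axis=1, keepdims=True)

one_row = np.array([[1.0, 2.0, 3.0]])
print(np.allclose(softmax_whole_batch(one_row), softmax_per_row(one_row)))   # True

two_rows = np.array([[1.0, 2.0, 3.0], [4.0, 5.0, 6.0]])
print(np.allclose(softmax_whole_batch(two_rows), softmax_per_row(two_rows)))  # False
```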