Change default EvaluationMetric for LightGbm trainers to conform to d… #3859

najeeb-kazmi · 2019-06-13T00:38:42Z

…efault metric in standalone LightGbm

In ML.NET, the default EvaluationMetric for LightGbm is set to EvaluateMetricType.Error for multiclass, EvaluationMetricType.LogLoss for binary, and so on. This leads to inconsistent behavior from the user's perspective: If a user specified EvaluationMetric = EvaluateMetricType.Default, the parameter passed to LightGbm would be the empty string "", which is the LightGbm default and means that the metric is selected based on the objective. However, if they do not specify EvaluationMetric at all, the parameter passed to LightGbm would be Error for multiclass, LogLoss for binary, and so on.

We should make the experience for LightGbm in ML.NET consistent with what a user of standalone LightGbm experiences, and not expect them to dig through LightGbm docs and ML.NET docs to find this out.

This PR makes the user experience consistent with standalone LightGbm by by changing the default EvaluationMetric in ML.NET to EvaluationMetricType.Default.

LightGbm metric parameters docs

…efault metric in standalone LightGbm

yaeldekel · 2019-06-13T16:09:55Z

src/Microsoft.ML.LightGbm/LightGbmBinaryTrainer.cs

@@ -162,7 +162,7 @@ public enum EvaluateMetricType
            [Argument(ArgumentType.AtMostOnce,
                HelpText = "Evaluation metrics.",
                ShortName = "em")]
-            public EvaluateMetricType EvaluationMetric = EvaluateMetricType.Logloss;
+            public EvaluateMetricType EvaluationMetric = EvaluateMetricType.Default;


Default [](start = 76, length = 7)

Isn't this a breaking change?
cc @eerhardt

Yes, but the other option is to have an inconsistent user experience. I talked to @ebarsoumMS about this. Let's discuss and reach a conclusion.

It's not an "API breaking change". I think it falls into the scenarios that @TomFinley listed here #3602 (comment).

However even many years later sometimes we still have somewhat troublesome defaults running around

Here, if there is a better default value, I think it is acceptable to change the default.

Also, this doesn't actually change training behavior, nor the metrics calculated by ML.NET evaluators. Just changes the metric that LightGbm calculates internally.

In ML.NET, when we do the following (e.g. for binary classification)

var transformedTestData = model.Transform(testData); var metrics = mlContext.BinaryClassification.Evaluate(transformedTestData);

the evaluator computes all relevant metrics for binary classification regardless of what is specified by LightGbm's EvaluationMetric parameter.

It may control LightGBM's early stopping, but otherwise I think this is a NOOP change. ML.NET doesn't relay the stdout from LightGBM to the user, and ML.NET uses its own evaluators for computing the final metrics.

Users could benefit from ML.NET relaying this info back to the user. This would allow a GUI to show the learning curves in real time (or as text output from a CLI):

yaeldekel

codemzs

…orm to default metric in standalone LightGbm (dotnet#3859)" This reverts commit 3a35a82.

…efault metric in standalone LightGbm (dotnet#3859)

Change default EvaluationMetric for LightGbm trainers to conform to d…

4c9c786

…efault metric in standalone LightGbm

najeeb-kazmi requested review from abgoswam, artidoro, singlis and ganik June 13, 2019 00:38

yaeldekel reviewed Jun 13, 2019

View reviewed changes

yaeldekel approved these changes Jun 14, 2019

View reviewed changes

codemzs approved these changes Jul 1, 2019

View reviewed changes

codemzs merged commit 3a35a82 into dotnet:master Jul 1, 2019

codemzs added a commit to codemzs/machinelearning that referenced this pull request Jul 3, 2019

Revert "Change default EvaluationMetric for LightGbm trainers to conf…

48644ab

…orm to default metric in standalone LightGbm (dotnet#3859)" This reverts commit 3a35a82.

justinormont mentioned this pull request Jul 3, 2019

Change default # of iterations in Averaged Perceptron to 10 #2305

Closed

rayankrish mentioned this pull request Jul 15, 2019

Stop LightGbm Warning for Default Metric Input [Issue #3965 Fix] #4007

Merged

Dmitry-A pushed a commit to Dmitry-A/machinelearning that referenced this pull request Jul 24, 2019

Change default EvaluationMetric for LightGbm trainers to conform to d…

24836d5

…efault metric in standalone LightGbm (dotnet#3859)

najeeb-kazmi deleted the 3822 branch January 30, 2020 01:23

ghost locked as resolved and limited conversation to collaborators Mar 21, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Change default EvaluationMetric for LightGbm trainers to conform to d… #3859

Change default EvaluationMetric for LightGbm trainers to conform to d… #3859

najeeb-kazmi commented Jun 13, 2019 •

edited

Loading

yaeldekel Jun 13, 2019

najeeb-kazmi Jun 13, 2019

eerhardt Jun 13, 2019

najeeb-kazmi Jun 13, 2019

justinormont Jun 13, 2019

yaeldekel left a comment

codemzs left a comment

Change default EvaluationMetric for LightGbm trainers to conform to d… #3859

Change default EvaluationMetric for LightGbm trainers to conform to d… #3859

Conversation

najeeb-kazmi commented Jun 13, 2019 • edited Loading

yaeldekel Jun 13, 2019

Choose a reason for hiding this comment

najeeb-kazmi Jun 13, 2019

Choose a reason for hiding this comment

eerhardt Jun 13, 2019

Choose a reason for hiding this comment

najeeb-kazmi Jun 13, 2019

Choose a reason for hiding this comment

justinormont Jun 13, 2019

Choose a reason for hiding this comment

yaeldekel left a comment

Choose a reason for hiding this comment

codemzs left a comment

Choose a reason for hiding this comment

najeeb-kazmi commented Jun 13, 2019 •

edited

Loading