Added cross entropy support to validation training, edited metric reporting #5255


Merged
8 commits merged into dotnet:master on Jul 2, 2020

Conversation

mstfbl
Contributor

@mstfbl mstfbl commented Jun 24, 2020

Fix #4807

In this PR, I've added cross entropy metric calculation and reporting during the validation phase of the Image Classification Trainer. Validation can now report a cross entropy value for each epoch.

Below are screenshots of the training and validation metrics from this repro.

Before (note that learning rate is constant, even when the given ExponentialLRDecay is used, and cross entropy during validation is reported as 0):
[Screenshot: training and validation metrics before the fix]

After:
[Screenshot: training and validation metrics after the fix]

I have also edited ToString() to report accuracy and cross entropy for the validation phase as well. Reporting learning rate decay during validation isn't necessary, as no learning rate decay occurs in validation.

I have also added tests with this PR to verify that the reported cross entropy properly decays over the epochs.
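For reference, the cross entropy metric reported here is the standard negative log-likelihood of the true class. A minimal Python sketch of that calculation (illustration only; the actual computation lives in the C# ImageClassificationTrainer, and these function names are hypothetical):

```python
import math

def cross_entropy(probs, true_index):
    # Negative log-probability assigned to the true class for one example.
    return -math.log(probs[true_index])

def batch_cross_entropy(batch_probs, labels):
    # Mean cross entropy over a batch, as would be reported per epoch.
    total = sum(cross_entropy(p, y) for p, y in zip(batch_probs, labels))
    return total / len(labels)
```

A perfectly confident correct prediction contributes 0, while lower confidence in the true class increases the reported value, which is why a healthy run shows the metric decreasing over epochs.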

@mstfbl mstfbl marked this pull request as ready for review June 25, 2020 11:18
@mstfbl mstfbl requested a review from a team as a code owner June 25, 2020 11:18
@codecov

codecov bot commented Jun 25, 2020

Codecov Report

Merging #5255 into master will increase coverage by 0.24%.
The diff coverage is 100.00%.

@@            Coverage Diff             @@
##           master    #5255      +/-   ##
==========================================
+ Coverage   73.49%   73.74%   +0.24%     
==========================================
  Files        1014     1022       +8     
  Lines      188680   190258    +1578     
  Branches    20330    20468     +138     
==========================================
+ Hits       138677   140308    +1631     
+ Misses      44493    44429      -64     
- Partials     5510     5521      +11     
Flag Coverage Δ
#Debug 73.74% <100.00%> (+0.24%) ⬆️
#production 69.50% <100.00%> (+0.20%) ⬆️
#test 87.64% <100.00%> (+0.21%) ⬆️
Impacted Files Coverage Δ
.../Microsoft.ML.Vision/ImageClassificationTrainer.cs 91.13% <100.00%> (+0.04%) ⬆️
...cenariosWithDirectInstantiation/TensorflowTests.cs 91.78% <100.00%> (+0.29%) ⬆️
src/Microsoft.ML.FastTree/RegressionTree.cs 75.51% <0.00%> (-8.17%) ⬇️
src/Microsoft.ML.LightGbm/LightGbmTrainerBase.cs 78.92% <0.00%> (-6.07%) ⬇️
src/Microsoft.ML.AutoML/API/AutoCatalog.cs 69.35% <0.00%> (-4.84%) ⬇️
....ML.AutoML/PipelineSuggesters/PipelineSuggester.cs 79.83% <0.00%> (-3.37%) ⬇️
src/Microsoft.ML.TimeSeries/ExtensionsCatalog.cs 90.62% <0.00%> (-3.00%) ⬇️
...rosoft.ML.AutoML/ColumnInference/TextFileSample.cs 59.60% <0.00%> (-2.65%) ⬇️
src/Microsoft.ML.Maml/MAML.cs 23.78% <0.00%> (-2.43%) ⬇️
...osoft.ML.Functional.Tests/IntrospectiveTraining.cs 98.73% <0.00%> (-1.27%) ⬇️
... and 50 more

@mstfbl mstfbl changed the title Added cross entropy to validation training, simplified metric reporting Added cross entropy and learning rate scheduler support to validation training, simplified metric reporting Jun 25, 2020
@mstfbl mstfbl marked this pull request as draft June 25, 2020 17:27
@mstfbl mstfbl marked this pull request as ready for review June 25, 2020 22:19
Contributor

@wangyems wangyems left a comment


The print of cross entropy looks good to me!
I'm a little skeptical about whether we should be training on the validation set. After searching, I found this description of a validation set: "it is training data used for testing, but neither as part of the low-level training nor as part of the final testing". It seems this case may fall into the category of low-level training, since the validation set is fixed during training (I might be wrong; I drew this conclusion only from the second screenshot, where the validation accuracy is often higher than the training accuracy). If the validation set is not fixed (as in cross-validation), please ignore this comment.

@mstfbl
Contributor Author

mstfbl commented Jun 30, 2020

Hey Ye,
Thank you so much for your review. After checking in with @harshithapv, we've arrived at the conclusion that the validation stage does not require learning rate scheduler support, as validation only computes loss to analyze how the training phase is going. Since no actual model training occurs during the validation phase, there is no need to decay learning rates during validation.
I'll be making the necessary changes soon! #Resolved
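For context, ExponentialLRDecay applies a TensorFlow-style exponential schedule as training steps advance. A rough Python sketch of that schedule (the formula is an assumption based on the standard exponential-decay definition; the function name is hypothetical):

```python
def exponential_lr_decay(initial_lr, global_step, decay_steps, decay_rate, staircase=False):
    # Decayed rate: initial_lr * decay_rate ** (global_step / decay_steps).
    exponent = global_step / decay_steps
    if staircase:
        # Staircase mode decays in discrete jumps at every decay_steps interval.
        exponent = float(int(exponent))
    return initial_lr * (decay_rate ** exponent)
```

During validation the step counter does not advance, so the learning rate stays fixed, which is consistent with the decision above not to report decay there.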

@mstfbl mstfbl marked this pull request as draft June 30, 2020 22:08
@@ -170,11 +170,11 @@ public sealed class TrainMetrics
public override string ToString()
{
if (DatasetUsed == ImageClassificationMetrics.Dataset.Train)
return $"Phase: Training, Dataset used: {DatasetUsed.ToString(),10}, Batch Processed Count: {BatchProcessedCount,3}, Learning Rate: {LearningRate,10} " +
Contributor Author

@mstfbl mstfbl Jul 2, 2020


I have added the learning rate to the end of the string here for cosmetic reasons: the columns of statistics are now uniformly aligned between training and validation. :) #Resolved
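The alignment comes from C#'s interpolated-string alignment component (e.g. `{LearningRate,10}` pads the value to a width of 10). A small Python analogue showing how fixed-width fields keep the training and validation lines column-aligned (illustrative only; field names and widths are hypothetical):

```python
def metric_line(phase, dataset, batch, accuracy, lr):
    # Fixed field widths mirror C# alignment specifiers like {LearningRate,10},
    # so every metrics line places its columns at the same positions.
    return (f"Phase: {phase:<10} Dataset: {dataset:>10} "
            f"Batch: {batch:>3} Accuracy: {accuracy:>8.4f} Learning Rate: {lr:>10}")
```

Because every field has a fixed width, lines printed for different phases line up when scanned in the console, regardless of the values' lengths.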

@mstfbl
Contributor Author

mstfbl commented Jul 2, 2020

@wangyems I have removed learning rate decay for validation from my PR, and the accuracy statistics before and after are now identical. Cross entropy is now reported for validation, and, as it should be, the learning rate does not change during the validation phase.

@mstfbl mstfbl requested a review from wangyems July 2, 2020 05:09
Contributor

@wangyems wangyems left a comment


lgtm

@mstfbl mstfbl marked this pull request as ready for review July 2, 2020 09:54
@mstfbl mstfbl merged commit 14c8acc into dotnet:master Jul 2, 2020
@mstfbl mstfbl changed the title Added cross entropy and learning rate scheduler support to validation training, simplified metric reporting Added cross entropy support to validation training, edited metric reporting Jul 2, 2020
@ghost ghost locked as resolved and limited conversation to collaborators Mar 18, 2022
Successfully merging this pull request may close these issues.

ImageClassifier with ExponentialLRDecay: metrics not updated/calculated during validation