-
Notifications
You must be signed in to change notification settings - Fork 1.9k
Fix PFI issue in binary classification #4587
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Merged
Merged
Changes from all commits
Commits
Show all changes
10 commits
Select commit
Hold shift + click to select a range
4ed3ca8
Add non-calibrated evaluation to PFI
yaeldMS fe48872
change to always call EvaluateNonCalibrated
yaeldMS b6ed4b3
Add non-calibrated evaluation to PFI
yaeldMS 0559fb7
change to always call EvaluateNonCalibrated
yaeldMS 6558551
Add non-calibrated evaluation to PFI
yaeldMS 2d53161
change to always call EvaluateNonCalibrated
yaeldMS bd2af8e
Add non-calibrated evaluation to PFI
yaeldMS 6e53e5b
Add asserts to unit test
yaeldMS 8bde46b
Remove using statements
yaeldMS cd7f27c
Remove unused ctor
yaeldMS File filter
Filter by extension
Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
There are no files selected for viewing
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
|
@@ -305,6 +305,36 @@ public void TestPfiBinaryClassificationOnSparseFeatures(bool saveModel) | |
|
||
Done(); | ||
} | ||
|
||
[Fact] | ||
public void TestBinaryClassificationWithoutCalibrator() | ||
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. The test does not Assert anything. Can you please include Asserts for the relevant results that this test is supposed to verify? #Resolved |
||
{ | ||
var dataPath = GetDataPath("breast-cancer.txt"); | ||
var ff = ML.BinaryClassification.Trainers.FastForest(); | ||
var data = ML.Data.LoadFromTextFile(dataPath, | ||
new[] { new TextLoader.Column("Label", DataKind.Boolean, 0), | ||
new TextLoader.Column("Features", DataKind.Single, 1, 9) }); | ||
var model = ff.Fit(data); | ||
var pfi = ML.BinaryClassification.PermutationFeatureImportance(model, data); | ||
|
||
// For the following metrics higher is better, so minimum delta means more important feature, and vice versa | ||
Assert.Equal(7, MaxDeltaIndex(pfi, m => m.AreaUnderRocCurve.Mean)); | ||
Assert.Equal(1, MinDeltaIndex(pfi, m => m.AreaUnderRocCurve.Mean)); | ||
Assert.Equal(3, MaxDeltaIndex(pfi, m => m.Accuracy.Mean)); | ||
Assert.Equal(1, MinDeltaIndex(pfi, m => m.Accuracy.Mean)); | ||
Assert.Equal(3, MaxDeltaIndex(pfi, m => m.PositivePrecision.Mean)); | ||
Assert.Equal(1, MinDeltaIndex(pfi, m => m.PositivePrecision.Mean)); | ||
Assert.Equal(3, MaxDeltaIndex(pfi, m => m.PositiveRecall.Mean)); | ||
Assert.Equal(1, MinDeltaIndex(pfi, m => m.PositiveRecall.Mean)); | ||
Assert.Equal(3, MaxDeltaIndex(pfi, m => m.NegativePrecision.Mean)); | ||
Assert.Equal(1, MinDeltaIndex(pfi, m => m.NegativePrecision.Mean)); | ||
Assert.Equal(2, MaxDeltaIndex(pfi, m => m.NegativeRecall.Mean)); | ||
Assert.Equal(1, MinDeltaIndex(pfi, m => m.NegativeRecall.Mean)); | ||
Assert.Equal(3, MaxDeltaIndex(pfi, m => m.F1Score.Mean)); | ||
Assert.Equal(1, MinDeltaIndex(pfi, m => m.F1Score.Mean)); | ||
Assert.Equal(7, MaxDeltaIndex(pfi, m => m.AreaUnderPrecisionRecallCurve.Mean)); | ||
Assert.Equal(1, MinDeltaIndex(pfi, m => m.AreaUnderPrecisionRecallCurve.Mean)); | ||
} | ||
#endregion | ||
|
||
#region Multiclass Classification Tests | ||
|
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Small Nit:
The only difference between the
if (isCalibratedModel)
and the else case is the idv parameter. Is it possible to make this a bit more readable by factoring out just that line and using a single call to the PermutationFeatureImportance constructor? #ResolvedThere was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Resolving this comment, since
isCalibratedModel
has been removed.In reply to: 362685943 [](ancestors = 362685943)