Add confusion matrix #533

feifjiang · 2020-12-04T02:39:33Z

This PR replaces #532 #530

codecov · 2020-12-04T03:03:07Z

Codecov Report

Merging #533 (70b7cab) into master (13ad9cd) will increase coverage by 86.77%.
The diff coverage is 95.31%.

@@             Coverage Diff             @@
##           master     #533       +/-   ##
===========================================
+ Coverage        0   86.77%   +86.77%     
===========================================
  Files           0      347      +347     
  Lines           0    12019    +12019     
  Branches        0      389      +389     
===========================================
+ Hits            0    10430    +10430     
- Misses          0     1589     +1589

Impacted Files	Coverage Δ
...op/evaluators/OpMultiClassificationEvaluator.scala	`95.62% <95.31%> (ø)`
...main/scala/org/apache/spark/ml/tree/RichNode.scala	`0.00% <0.00%> (ø)`
.../op/stages/impl/feature/NameEntityRecognizer.scala	`100.00% <0.00%> (ø)`
...salesforce/op/stages/impl/tuning/OpValidator.scala	`94.59% <0.00%> (ø)`
...org/apache/spark/ml/attribute/MetadataHelper.scala	`100.00% <0.00%> (ø)`
...ce/op/stages/impl/feature/TextLenTransformer.scala	`100.00% <0.00%> (ø)`
...cala/com/salesforce/op/test/TestSparkContext.scala	`83.33% <0.00%> (ø)`
...la/com/salesforce/op/stages/OpPipelineStages.scala	`63.88% <0.00%> (ø)`
.../com/salesforce/op/utils/spark/RichEvaluator.scala	`100.00% <0.00%> (ø)`
...ain/scala/com/salesforce/op/aggregators/Sets.scala	`37.50% <0.00%> (ø)`
... and 338 more

Continue to review full report at Codecov.

Legend - Click here to learn more
Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update 13ad9cd...70b7cab. Read the comment docs.

core/src/main/scala/com/salesforce/op/evaluators/OpMultiClassificationEvaluator.scala

gerashegalov

LGTM, question re confMatrixThresholds parameter upper bound

gerashegalov · 2020-12-04T22:54:14Z

core/src/main/scala/com/salesforce/op/evaluators/OpMultiClassificationEvaluator.scala

@@ -122,9 +120,9 @@ private[op] class OpMultiClassificationEvaluator
 parent = this,
 name = "confMatrixThresholds",
 doc = "sequence of threshold values used for confusion matrix metrics",
- isValid = _.forall(x => x >= 0.0 && x <= 1.0)
+ isValid = _.forall(x => x >= 0.0 && x < 1.0)


why is 1.0 not allowed?

For multinomial logistic regression, the scenario where top probability equals to 1.0 will never happen. For random forest it's quite unlikely, unless the record falls onto one and the only one leaf node of the same class in every tree. Therefore it doesn't really make sense to have confidence threshold of 1.0

just wondering how much value is there to restrict it here?

@gerashegalov are you concerned about too many values present in the confMatrixThresholds array? That's a valid concern, I can add a check for that

core/src/main/scala/com/salesforce/op/evaluators/OpMultiClassificationEvaluator.scala

local/src/test/scala/com/salesforce/op/local/OpWorkflowModelLocalTest.scala

… with e-agent team

tovbinm · 2020-12-10T18:31:29Z

core/src/main/scala/com/salesforce/op/evaluators/OpMultiClassificationEvaluator.scala

@@ -597,6 +600,13 @@ case class MisClassificationsPerCategory
 MisClassifications: Map[Double, Long]
 )

+case class labelPredictionConfidence


@feifjiang are you planning to use LabelPredictionConfidenceCt and labelPredictionConfidence somewhere?

@tovbinm sorry - forgot to remove it. Just updated the PR again.

feifjiang · 2020-12-10T20:09:11Z

@tovbinm @gerashegalov Can either one of you merge my PR with master? I don't have write access.

tovbinm · 2020-12-10T20:45:03Z

@feifjiang thank you for your contribution!

feifjiang · 2020-12-10T21:01:02Z

@tovbinm Thank you for the comments and being so responsive!!

feifjiang added 2 commits December 3, 2020 18:32

add confusion matrix

c98190e

Merge remote-tracking branch 'upstream/master' into ff/cm

aa0e63a

feifjiang requested review from gerashegalov, Jauntbox, leahmcguire, nicodv and tovbinm as code owners December 4, 2020 02:39

gerashegalov suggested changes Dec 4, 2020

View reviewed changes

address Gera's comments

c8fcc2c

feifjiang requested a review from gerashegalov December 4, 2020 22:30

gerashegalov approved these changes Dec 4, 2020

View reviewed changes

tovbinm reviewed Dec 5, 2020

View reviewed changes

core/src/main/scala/com/salesforce/op/evaluators/OpMultiClassificationEvaluator.scala Show resolved Hide resolved

tovbinm reviewed Dec 5, 2020

View reviewed changes

core/src/main/scala/com/salesforce/op/evaluators/OpMultiClassificationEvaluator.scala Outdated Show resolved Hide resolved

tovbinm reviewed Dec 5, 2020

View reviewed changes

core/src/main/scala/com/salesforce/op/evaluators/OpMultiClassificationEvaluator.scala Show resolved Hide resolved

tovbinm reviewed Dec 5, 2020

View reviewed changes

feifjiang added 2 commits December 7, 2020 14:42

addressed comments from previous commit

25754cf

reduced threshold in OpWorkflowModelLocalTest

e18f2e7

feifjiang requested review from tovbinm and gerashegalov December 7, 2020 22:53

(1)address reviewer comment (2)modified default values per discussion…

e5b71a5

… with e-agent team

tovbinm approved these changes Dec 10, 2020

View reviewed changes

tovbinm reviewed Dec 10, 2020

View reviewed changes

feifjiang added 2 commits December 10, 2020 11:20

remove un-used case class

d2e465d

removed unused code

70b7cab

tovbinm merged commit 91724f1 into salesforce:master Dec 10, 2020

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add confusion matrix #533

Add confusion matrix #533

feifjiang commented Dec 4, 2020 •

edited by gerashegalov

Loading

codecov bot commented Dec 4, 2020 •

edited

Loading

gerashegalov left a comment

gerashegalov Dec 4, 2020

feifjiang Dec 7, 2020 •

edited

Loading

gerashegalov Dec 8, 2020

feifjiang Dec 8, 2020 •

edited

Loading

tovbinm Dec 10, 2020

feifjiang Dec 10, 2020

feifjiang commented Dec 10, 2020

tovbinm commented Dec 10, 2020

feifjiang commented Dec 10, 2020

Add confusion matrix #533

Add confusion matrix #533

Conversation

feifjiang commented Dec 4, 2020 • edited by gerashegalov Loading

codecov bot commented Dec 4, 2020 • edited Loading

Codecov Report

gerashegalov left a comment

Choose a reason for hiding this comment

gerashegalov Dec 4, 2020

Choose a reason for hiding this comment

feifjiang Dec 7, 2020 • edited Loading

Choose a reason for hiding this comment

gerashegalov Dec 8, 2020

Choose a reason for hiding this comment

feifjiang Dec 8, 2020 • edited Loading

Choose a reason for hiding this comment

tovbinm Dec 10, 2020

Choose a reason for hiding this comment

feifjiang Dec 10, 2020

Choose a reason for hiding this comment

feifjiang commented Dec 10, 2020

tovbinm commented Dec 10, 2020

feifjiang commented Dec 10, 2020

feifjiang commented Dec 4, 2020 •

edited by gerashegalov

Loading

codecov bot commented Dec 4, 2020 •

edited

Loading

feifjiang Dec 7, 2020 •

edited

Loading

feifjiang Dec 8, 2020 •

edited

Loading