Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Expose derived feature value in model insights #484

Merged
merged 9 commits into from
Jun 24, 2020
Merged

Conversation

TuanNguyen27
Copy link
Collaborator

@TuanNguyen27 TuanNguyen27 commented Jun 17, 2020

Some of the derived features (or engineered features) have human-readable values that could be exposed inside modelInsights for downstream consumption.

@codecov
Copy link

codecov bot commented Jun 17, 2020

Codecov Report

Merging #484 into master will increase coverage by 0.02%.
The diff coverage is 100.00%.

Impacted file tree graph

@@            Coverage Diff             @@
##           master     #484      +/-   ##
==========================================
+ Coverage   87.01%   87.03%   +0.02%     
==========================================
  Files         345      345              
  Lines       11680    11683       +3     
  Branches      378      387       +9     
==========================================
+ Hits        10163    10168       +5     
+ Misses       1517     1515       -2     
Impacted Files Coverage Δ
...c/main/scala/com/salesforce/op/ModelInsights.scala 93.09% <100.00%> (+0.04%) ⬆️
...force/op/stages/impl/feature/OPMapVectorizer.scala 97.79% <100.00%> (+0.01%) ⬆️
...es/src/main/scala/com/salesforce/op/OpParams.scala 89.79% <0.00%> (+4.08%) ⬆️

Continue to review full report at Codecov.

Legend - Click here to learn more
Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update 6d90ba6...daa5e9f. Read the comment docs.

Copy link
Collaborator

@leahmcguire leahmcguire left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM, are there other places where the descriptor value should be passed on?

Copy link
Contributor

@michaelweilsalesforce michaelweilsalesforce left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Yes, what happened with the OpMapVec? Did we miss adding descriptorValue?
Also, maybe add one test in ModelInsight when a descriptor value is being added.

@TuanNguyen27
Copy link
Collaborator Author

LGTM, are there other places where the descriptor value should be passed on?

I did a spot check, i think only OpMapVec was missing this.

@TuanNguyen27
Copy link
Collaborator Author

Yes, what happened with the OpMapVec? Did we miss adding descriptorValue?
Also, maybe add one test in ModelInsight when a descriptor value is being added.

There's this line and this line that check for non-empty derivedFeatureValue. However we don't have a test for derivedFeatureValue when it is derived from a Map feature. Is it okay if i just add a check for derivedFeatureValue when it comes from a Map feature ?

@TuanNguyen27
Copy link
Collaborator Author

@leahmcguire @michaelweilsalesforce It's actually tricky to write a test for this change, because the existing tests drop numericMap as a feature...Do you have any suggestions ?

@TuanNguyen27 TuanNguyen27 merged commit 5a2449e into master Jun 24, 2020
@tovbinm tovbinm deleted the derivedFeatValue branch August 6, 2020 05:49
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

Successfully merging this pull request may close these issues.

None yet

3 participants