This repository has been archived by the owner on Nov 17, 2023. It is now read-only.

add pos_weight for SigmoidBinaryCrossEntropyLoss #13612

Merged
merged 25 commits into from
Mar 8, 2019

Conversation

eureka7mt
Contributor

@eureka7mt eureka7mt commented Dec 11, 2018

Description

Add pos_weight for SigmoidBinaryCrossEntropyLoss.
A value of pos_weight > 1 decreases the false-negative count and hence increases recall.
Conversely, setting pos_weight < 1 decreases the false-positive count and increases precision.
This can be seen from the fact that pos_weight is introduced as a multiplicative coefficient for the positive targets term in the loss expression:
label * -log(sigmoid(pred)) * pos_weight + (1 - label) * -log(1 - sigmoid(pred))

It is adapted from TensorFlow's implementation.
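The loss expression above can be sketched numerically. This is a minimal NumPy sketch, not the actual Gluon implementation, and weighted_sigmoid_bce is a hypothetical helper name:

```python
import numpy as np

def weighted_sigmoid_bce(pred, label, pos_weight=1.0):
    # Direct transcription of the expression above (numerically naive;
    # real implementations use a numerically stable form).
    p = 1.0 / (1.0 + np.exp(-pred))
    return -(label * np.log(p) * pos_weight + (1 - label) * np.log(1 - p))

# At pred = 0, sigmoid(pred) = 0.5, so a positive label with pos_weight = 2
# incurs twice the unweighted loss of log(2).
print(weighted_sigmoid_bce(np.array([0.0]), np.array([1.0]), pos_weight=2.0))
```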

Checklist

Essentials

Please feel free to remove inapplicable items for your PR.

  • The PR title starts with [MXNET-$JIRA_ID], where $JIRA_ID refers to the relevant JIRA issue created (except PRs with tiny changes)
  • Changes are complete (i.e. I finished coding on this PR)
  • All changes have test coverage:
  • Unit tests are added for small changes to verify correctness (e.g. adding a new operator)
  • Code is well-documented:
  • For user-facing API changes, API doc string has been updated.
  • For new C++ functions in header files, their functionalities and arguments are documented.
  • For new examples, README.md is added to explain what the example does, the source of the dataset, expected performance on the test set, and a reference to the original paper if applicable
  • Check the API doc at http://mxnet-ci-doc.s3-accelerate.dualstack.amazonaws.com/PR-$PR_ID/$BUILD_ID/index.html
  • To the best of my knowledge, examples are either not affected by this change, or have been fixed to be compatible with this change

Changes

Comments

  • If this change is backward incompatible, why must it be made?
  • Interesting edge cases to note here

@eureka7mt eureka7mt requested a review from szha as a code owner December 11, 2018 08:03
@roywei
Member

roywei commented Dec 12, 2018

@eureka7mt Thanks for the contribution, could you add a unit test for this case?

@roywei
Member

roywei commented Dec 12, 2018

@mxnet-label-bot add[Gluon, pr-awaiting-review]

@marcoabreu marcoabreu added Gluon pr-awaiting-review PR is waiting for code review labels Dec 12, 2018
add test
python/mxnet/gluon/loss.py (outdated review thread, resolved)
python/mxnet/gluon/loss.py (review thread, resolved)
Member

@anirudhacharya anirudhacharya left a comment


Minor comment on the unit test; please also fix the CI failure.

Rest LGTM.

tests/python/unittest/test_loss.py (outdated review thread, resolved)
@stu1130
Contributor

stu1130 commented Jan 16, 2019

@eureka7mt Could you fix the Trailing whitespace issue?

@eureka7mt
Contributor Author

I don't know why it failed in test_multinomial_generator(), which is in the file '/work/mxnet/tests/python/gpu/../unittest/test_random.py', on unix-gpu.

@vandanavk
Contributor

@eureka7mt could you re-trigger the CI?

@ankkhedia
Contributor

@eureka7mt Could you please look into the CI failures?

@anirudhacharya
Member

@mxnet-label-bot update [pr-awaiting-merge]

@marcoabreu marcoabreu added pr-awaiting-merge Review and CI is complete. Ready to Merge and removed Gluon pr-awaiting-review PR is waiting for code review labels Mar 1, 2019
python/mxnet/gluon/loss.py (outdated review thread, resolved)
python/mxnet/gluon/loss.py (outdated review thread, resolved)
CONTRIBUTORS.md (outdated review thread, resolved)
Member

@wkcn wkcn left a comment


We will merge it after the CI passes.
Thanks for your contribution!

python/mxnet/gluon/loss.py (outdated review thread, resolved)
CONTRIBUTORS.md (outdated review thread, resolved)
CONTRIBUTORS.md (outdated review thread, resolved)
@wkcn wkcn added pr-awaiting-testing PR is reviewed and waiting CI build and test and removed pr-awaiting-merge Review and CI is complete. Ready to Merge labels Mar 7, 2019
@eureka7mt
Contributor Author

eureka7mt commented Mar 7, 2019

Adding the if-else statement causes an error. Though the default value of pos_weight is set to 1, pos_weight is usually a (1, N) NDArray, and it seems an error occurs in the if-else statement when the input is a Symbol.

@wkcn
Member

wkcn commented Mar 7, 2019

@eureka7mt
I see. Since pos_weight is a tensor, it is better to default pos_weight=None as you wrote before. I will update it. Thanks!

Edit: I think the pos_weight is a scalar, since it is a binary classification loss.

@eureka7mt
Contributor Author

@wkcn It could be a scalar when classifying a single class. But for multi-class, multi-label classification it should be a tensor, because the number of positive and negative examples differs per class.
In PyTorch it is also defined as a tensor; see the PyTorch docs.
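A small NumPy sketch (with hypothetical numbers) of why a per-class pos_weight with shape (1, N) helps in the multi-label case: it broadcasts across the batch axis, so each class gets its own weight.

```python
import numpy as np

# Hypothetical multi-label batch: 4 samples, 3 classes, all logits at 0.
pred = np.zeros((4, 3))
label = np.tile(np.array([1.0, 0.0, 1.0]), (4, 1))
# One weight per class, shape (1, N); broadcasts over the batch dimension.
pos_weight = np.array([[1.0, 3.0, 0.5]])

p = 1.0 / (1.0 + np.exp(-pred))
loss = -(label * np.log(p) * pos_weight + (1 - label) * np.log(1 - p))
print(loss.shape)  # (4, 3): a per-sample, per-class loss
```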

@wkcn
Member

wkcn commented Mar 7, 2019

I changed the order of the SigmoidBinaryCrossEntropyLoss inputs from (self, F, pred, label, pos_weight=None, sample_weight=None) to (self, F, pred, label, sample_weight=None, pos_weight=None), since we need to keep compatibility with other projects.
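The compatibility concern can be illustrated with a plain-Python sketch (f_old and f_new are hypothetical stand-ins for the two signatures): appending the new parameter after sample_weight keeps positional calls written against the old signature working.

```python
def f_old(pred, label, sample_weight=None):
    return (pred, label, sample_weight)

def f_new(pred, label, sample_weight=None, pos_weight=None):
    return (pred, label, sample_weight, pos_weight)

# A caller passing sample_weight positionally still binds correctly;
# pos_weight simply defaults to None.
print(f_new("pred", "label", "sw"))  # ('pred', 'label', 'sw', None)
```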

@wkcn
Member

wkcn commented Mar 7, 2019

Sorry that I triggered the sanity check problem.
Could someone please help me solve it? Thanks!

@eureka7mt
Contributor Author

Maybe the broadcast_mul isn't necessary. I think that NDArray * NDArray will do broadcast_mul automatically.

@wkcn
Member

wkcn commented Mar 8, 2019

@eureka7mt I think a Symbol may be passed into SigmoidBinaryCrossEntropyLoss, and in my test a Symbol will not broadcast_mul automatically:

import mxnet as mx
from mxnet.gluon import nn

class TestBlock(nn.HybridBlock):
    def __init__(self):
        super(TestBlock, self).__init__()

    def hybrid_forward(self, F, x, y):
        # For Symbol, plain * maps to elemwise_mul, which does not broadcast.
        return x * y

block = TestBlock()
block.hybridize()  # switch to the Symbol code path
a = mx.nd.zeros((10, 1))
b = mx.nd.ones((1, 5))
c = block(a, b)  # in this test, (10, 1) * (1, 5) is not broadcast and fails
print(c.asnumpy())
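By contrast, imperative ndarray arithmetic does broadcast automatically (NumPy is shown below as a stand-in, since mx.nd follows the same broadcasting rules), which is why the discrepancy only surfaces on the hybridized Symbol path and an explicit broadcast_mul is the safe choice:

```python
import numpy as np

a = np.zeros((10, 1))
b = np.ones((1, 5))
c = a * b  # broadcasts to shape (10, 5) in imperative mode
print(c.shape)
```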

@wkcn wkcn added pr-awaiting-merge Review and CI is complete. Ready to Merge and removed pr-awaiting-testing PR is reviewed and waiting CI build and test labels Mar 8, 2019
@wkcn wkcn merged commit ce9e3cf into apache:master Mar 8, 2019
@wkcn
Member

wkcn commented Mar 8, 2019

The PR has been merged.
Thanks for your contribution: )

vdantu pushed a commit to vdantu/incubator-mxnet that referenced this pull request Mar 31, 2019
* add pos_weight for SigmoidBinaryCrossEntropyLoss in gluon.loss

* Update loss.py

* add test

add test

* set the default value of pos_weight to be 1

* fix unittest

* set N be a random number

* fix issues

* test without random number

* test with random N

* fix

* fix errors

* fix errors

* fix order

* Update loss.py

* Update loss.py

* fix pylint

* default pos_weight=None

* add broadcast_mul and fix pylint

* fix unittest

* Update loss.py

* Update loss.py

* Update loss.py
nswamy pushed a commit that referenced this pull request Apr 5, 2019
haohuanw pushed a commit to haohuanw/incubator-mxnet that referenced this pull request Jun 23, 2019