
ENH - Add Pinball datafit #134

Merged (31 commits) on Dec 9, 2022

Conversation


@Badr-MOUFAD (Collaborator) commented on Dec 6, 2022

Now that we have a solver that handles non-smooth datafits (#131), this adds a Pinball datafit that can be used to fit a QuantileRegressor, and hence, as a by-product, a LAD-Lasso (quantile=0.5).

closes #133
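
For reference, here is a minimal NumPy sketch of the loss this datafit implements (variable names are illustrative, not the actual skglm API, and a possible normalization by n_samples is omitted); at quantile=0.5 it reduces to half the absolute error, i.e. the LAD datafit:

```python
import numpy as np

def pinball_loss(y, Xw, quantile):
    """Pinball (quantile) loss, summed over samples."""
    residual = y - Xw
    return (quantile * np.maximum(residual, 0)
            + (1 - quantile) * np.maximum(-residual, 0)).sum()

# at quantile=0.5 the pinball loss is 0.5 * ||y - Xw||_1 (LAD)
y, Xw = np.array([1.0, -2.0, 3.0]), np.array([0.5, 0.0, 4.0])
assert np.isclose(pinball_loss(y, Xw, 0.5), 0.5 * np.abs(y - Xw).sum())
```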

@Badr-MOUFAD (Collaborator, Author) commented on Dec 8, 2022

Benchmark results on the L1 quantile regression problem, for the following setups:

  • datasets: MEG, Simulated
  • regularization parameter: [0.5, 0.1, 0.05] * lambda_max
  • solvers: PDCD-WS, scipy.linprog solvers (a sketch of the linprog baseline formulation is given after this comment)

@lorentzenchr, your feedback is valuable 🙏
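
As context for the comparison, here is a rough sketch of how a scipy.linprog baseline for this problem can be formulated (an illustrative linear-programming reformulation, not the actual benchmark code; scaling and normalization conventions may differ):

```python
import numpy as np
from scipy.optimize import linprog

def quantile_lasso_linprog(X, y, quantile, alpha):
    """L1-penalized quantile regression solved as a linear program.

    Variables: w = w_pos - w_neg, and the residual y - Xw split into its
    positive part u and negative part v (all variables >= 0), so that the
    objective is the pinball loss plus alpha * ||w||_1.
    """
    n_samples, n_features = X.shape
    c = np.concatenate([alpha * np.ones(2 * n_features),
                        quantile * np.ones(n_samples),
                        (1 - quantile) * np.ones(n_samples)])
    # equality constraint: X (w_pos - w_neg) + u - v = y
    A_eq = np.hstack([X, -X, np.eye(n_samples), -np.eye(n_samples)])
    res = linprog(c, A_eq=A_eq, b_eq=y, bounds=(0, None), method="highs")
    return res.x[:n_features] - res.x[n_features:2 * n_features]
```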


The datafit reads::

quantile * max(y - Xw, 0) + (1 - quantile) * max(Xw - y, 0)
Collaborator commented:

nitpick: the real value does not involve np.max, it is np.maximum(...).sum(). Maybe rewrite it as a sum and use _i to denote sample indices? Check how sklearn does it.
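
To make the distinction concrete (illustrative snippet, not from the PR): np.max reduces the whole array to its single largest entry, while the datafit needs the elementwise positive parts followed by a sum:

```python
import numpy as np

r = np.array([1.5, -0.3, 2.0])   # residuals y - Xw
np.max(r)                        # 2.0: a single scalar, the largest residual
np.maximum(r, 0).sum()           # 3.5: sum of elementwise positive parts
```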

Collaborator (Author) replied:

I looked up the scikit-learn source code, but they don't specify the expression.

I'm in favor of using a sum and _i to denote samples.

@lorentzenchr left a comment:

I can't comment on the core functions like prox, as I'm not too familiar with skglm. A few more tests couldn't hurt.


The datafit reads::

sum_i quantile * max(y_i - Xw_i, 0) + (1 - quantile) * max(Xw_i - y_i, 0)

@lorentzenchr:

The naming "quantile" is unfortunate; it should be "quantile level", i.e. the quantile at level 50% is the median. The "quantile" in the formula is not the quantile itself; rather, Xw_i is an estimate of the quantile.

Collaborator replied:

Thanks @lorentzenchr for the insightful remark, we'll change that.

@mathurinm merged commit 4bee85f into scikit-learn-contrib:main on Dec 9, 2022
Successfully merging this pull request may close these issues: FEAT add Pinball loss (#133).

3 participants