[READY] Fit intercept inside cd_solver #55

mathurinm · 2022-08-22T11:59:32Z

Second part of #19 split (first part is #54)

…into intercept

skglm/estimators.py

skglm/solvers/cd_solver.py

skglm/solvers/group_bcd_solver.py

mathurinm · 2022-08-26T15:45:41Z

skglm/solvers/group_bcd_solver.py

+                intercept_old = w[-1]
+                w[-1] -= datafit.intercept_update_step(y, Xw)
+                Xw += (w[-1] - intercept_old)
+
            w_acc, Xw_acc, is_extrapolated = accelerator.extrapolate(w, Xw)

            if is_extrapolated:  # avoid computing p_obj for un-extrapolated w, Xw


Probably below, the objective should be computed with w[:n_features] rather than the full w, since w[-1] now holds the intercept. I wonder why it does not fail; maybe it's not tested.
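To make the point about w[:n_features] concrete, here is a minimal NumPy sketch of the intercept update from the hunk above. It assumes a quadratic datafit 1/(2n) * ||y - Xw||^2, for which the update step is the mean residual; the helper names `intercept_update_step` and `objective` are illustrative stand-ins, not the skglm implementation.

```python
import numpy as np


def intercept_update_step(y, Xw):
    # Assumption: quadratic datafit 1/(2n) * ||y - Xw||^2. Its derivative with
    # respect to the intercept is mean(Xw - y), with Lipschitz constant 1,
    # so a unit step minimizes the datafit exactly in the intercept.
    return np.mean(Xw - y)


def objective(y, w, Xw, alpha, n_features):
    # The L1 penalty only sees the coefficients, never the intercept in w[-1].
    datafit_value = np.sum((y - Xw) ** 2) / (2 * len(y))
    penalty_value = alpha * np.sum(np.abs(w[:n_features]))
    return datafit_value + penalty_value


rng = np.random.default_rng(0)
n_samples, n_features = 30, 5
X = rng.standard_normal((n_samples, n_features))
y = X @ rng.standard_normal(n_features) + 3.0

w = np.zeros(n_features + 1)  # last entry holds the intercept
w[:n_features] = 0.1
Xw = X @ w[:n_features] + w[-1]
obj_before = objective(y, w, Xw, alpha=0.1, n_features=n_features)

# Same three lines as in the diff: update the intercept, keep Xw in sync.
intercept_old = w[-1]
w[-1] -= intercept_update_step(y, Xw)
Xw += w[-1] - intercept_old

obj_after = objective(y, w, Xw, alpha=0.1, n_features=n_features)
```

If the penalty were evaluated on the full w, the intercept would be penalized as well and the reported objective would no longer match the problem being solved.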

skglm/solvers/multitask_bcd_solver.py

mathurinm · 2022-08-26T15:47:50Z

skglm/tests/test_group.py

@@ -132,6 +132,44 @@ def test_vs_celer_grouplasso(n_groups, n_features, shuffle):
    np.testing.assert_allclose(model.coef_, w, atol=1e-5)


+@pytest.mark.parametrize("n_groups, n_features, shuffle",
+                         [[15, 50, False]])


If only a single set of parameters is used, we can get rid of parametrize and hardcode the values.

skglm/tests/test_intercept.py

QB3

Here is a first round of comments. My main complaint is test_intercept.py, which I think could / should be merged with test_estimators.py.

skglm/tests/test_intercept.py

skglm/datafits/single_task.py

skglm/estimators.py

skglm/solvers/cd_solver.py

skglm/solvers/multitask_bcd_solver.py

mathurinm · 2022-08-29T07:17:05Z

skglm/tests/test_estimators.py

    if estimator_name == "GeneralizedLinearEstimator":
        pytest.skip()
    estimator_sk = dict_estimators_sk[estimator_name]
    estimator_ours = dict_estimators_ours[estimator_name]
+    # TODO This seems a bit unusual, maybe to discuss


@QB3 this has potentially harmful side effects: it sets fit_intercept to True on an object defined outside the function. If we first call the function with fit_intercept=True, we set model.fit_intercept to True; the second time the function is called, the model still has fit_intercept set to True.

This can be avoided by cloning the estimator when entering the test instead (sklearn.base.clone).
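A minimal sketch of the cloning approach, assuming scikit-learn's Lasso as a stand-in for the shared estimators in dict_estimators_ours; the helper name `configure_for_test` is hypothetical and only illustrates the pattern.

```python
from sklearn.base import clone
from sklearn.linear_model import Lasso

# Module-level estimator shared across tests. It stands in for an entry of
# dict_estimators_ours (assumption: the real entries are skglm estimators).
shared_estimator = Lasso(alpha=0.1, fit_intercept=False)


def configure_for_test(estimator, fit_intercept):
    # clone builds a new unfitted estimator with the same parameters, so the
    # set_params call below cannot leak into the shared object.
    local_estimator = clone(estimator)
    local_estimator.set_params(fit_intercept=fit_intercept)
    return local_estimator


with_intercept = configure_for_test(shared_estimator, fit_intercept=True)
```

With this pattern the order in which the parametrized cases run no longer matters, because every case starts from the parameters defined at module level.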

skglm/tests/test_intercept.py

…ntercept

PABannier

LGTM! For the documentation, I don't feel able to write it for the intercept fitting right now, as I don't fully understand what's going on. @Badr-MOUFAD do you want to give it a try? A few lines would suffice.

Badr-MOUFAD

I did a second pass. I think we are almost done.

Some minor comments.

skglm/datafits/single_task.py

skglm/solvers/cd_solver.py

skglm/solvers/group_bcd_solver.py

skglm/solvers/multitask_bcd_solver.py

Badr-MOUFAD · 2022-08-30T23:32:57Z

skglm/tests/test_group.py

+                       fit_intercept=True, tol=1e-12)
+    model.fit(X, y)
+
+    np.testing.assert_allclose(model.coef_, w[:X.shape[1]], atol=1e-5)


Agree!
Can we also test w[-1] against intercept_ when fit_intercept=True?
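The extra check could look roughly like the sketch below. It is not the skglm test: scikit-learn's LinearRegression stands in for the fitted model, and a least-squares solve on an augmented design stands in for the reference solution w whose last entry is the intercept.

```python
import numpy as np
from sklearn.linear_model import LinearRegression

rng = np.random.default_rng(0)
n_samples, n_features = 40, 6
X = rng.standard_normal((n_samples, n_features))
y = X @ rng.standard_normal(n_features) + 2.5
y += 0.01 * rng.standard_normal(n_samples)

# Reference solution laid out like the solver output: coefficients first,
# intercept in the last entry (obtained here from a column of ones).
X_aug = np.hstack([X, np.ones((n_samples, 1))])
w, *_ = np.linalg.lstsq(X_aug, y, rcond=None)

model = LinearRegression(fit_intercept=True)
model.fit(X, y)

# Existing check on the coefficients, plus the suggested intercept check.
np.testing.assert_allclose(model.coef_, w[:X.shape[1]], atol=1e-5)
np.testing.assert_allclose(model.intercept_, w[-1], atol=1e-5)
```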

skglm/utils.py

skglm/tests/test_group.py

Badr-MOUFAD · 2022-08-31T11:55:46Z

skglm/solvers/cd_solver.py

@@ -268,19 +285,27 @@ def cd_solver(
        for epoch in range(max_epochs):


@mathurinm, I don't know whether it's worth checking intercept optimality when solving subproblems. WDYT?
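For reference, one way such a check could be folded into a stopping criterion is sketched below. It assumes a quadratic datafit, where the optimality violation for the intercept is the absolute mean residual; the function names are hypothetical and this is not the code of cd_solver.

```python
import numpy as np


def intercept_violation(y, Xw):
    # Assumption: quadratic datafit 1/(2n) * ||y - Xw||^2, whose derivative
    # with respect to the intercept is mean(Xw - y). It vanishes at optimum.
    return abs(np.mean(Xw - y))


def stopping_criterion(coef_violations, y, Xw, fit_intercept):
    # Largest per-coefficient violation, optionally combined with the
    # violation of the intercept optimality condition.
    crit = np.max(coef_violations) if len(coef_violations) else 0.0
    if fit_intercept:
        crit = max(crit, intercept_violation(y, Xw))
    return crit


y = np.array([1.0, 2.0, 3.0])
coef_violations = np.array([1e-8, 3e-9])

# Predictions with a badly fitted intercept, then with the optimal one.
crit_bad = stopping_criterion(coef_violations, y, np.zeros(3), True)
crit_good = stopping_criterion(coef_violations, y, np.full(3, 2.0), True)
```

The check costs one pass over the residuals per evaluation, so the trade-off raised in the comment is mostly about how often the criterion is evaluated inside the subproblem loop.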

Fit intercept inside solver

6123648

mathurinm mentioned this pull request Aug 22, 2022

[READY] ENH add fit_intercept #19

Closed

Klopfe added 9 commits August 26, 2022 13:27

Merge branch 'main' of https://github.com/scikit-learn-contrib/skglm …

050015e

…into intercept

Fix examples and intercept_ in estimators

e57824f

fixed test_path

a012471

add fit_intercept to Multi task solver

ecbc0d7

add intercept update to multitask and group datafit

dbf14d9

shape problems with intercept and AA solved

517dd11

add fit intercept to bcd solver

c305733

add test_intercept to group_lasso tests

05e3d36

created test_intercept_mtl and fixed it

9dd776e