Fix Segmentation Fault and `ZeroDivisionError` in Group Lasso #292

Badr-MOUFAD · 2024-03-29T17:34:04Z

Context

I ran the skglm unit tests and got an error coming from celer, namely a segmentation fault.
The tests were run on a Mac. Surprisingly, the problem doesn't arise on Linux (no errors in the skglm CI).

Contributions of the PR

After debugging, the segmentation fault (an out-of-bounds index) turned out to come from an uninitialized variable.
Other errors, due to changes of scikit-learn API, were fixed as well.

Badr-MOUFAD · 2024-03-29T17:37:31Z

celer/group_fast.pyx

@@ -418,8 +418,11 @@ cpdef celer_grp(
                                    &inc) / lc_groups[g]
                    norm_wg += w[j] ** 2
                norm_wg = sqrt(norm_wg)
-                bst_scal = max(0.,
+                if norm_wg != 0.:
+                    bst_scal = max(0.,


I'm perhaps lacking context about the code; in particular, I don't know the purpose of bst_scal.

I just followed common sense to handle the case where the norm of w is zero.

bst_scal stands for block soft-thresholding (BST) scaling. The formula for the BST of wg at level lambda is:
wg * max(0, 1 - lambda / norm(wg)), i.e. 0 if norm(wg) <= lambda, and (1 - lambda / norm(wg)) * wg otherwise.

I'm curious, how do you end up with a vanishing wg after a gradient step? With probability 1 this should not happen, so I'm guessing X_g is a zero group?
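
For illustration only, the BST formula above can be sketched in pure Python with an explicit guard for the zero-norm case. This is not the Cython code from celer/group_fast.pyx: the function name `block_soft_threshold` and its arguments are hypothetical, and the actual solver may scale the threshold differently.

```python
import math


def block_soft_threshold(w_g, lam):
    """Block soft-thresholding of the vector w_g at level lam (hypothetical helper)."""
    norm_wg = math.sqrt(sum(x * x for x in w_g))
    if norm_wg == 0.0:
        # Guard: a zero block stays zero, and we avoid dividing by zero.
        return [0.0] * len(w_g)
    # Shrink the whole block toward zero; it vanishes when norm_wg <= lam.
    scale = max(0.0, 1.0 - lam / norm_wg)
    return [scale * x for x in w_g]
```

Without the guard, a zero block would trigger a division by zero in `lam / norm_wg`, which is the situation the `if norm_wg != 0.:` check in the diff is meant to avoid.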

mathurinm · 2024-03-29T18:02:53Z

celer/group_fast.pyx

@@ -89,7 +89,7 @@ cpdef floating dnorm_grp(

    else:  # scaling only with features in C
        for g_idx in range(ws_size):
-            if weights[g] == INFINITY:
+            if weights[g_idx] == INFINITY:
                continue

            g = C[g_idx]


Are you sure it's not g that should be used here? g is defined at L95, so its definition should be moved above this check. weights has size n_groups, not ws_size.

Yes, you are right.
We can insert line 95 right after line 91 and keep the rest as it is.
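
A minimal pure-Python sketch of the indexing pattern agreed on here, assuming the loop computes a maximum of per-group norms scaled by the weights (the helper name `max_scaled_norm` and the `group_norms` argument are hypothetical, not part of celer). The point is only that the group index g must be looked up from the working set C before it is used to index weights:

```python
import math


def max_scaled_norm(group_norms, weights, C):
    """Largest group_norms[g] / weights[g] over the groups g listed in C.

    group_norms and weights have one entry per group (n_groups entries),
    while C holds the indices of the groups in the working set (ws_size entries).
    """
    best = 0.0
    for g_idx in range(len(C)):
        g = C[g_idx]  # look up the group index first
        if weights[g] == math.inf:
            continue  # infinitely penalized group: skip it
        best = max(best, group_norms[g] / weights[g])
    return best
```

Indexing weights with g_idx instead of g would read the weight of the wrong group whenever C is not the identity mapping.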

codecov-commenter · 2024-03-30T12:12:04Z

Codecov Report

All modified and coverable lines are covered by tests ✅

Project coverage is 88.53%. Comparing base (c1b564c) to head (138bd97).

Additional details and impacted files

@@           Coverage Diff           @@
##             main     #292   +/-   ##
=======================================
  Coverage   88.53%   88.53%           
=======================================
  Files          15       15           
  Lines        1143     1143           
  Branches      127      127           
=======================================
  Hits         1012     1012           
  Misses        101      101           
  Partials       30       30

Flag	Coverage Δ
unittests	`?`

Files	Coverage Δ
celer/dropin_sklearn.py	`95.95% <ø> (ø)`
celer/tests/test_mtl.py	`100.00% <ø> (ø)`

Δ = absolute <relative> (impact), ø = not affected, ? = missing data

Badr-MOUFAD · 2024-03-30T12:16:22Z

@mathurinm, don't mind the 10-file diff; it's just a GitHub glitch.

mathurinm · 2024-04-02T07:28:38Z

@Badr-MOUFAD my solution in that case is to merge main into the branch. The diff is now reduced to 3 files.

mathurinm · 2024-04-08T15:03:54Z

Thanks @Badr-MOUFAD

Badr-MOUFAD added 3 commits March 29, 2024 18:18

fix index & divide by zero

dd40914

provide kwarg to _preprocess_data

cd77bb6

fit_intercept as bool

afaf973

Badr-MOUFAD commented Mar 29, 2024

CI trigger

629cee7

mathurinm reviewed Mar 29, 2024

Badr-MOUFAD and others added 12 commits March 30, 2024 12:24

fix index group

028ec44

debug circle ci

b988f1a

print pwd

169be60

print pwd

a87fbf8

content of dir

2e89487

MNT - remove requirements files (mathurinm#285)

de6133c

MNT cosmit readme (mathurinm#287)

265d2c9

CI trigger

71f8dc8

debug circle ci

a6de333

print pwd

69372b8

content of dir

32309a1

rm commands circle-ci

c4d6a43

Merge branch 'main' of github.com:mathurinm/celer into fix-grp

138bd97

mathurinm approved these changes Apr 2, 2024

mathurinm approved these changes Apr 8, 2024

mathurinm merged commit 1df21cf into mathurinm:main Apr 8, 2024
6 checks passed
