Sklearn's coordinate descent solver computes and uses the `dual_gap` as part of its stopping criterion, while ours does not. This causes a few problems:
- In the `cuml.accel` layer, our exposed estimators lack a `dual_gap_` fitted attribute, exposing the difference between the primal and the dual objectives at the end of the fit (a minimal repro with stock sklearn is included at the bottom of this issue). We tried to patch around this in Support `dual_gap_` on `ElasticNet` & `Lasso` #6714, but it came with a performance cost and didn't make sense if the computed value wasn't being used by the solver.
- The meaning of the `tol` parameter differs between solvers. Currently we work around that by scaling `tol` when converting to/from sklearn's parameters (see the sketch below for what `tol` bounds in sklearn). If our solver had a similar stopping criterion we wouldn't need to do this, and could better guarantee functionally equivalent results.
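
For reference, here's a minimal NumPy sketch of the quantity in question, for the Lasso case and following the objective convention in sklearn's `cd_fast` (where `alpha` is the user-facing alpha pre-scaled by `n_samples`); `lasso_dual_gap` is an illustrative helper, not an existing API:

```python
import numpy as np

def lasso_dual_gap(X, y, w, alpha):
    """Duality gap for P(w) = 0.5 * ||y - X @ w||^2 + alpha * ||w||_1."""
    R = y - X @ w                        # residual
    dual_norm = np.max(np.abs(X.T @ R))  # ||X^T R||_inf

    # Rescale the residual so the dual point nu = const * R is feasible,
    # i.e. ||X^T nu||_inf <= alpha (mirrors sklearn's cd_fast).
    const = 1.0 if dual_norm <= alpha else alpha / dual_norm
    R_norm2 = R @ R

    # gap = P(w) - D(const * R), with D the Lasso dual objective
    gap = 0.5 * (1.0 + const ** 2) * R_norm2
    gap += alpha * np.abs(w).sum() - const * (R @ y)
    return gap
```

In sklearn's solver the loop exits once `gap < tol * np.dot(y, y)`, so `tol` there is relative to the scale of `y`; a solver that instead checks the size of coefficient updates gives `tol` a different meaning, hence the scaling workaround above.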
Since we'll need to look into the solver anyway in #6736, I think we should consider making this change to improve compatibility with sklearn's `ElasticNet`/`Lasso` implementations.
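
For concreteness, the fitted attribute in question with stock sklearn (synthetic data, illustrative only):

```python
import numpy as np
from sklearn.linear_model import Lasso

rng = np.random.default_rng(42)
X = rng.normal(size=(100, 20))
y = X[:, 0] - 2 * X[:, 1] + 0.1 * rng.normal(size=100)

est = Lasso(alpha=0.1).fit(X, y)
print(est.dual_gap_)  # present with stock sklearn; missing under cuml.accel
```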