[ML] Add a constant to the prediction which minimises the unregularised loss for classification and regression by tveasey · Pull Request #1192 · elastic/ml-cpp

tveasey · 2020-05-05T13:00:17Z

We add on a constant weight to "centre" the data. (Strictly speaking this isn't centring the data in the conventional sense, it is finding a single weight which when added to the ensemble prediction minimises the loss.)

Currently, we choose to minimise regularised loss with this weight, i.e. respecting the weight shrinkage. First, there is no need to do this, shrinkage is used to impose a "smoothness" bias, but a constant function is flat. Second, the subsequent trees spend effort updating the mean predictions to be unbiased and means there is an unfortunate interplay between the degree of smoothing we can use (since it will create bias in the unregularised loss) and the centre of the data.

valeriy42

LGTM. A good idea of estimating a constant prior to fitting a non-linear function.

…ed loss for classification and regression (elastic#1192)

…larised loss for classification and regression (#1194) Backport #1192.

Really 'centre' the data before training in earnest

beb5223

tveasey added >enhancement review v8.0.0 :ml/DataFrameAnalysis v7.8.0 labels May 5, 2020

tveasey requested a review from valeriy42 May 5, 2020 13:00

valeriy42 approved these changes May 5, 2020

View reviewed changes

tveasey added 2 commits May 5, 2020 14:34

Docs

a9999b7

Test fallout

498d8e8

tveasey merged commit 2d24c58 into elastic:master May 5, 2020

tveasey deleted the centre-data-before-training branch May 5, 2020 16:31

tveasey added a commit to tveasey/ml-cpp-1 that referenced this pull request May 5, 2020

[ML] Add a constant to the prediction which minimises the unregularis…

76f4b4e

…ed loss for classification and regression (elastic#1192)

tveasey mentioned this pull request May 5, 2020

[7.8][ML] Add a constant to the prediction which minimises the unregularised loss for classification and regression #1194

Merged

tveasey added a commit to tveasey/ml-cpp-1 that referenced this pull request May 5, 2020

[ML] Add a constant to the prediction which minimises the unregularis…

f66c7fd

…ed loss for classification and regression (elastic#1192)

tveasey added a commit that referenced this pull request May 5, 2020

[7.8][ML] Add a constant to the prediction which minimises the unregu…

2ec623a

…larised loss for classification and regression (#1194) Backport #1192.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[ML] Add a constant to the prediction which minimises the unregularised loss for classification and regression#1192

[ML] Add a constant to the prediction which minimises the unregularised loss for classification and regression#1192
tveasey merged 3 commits intoelastic:masterfrom
tveasey:centre-data-before-training

tveasey commented May 5, 2020 •

edited

Loading

Uh oh!

valeriy42 left a comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

tveasey commented May 5, 2020 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

valeriy42 left a comment

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

tveasey commented May 5, 2020 •

edited

Loading