Skip to content

[7.x][ML] Gain upper bound estimation for classification and regression #1568

Merged
valeriy42 merged 4 commits intoelastic:7.xfrom
valeriy42:backport-pr1537
Nov 12, 2020
Merged

[7.x][ML] Gain upper bound estimation for classification and regression #1568
valeriy42 merged 4 commits intoelastic:7.xfrom
valeriy42:backport-pr1537

Conversation

@valeriy42
Copy link
Copy Markdown
Contributor

In this PR we start computing an upper bound on the potential gain from splitting a node. If the upper bound of the gain is lower than the currently smallest gain among all candidates, we ignore the node and this way prevent computations that are especially expensive on the large datasets.

Since we avoid computation of the splits that we wouldn't be added to the tree anyway, this PR does not change the qualitative results.

At the moment, we can only compute the upper bound for regression and binary classification. For multiclass classification we proceed as before.

Note: this PR contains additional instrumentation to assess the performance improvement. I will remove this instrumentation in a follow-up PR after tests.

Backport of #1537

…lastic#1537)

In this PR we start computing an upper bound on the potential gain from splitting a node. If the upper bound of the gain is lower than the currently smallest gain among all candidates, we ignore the node and this way prevent computations that are especially expensive on the large datasets.

Since we avoid computation of the splits that we wouldn't be added to the tree anyway, this PR does not change the qualitative results.

At the moment, we can only compute the upper bound for regression and binary classification. For multiclass classification we proceed as before.

Note: this PR contains additional instrumentation to assess the performance improvement. I will remove this instrumentation in a follow-up PR after tests.
@valeriy42 valeriy42 merged commit ba4e091 into elastic:7.x Nov 12, 2020
@valeriy42 valeriy42 deleted the backport-pr1537 branch November 12, 2020 16:12
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant