What does `booster.refit` actually do?

I am looking for a reference on the inner workings of `refit` that goes beyond "refit uses new data to update tree leaf values, keeping the tree's structure intact". How are the tree leaf values updated? Is this documented somewhere in detail?

I am aware of [#3003](https://github.com/microsoft/LightGBM/issues/3003), https://github.com/microsoft/LightGBM/issues/1473, https://github.com/microsoft/LightGBM/issues/1529.

My understanding, from [here](https://github.com/microsoft/LightGBM/blob/67d3e7f0129851e74a87941296a7d83f74bee959/src/boosting/gbdt.cpp#L258-L296) and [here](https://github.com/microsoft/LightGBM/blob/3fad53bc587acc634592ee168aa59ca21cba83f4/src/treelearner/serial_tree_learner.cpp#L247-L280) is the following (pseudo-code):

```python

class Booster():
    def init(self, trees: List[tree], params):
        self.trees = trees
        self.params = params

    def get_grad(self, y, f):
        if self.params["objective"] == "regression":
            return y - f
        elif self.params["objective"] == "classification":
            return 1 / (1 + np.exp(-f)) - y

    def refit(self, X, y):
        f = np.zeros_like(y)  # or some init_model
        decay_rate = self.params["decay_rate"]

        for tree in self.trees:
            grad = self.get_grad(y, f)

            leaf_indices = tree.get_leaf_indices(X)

            for leaf_index in tree.leaf_indices:
                old_leaf_value = tree.get_leaf_value(leaf_index)
                new_leaf_value = np.mean(grad[leaf_indices == leaf_index])
                tree.set_leaf_value(
                    leaf_index,
                    decay_rate * old_leaf_value + (1 - decay_rate) * new_leaf_value
                )

            f += self.params["learning_rate"] * tree.predict(X)
```

Is this correct? To me it seems that this would be in contradiction with https://github.com/microsoft/LightGBM/issues/5609#issuecomment-1342172997:
> but the refit method updates all the trees in one go. 

It would be nice if this mechanic was documented somewhere in more details, to be referenced when using the `refit` method.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

What does `booster.refit` actually do? #6838

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

What does booster.refit actually do? #6838

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions

What does `booster.refit` actually do? #6838