Comments (6)
Hey @oooo26, thanks for using LightGBM.

> Should I add the init_score to lgb prediction?

Yes.

> It seems useful for this "l2" objective, but still weird for e.g. "tweedie".

You should be able to match the outputs of `init_score=None` if you set it to the same value that LightGBM starts the boosting from, which is different for each objective. For l2 it is the mean of the target, but for tweedie it is the log of that:
LightGBM/src/objective/regression_objective.hpp, lines 468 to 470 in 28536a0
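As a rough sketch of what that means (my own illustration, not LightGBM's code; the library's actual `BoostFromScore` also accounts for sample weights), the per-objective starting constant could look like:

```python
import numpy as np

# Simplified sketch of the constant LightGBM starts boosting from.
def boost_from_average(y, objective):
    if objective == "l2":
        return np.mean(y)          # l2: the mean of the target
    if objective == "tweedie":
        return np.log(np.mean(y))  # tweedie: the log of that mean (raw-score scale)
    raise ValueError(f"unhandled objective: {objective}")

y = np.array([1.0, 2.0, 3.0, 6.0])
print(boost_from_average(y, "l2"))       # 3.0
print(boost_from_average(y, "tweedie"))  # log(3) ~= 1.0986
```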
Should I add the init_score to lgb prediction? It seems useful for this "l2" objective, but still weird for e.g. "tweedie".
```python
import pandas as pd
import lightgbm as lgb
from sklearn import datasets
from sklearn.linear_model import LinearRegression

diabetes = datasets.load_diabetes()
n, p = diabetes["data"].shape

def lgb_model(init_score):
    train_set = lgb.Dataset(
        data=diabetes["data"],
        label=diabetes["target"],
        feature_name=diabetes["feature_names"],
        init_score=init_score,
    )
    params = {
        "objective": "tweedie",  # change to tweedie
        "verbosity": -1,
        "seed": 0,
    }
    model = lgb.train(params, train_set)
    predictions = model.predict(diabetes["data"])
    if init_score is not None:
        predictions += init_score  # add init_score
    return predictions

init_model = LinearRegression().fit(diabetes["data"], diabetes["target"])
results = pd.DataFrame(
    {
        "target": diabetes["target"],
        "init - None": lgb_model(None),
        "init - mean": lgb_model([diabetes["target"].mean()] * n),
        "init - linear": lgb_model(init_model.predict(diabetes["data"])),
    }
)
print(results.head())
```
```
   target  init - None  init - mean  init - linear
0   151.0   157.811490   152.952215     207.116677
1    75.0    75.164031   152.952215      69.071033
2   141.0   140.120412   152.952215     177.882790
3   206.0   223.799875   152.952215     167.914458
4   135.0   119.464777   152.952215     129.462258
```
Thank you for helping! Let me double check: for tweedie, I should set init_score to the log scale of the initial predictions and, after fitting, still add them back on the normal scale?
Oh I see, it should be like this:
```python
import numpy as np

def lgb_model(init_score):
    train_set = lgb.Dataset(
        data=diabetes["data"],
        label=diabetes["target"],
        feature_name=diabetes["feature_names"],
    )
    if init_score is not None:
        train_set.set_init_score(np.log(init_score))
    params = {
        "objective": "tweedie",
        "verbosity": -1,
        "seed": 0,
    }
    model = lgb.train(params, train_set)
    predictions = model.predict(diabetes["data"])
    if init_score is not None:
        # go back to the log (raw) scale, add the init_score, then map back
        predictions = np.exp(np.log(predictions) + train_set.init_score)
    return predictions
```
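As an illustrative aside (my own check, not from the thread): because exponentiating a sum of logs is the same as multiplying on the original scale, the back-transform above is simply a multiplicative shift by `exp(init_score)`:

```python
import numpy as np

# exp(log(pred) + init) == pred * exp(init), elementwise.
pred = np.array([2.0, 5.0, 8.0])
init = np.log(np.array([1.5, 1.5, 1.5]))  # init_score on the log (raw) scale

a = np.exp(np.log(pred) + init)
b = pred * np.exp(init)
print(np.allclose(a, b))  # True
print(a)                  # [ 3.   7.5 12. ]
```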
It is a bit complex, I think... but anyway, thank you!
Yes, some objectives have a `ConvertOutput` method, which is used to translate the raw scores (which the model is trained to predict) into the actual scores (the ones we're interested in). In the case of Tweedie it's the exp (as you've correctly done):
LightGBM/src/objective/regression_objective.hpp, lines 460 to 462 in 5dfe716
So another alternative would be to always predict the raw score, add the init score (which will be on the same "scale"), and then apply that transformation. The following would be equivalent, but maybe easier, since for each objective you'd only need to apply the `ConvertOutput` method:
```python
convert_output = np.exp  # for tweedie and poisson
predictions = model.predict(diabetes["data"], raw_score=True)
if init_score is not None:
    predictions += init_score
return convert_output(predictions)
```
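To show the shape of that approach end to end, here is a self-contained sketch. `DummyBooster` and `predict_with_init` are hypothetical names of my own: the dummy stands in for a trained `lgb.Booster` (whose `predict` does accept `raw_score=True`) so the example runs without LightGBM installed:

```python
import numpy as np

class DummyBooster:
    """Stand-in for a trained lgb.Booster; its trees contribute nothing."""
    def predict(self, X, raw_score=False):
        raw = np.zeros(len(X))
        return raw if raw_score else np.exp(raw)

def predict_with_init(model, X, init_score=None, convert_output=np.exp):
    preds = model.predict(X, raw_score=True)  # raw scores
    if init_score is not None:
        preds = preds + init_score            # init_score is on the same raw scale
    return convert_output(preds)              # e.g. exp for tweedie/poisson

X = np.ones((3, 2))
init = np.log(np.array([10.0, 20.0, 30.0]))  # log of the initial predictions
print(predict_with_init(DummyBooster(), X, init))  # [10. 20. 30.]
```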
Closing, since I believe the issue has been solved; feel free to reopen if you run into any more problems.