python - 参数不会用于 scikit-learn GridSearchCV 中的自定义估计器

Question

我正在尝试将参数传递给 scikit learn 中的自定义估计器，但未能成功。我希望lr在网格搜索期间更改参数。问题是lr参数没有改变......

（原始代码对我不起作用）

任何带有自定义估计器的完整工作示例GridSearchCV，以及更改参数将不胜感激。

我在ubuntu18.10 使用scikit-learn0.20.2

from sklearn.model_selection import GridSearchCV
from sklearn.base import BaseEstimator, ClassifierMixin
import numpy as np

class MyClassifier(BaseEstimator, ClassifierMixin):

     def __init__(self, lr=0.1):
         # Some code
         print('lr:', lr)
         return self

     def fit(self, X, y):
         # Some code
         return self

     def predict(self, X):
         # Some code
         return X % 3

params = {
    'lr': [0.1, 0.5, 0.7]
}
gs = GridSearchCV(MyClassifier(), param_grid=params, cv=4)

x = np.arange(30)
y = np.concatenate((np.zeros(10), np.ones(10), np.ones(10) * 2))
gs.fit(x, y)

Terveisin，马库斯

score 4 · Accepted Answer

lr由于您在构造函数中打印，因此您无法看到值的变化。

如果我们在.fit()函数内部打印，我们可以看到lr值的变化。发生这种情况是因为创建不同的估算器副本的方式。请参阅此处以了解创建多个副本的过程。

from sklearn.model_selection import GridSearchCV
from sklearn.base import BaseEstimator, ClassifierMixin
import numpy as np

class MyClassifier(BaseEstimator, ClassifierMixin):

    def __init__(self, lr=0):
         # Some code
        print('lr:', lr)
        self.lr = lr

    def fit(self, X, y):
         # Some code
        print('lr:', self.lr)
        return self

    def predict(self, X):
         # Some code
         return X % 3

params = {
    'lr': [0.1, 0.5, 0.7]
}
gs = GridSearchCV(MyClassifier(), param_grid=params, cv=4)

x = np.arange(30)
y = np.concatenate((np.zeros(10), np.ones(10), np.ones(10) * 2))
gs.fit(x, y)
gs.predict(x)

输出：

lr: 0
lr: 0
lr: 0
lr: 0.1
lr: 0
lr: 0.1
lr: 0
lr: 0.1
lr: 0
lr: 0.1
lr: 0
lr: 0.5
lr: 0
lr: 0.5
lr: 0
lr: 0.5
lr: 0
lr: 0.5
lr: 0
lr: 0.7
lr: 0
lr: 0.7
lr: 0
lr: 0.7
lr: 0
lr: 0.7
lr: 0
lr: 0.1

python - 参数不会用于 scikit-learn GridSearchCV 中的自定义估计器

1 回答 1

Related

Reference