7

有没有办法在 python sklearn 上的 train_test_split 上设置种子。我已将参数设置random_state为整​​数,但仍然无法重现结果。

提前致谢。

4

2 回答 2

11
from sklearn.model_selection import train_test_split
x = [k for k in range(0, 10)]
y = [k for k in range(0, 10)]
x_train, x_test, y_train, y_test = train_test_split(x, y, test_size=0.4, random_state=11)
print (x_train)
x_train, x_test, y_train, y_test = train_test_split(x, y, test_size=0.4, random_state=11)
print (x_train)
x_train, x_test, y_train, y_test = train_test_split(x, y, test_size=0.4, random_state=11)
print (x_train)

每次我拆分数据时,上面的代码都会为 x_train 产生相同的结果。随机性可能在您的数据框中,而不是 train_test_split。

于 2019-05-16T15:07:30.950 回答
0

只需在 中train_test_split,指定参数,random_state=some_number_you_wan to use,random_state=42

于 2022-02-23T08:34:58.383 回答