我有 Dataframe df
,我选择了一些数据框,我想根据名为 Sevrice 的库将它们分成 xtrain 和 xtest。这样原始的 1 和 o 进入 xtrain 和 nan 进入 xtest。
Service
1
0
0
1
Nan
Nan
xtarin = df.loc[df['Service'].notnull(), ['Age','Fare', 'GSize','Deck','Class', 'Profession_title' ]]
已编辑
ytrain = df['Service'].dropna()
Xtest=df.loc[df['Service'].isnull(),['Age','Fare','GSize','Deck','Class','Profession_title']]
import pandas as pd
from sklearn.linear_model import LogisticRegression
logistic = LogisticRegression()
logistic.fit(xtrain, ytrain)
logistic.predict(xtest)
我收到此错误logistic.predict(xtest)
X has 220 features per sample; expecting 307