我有一个多元线性回归问题,其中每个数据点如下所示:
y_i = 3 # Some integer between 0 and 20
X_i = [0.5, 80, 0.004, 0.5, 0.789] # A 5 dimensional vector
我可以使用 sklearn 训练一个简单的线性模型,例如:
from sklearn import linear_model
ols = linear_model.LinearRegression()
model = ols.fit(X, y)
这使我的准确率达到了约 55%(线性模型不适合该问题,但这是证明对问题建模的可行性的基线,也是我学习 PyTorch 的一种方式,之前使用过 TensorFlow)。
当我尝试使用 PyTorch 训练线性模型时,我将模型定义为:
class TwoLayerNet(torch.nn.Module):
def __init__(self, D_in, D_out):
super(TwoLayerNet, self).__init__()
self.linear1 = torch.nn.Linear(D_in, D_out)
def forward(self, x):
y_pred = self.linear1(x)
return y_pred
D_in, D_out = 5, 1
model = TwoLayerNet(D_in, D_out)
并培训为:
epochs = 10
criterion = torch.nn.MSELoss(reduction='sum')
optimizer = torch.optim.SGD(model.parameters(), lr=1e-4)
for epoch in range(epochs):
for n, batch in enumerate(batches):
X = []
y = []
for values in batch:
X.append(values[0])
y.append(values[1])
X = torch.from_numpy(np.asarray(X))
y = torch.from_numpy(np.asarray(y))
# Forward pass: Compute predicted y by passing x to the model
optimizer.zero_grad()
y_pred = model(X)
# Compute and print loss
loss = criterion(y_pred, y)
if n % 100 == 99:
print(n, loss.item())
# Zero gradients, perform a backward pass, and update the weights.
loss.backward()
optimizer.step()
这只是我调整的 PyTorch 文档中的一些代码。当前设置仅达到约 25%,与我期望的线性模型的准确度相差甚远。我在模型训练 wrt PyTorch 中做错了什么吗?