1

有人可以解释当您拆分数据集进行测试和训练时会发生什么吗?

4

2 回答 2

2

简而言之,您的数据挖掘模型的准确性是通过根据您的训练集进行预测来评估的,该训练集的结果在测试集中是已知的。

有关数据挖掘模型测试和验证的更多信息 (MSDN)

于 2013-04-04T08:41:08.623 回答
0

为了能够测试您构建的预测分析模型,您需要将数据集分成两组:训练数据集和测试数据集。这些数据集应该是随机选择的,并且应该是实际人口的良好代表。

Similar data should be used for both the training and test datasets.

Normally the training dataset is significantly larger than the test dataset.

Using the test dataset helps you avoid errors such as overfitting.

The trained model is run against test data to see how well the model will perform.

更多信息

于 2017-07-14T22:10:44.337 回答