machine-learning - 训练集和测试集的大小

Question

当训练集小于特定案例的测试集时会发生什么。使用这些类型的数据集进行预测时，什么是合适的方法。谁能给我一些介绍来管理这些情况以及每种学习算法将如何执行。

score 2 · Accepted Answer

你有什么顾虑？你期望从中产生什么坏处？验证太多？过度验证？

测试集不参与分类器的训练。

它仅用于允许您预测未来数据的分类器质量。

因此，如果您有一个大型测试集，您可能会更好地预测分类器的质量，仅此而已。

score 0 · Accepted Answer

One solution when you have a small training set and large test set is instead to combine the sets and use k-fold cross validation. But, ideally you'd accumulate a larger training set. Beware of using out-of-the-box machine learning on ill-suited problems.

machine-learning - 训练集和测试集的大小

2 回答 2

Related

Reference