python - 存储决策树代码/算法

Question

我有一个任务，我计划使用机器学习（特别是监督学习，例如决策树）。最终代码将在没有 scikit learn 或其他外部库的助教电脑上运行。

因此，我需要从头开始编写决策树分类器之类的东西，或者在本地使用外部库，并存储最终算法。

总结一下：当给定一组带标签的训练数据时，如何将最终算法存储在python代码中，以后不依赖外部库来运行最终算法？

例如，决策树可以分解为一系列 if/then 语句，我想生成这些 if/then 语句并存储它们，以便它可以在除了 python 之外没有安装任何东西的计算机上运行。

关于如何实现这一目标的最佳建议是什么。如果这是在错误的论坛，请告知。

score 0 · Accepted Answer

您可以使用 Python 随机森林包sklearn.ensemble，如下所示：

# Import the random forest package
from sklearn.ensemble import RandomForestClassifier 

# create a random forest object with 100 trees
forest = RandomForestClassifier(n_estimators = 100)

predictors = [[0, 0], [1, 1]]
response = [0, 1]

# fit the model to the training data
forest = forest.fit(predictors, response)

# you can reuse the forest model you built to make predictions
# on other data sets
test_data = [[0, 1], [1, 0]]
output = forest.predict(test_data)

请注意，我在此处导入，但如果您想在回归模式下运行随机森林，RandomForestClassifier您也可以使用。RandomForestRegressor

python - 存储决策树代码/算法

1 回答 1

Related

Reference