我有一个使用 Azure ML 工作区的 autoML 功能训练的 StackEnsemble 模型。当我尝试将其部署为 Web 服务时,我收到如下错误 (CrashLoopBackOff)。现在,我强烈怀疑它与模型本身/它需要的依赖关系有关。当我将 score.py 中的模型名称交换为另一个模型名称时,它不是 StackEnsemble(带有缩放器),而只是一个普通的 XGBoost,然后创建的服务没有问题。
我有以下问题: - 我如何找出 StackEnsemble 中的模型/算法,以便正确构建容器/依赖项列表?- 有什么方法可以找出那里的实际错误是什么?我的意思是除了创建我的本地容器并在那里调试它......我尝试根据文档使用 service.get_logs() 获取日志,但那里什么都没有,只有最后 5 行没有指出任何问题。
请指教。
WebserviceException: Service deployment polling reached non-successful terminal state, current service state: Failed
Error:
{
"code": "AciDeploymentFailed",
"message": "Aci Deployment failed with exception: Your container application crashed. This may be caused by errors in your scoring file's init() function.\nPlease check the logs for your container instance: classifier-bwp-ls5923-v1. From the AML SDK, you can run print(service.get_logs()) if you have service object to fetch the logs. \nYou can also try to run image mlws219f9669.azurecr.io/classifier-bwp-ls5923-v1:4 locally. Please refer to http://aka.ms/debugimage#service-launch-fails for more information.",
"details": [
{
"code": "CrashLoopBackOff",
"message": "Your container application crashed. This may be caused by errors in your scoring file's init() function.\nPlease check the logs for your container instance: classifier-bwp-ls5923-v1. From the AML SDK, you can run print(service.get_logs()) if you have service object to fetch the logs. \nYou can also try to run image mlws219f9669.azurecr.io/classifier-bwp-ls5923-v1:4 locally. Please refer to http://aka.ms/debugimage#service-launch-fails for more information."
}
]
}