When running this data drift example notebook, I run into a problem when executing one specific cell:
exp = Experiment(ws, datadrift._id)
dd_run = Run(experiment=exp, run_id=run)
RunDetails(dd_run).show()
This produces the following traceback:
(...)
ImportError: cannot import name 'get_run_ids_and_metric_types_filter_expression'
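I believe there may be a version problem with this notebook. I am running AzureML SDK 1.0.60, and this example is taken from the 1.0.60 version of the notebook (at least the one on the master branch as of today).
Or is there something wrong with my environment?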
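To rule out a mismatch between the azureml packages in my local environment, something like the following could be used to list what is actually installed. This is only a sketch: `installed_versions` is a helper name I made up, and it assumes the relevant packages all have names starting with `azureml`.

```python
def installed_versions(prefix="azureml"):
    """Return {package name: version} for installed packages whose name starts with prefix."""
    try:
        # Available in the standard library from Python 3.8 onwards.
        from importlib.metadata import distributions
        pairs = [(d.metadata["Name"], d.version) for d in distributions()]
    except ImportError:
        # Fallback for older interpreters (e.g. Python 3.6), requires setuptools.
        import pkg_resources
        pairs = [(d.project_name, d.version) for d in pkg_resources.working_set]
    return {
        name: version
        for name, version in pairs
        if name and name.lower().startswith(prefix)
    }


# Print one line per matching package, sorted by name.
for name, version in sorted(installed_versions().items()):
    print(name, version)
```

If the listed packages do not all report the same version, that might explain an `ImportError` for a name that exists in one package version but not in another.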
I also noticed, by checking the run's output logs, that I am getting a traceback from the job itself:
The experiment failed. Finalizing run...
Traceback (most recent call last):
File "datadrift_run.py", line 173, in <module>
run.run(target_date)
File "datadrift_run.py", line 100, in run
drift_main(arguments_drift)
File "/mnt/batch/tasks/shared/LS_root/jobs/playground-olivier/azureml/13f371b5-1985-44c2-921c-fd66b0dbe852_1568646629244/mounts/workspacefilestore/azureml/13f371b5-1985-44c2-921c-fd66b0dbe852_1568646629244/_generate_script.py", line 363, in main
'datadrift_id': args.datadrift_id
File "/mnt/batch/tasks/shared/LS_root/jobs/playground-olivier/azureml/13f371b5-1985-44c2-921c-fd66b0dbe852_1568646629244/mounts/workspacefilestore/azureml/13f371b5-1985-44c2-921c-fd66b0dbe852_1568646629244/_generate_script.py", line 75, in _get_drift_metrics
diff_metrics = dsdo.run()
File "/azureml-envs/azureml_9a12ab39ef186b06eb543bbc347567d8/lib/python3.6/site-packages/azureml/data/_dataset_diff.py", line 840, in run
base_profile_metrics = get_dataprofile_metrics(self.base_datasetprofile, self.config)
File "/azureml-envs/azureml_9a12ab39ef186b06eb543bbc347567d8/lib/python3.6/site-packages/azureml/data/_dataset_diff.py", line 163, in get_dataprofile_metrics
column_type = column_type_classifier[(dp.columns[c].value_counts is None, dp.columns[c].histogram is None)]
KeyError: 'usaf'
The two errors are not related, but both are produced by the same notebook.
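On the second traceback: the failing line indexes `column_type_classifier` with a tuple of two booleans, so a `KeyError` whose value is the string `'usaf'` more likely comes from the `dp.columns[c]` lookup, i.e. the profile may have no entry for a column named `usaf`. The sketch below reproduces that pattern with plain dictionaries. Everything in it is a hypothetical stand-in (the classifier values, the `wban` column, the `classify` and `failing_key` helpers); it is not the actual code in `_dataset_diff.py`.

```python
from types import SimpleNamespace

# Hypothetical classifier keyed by (value_counts is None, histogram is None).
column_type_classifier = {
    (False, True): "categorical",
    (True, False): "numerical",
}

# Hypothetical profile that contains 'wban' but has no entry for 'usaf'.
profile_columns = {
    "wban": SimpleNamespace(value_counts=None, histogram=[1, 2, 3]),
}


def classify(columns, name):
    col = columns[name]  # raises KeyError(name) if the column is missing
    return column_type_classifier[(col.value_counts is None, col.histogram is None)]


def failing_key(columns, name):
    """Return the key carried by the KeyError, or None if the lookup succeeds."""
    try:
        classify(columns, name)
    except KeyError as err:
        return err.args[0]
    return None


print(classify(profile_columns, "wban"))
print(repr(failing_key(profile_columns, "usaf")))
```

In this sketch a missing column produces a `KeyError` carrying the column name, whereas a missing classifier entry would carry a tuple of booleans instead.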