I am new to HTCondor. After I submitted a job through PuTTY, the job was put on hold, and the log file shows the following error:
007 (274036.000.000) 06/28 23:12:08 Shadow exception!
Error from slot1@hana-******: Failed to execute '/var/lib/condor/execute/dir_107122/condor_exec.exe': (errno=8: 'Exec format error')
0 - Run Bytes Sent By Job
16520 - Run Bytes Received By Job
...
012 (274036.000.000) 06/28 23:12:08 Job was held.
Error from slot1@hana-2.*******: Failed to execute '/var/lib/condor/execute/dir_107122/condor_exec.exe': (errno=8: 'Exec format error')
Code 6 Subcode 8
...
Here is my submit description file:
# defining the program
cmd = /***/my_prog.ipynb
# defining the parameters
#args = /***/model_1/data_1.csv
#args = /***/model_2/data_2.csv
#args = /***/model_3/data_3.csv
#args = /***/model_4/data_4.csv
# defining memory requirements
request_memory = 100 GB
# defining to use 1 CPU
request_cpus = 100
# defining the max. run time of the job
+REQUESTED_WALLTIME = 72
# defining the mail address for notification
notify_user = ***@**.com
notification = Always
should_transfer_files = if_needed
when_to_transfer_output = on_exit
output = $(Cluster)-$(Process)_job.out
error = $(Cluster)-$(Process)_job.error
log = $(Cluster)-$(Process)_job.log
queue 1
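In this case, my_prog.ipynb itself contains code that reads four local files as input, and it also contains "print" statements that write the results to local files.
Given this, I am really confused about whether I should also configure the arguments in the submit description file.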
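To make the question about arguments concrete, here is a minimal sketch of what the "arguments" variant might look like. It is not my current code. It assumes the notebook were exported to a plain Python script (the name my_prog.py, the function count_rows, and the row-counting logic are all hypothetical placeholders) that receives the input paths from the command line, which is where the values from the commented-out `args` lines in the submit file would arrive.

```python
#!/usr/bin/env python3
# Hypothetical my_prog.py: a stand-in for the notebook logic, NOT the real program.
import csv
import sys


def count_rows(csv_path):
    """Read one CSV file and return the number of data rows (header excluded)."""
    with open(csv_path, newline="") as handle:
        reader = csv.reader(handle)
        next(reader, None)  # skip the header line, if there is one
        return sum(1 for _ in reader)


def main(paths):
    """Process every path given on the command line; return a shell exit code."""
    if not paths:
        print("usage: my_prog.py <data.csv> [more.csv ...]", file=sys.stderr)
        return 1
    for csv_path in paths:
        # Placeholder for the real work done per input file.
        print(f"{csv_path}: {count_rows(csv_path)} rows")
    return 0


# Only run when input paths were actually passed on the command line.
if __name__ == "__main__" and len(sys.argv) > 1:
    main(sys.argv[1:])
```

With a script of this shape, each path listed after the script name would show up in `sys.argv[1:]`, and anything printed would go to the file named by `output` in the submit description file.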
Please give me any hint; this problem has had me stuck for 3 days...
Any suggestions would be greatly appreciated!
Thanks!