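Oozie SSH action issue:
Issue: We are trying to run a few commands on a particular host of the cluster, and chose the SSH action for this. We have been facing this SSH issue for some time now. What might be the real cause here? Please point me towards a solution.
Logs: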
AUTH_FAILED: Not able to perform operation [ssh -o PasswordAuthentication=no -o KbdInteractiveDevices=no -o StrictHostKeyChecking=no -o ConnectTimeout=20 USER@1.2.3.4 mkdir -p oozie-oozi/0000000-131008185935754-oozie-oozi-W/action1--ssh/ ] | ErrorStream: Warning: Permanently added '1.2.3.4' (RSA) to the list of known hosts. Permission denied (publickey,gssapi-keyex,gssapi-with-mic,password).
org.apache.oozie.action.ActionExecutorException: AUTH_FAILED: Not able to perform operation [ssh -o PasswordAuthentication=no -o KbdInteractiveDevices=no -o StrictHostKeyChecking=no -o ConnectTimeout=20 user@1.2.3.4 mkdir -p oozie-oozi/0000000-131008185935754-oozie-oozi-W/action1--ssh/ ] | ErrorStream: Warning: Permanently added '1.2.3.4,192.168.34.208' (RSA) to the list of known hosts. Permission denied (publickey,gssapi-keyex,gssapi-with-mic,password).
at org.apache.oozie.action.ssh.SshActionExecutor.execute(SshActionExecutor.java:589)
at org.apache.oozie.action.ssh.SshActionExecutor.start(SshActionExecutor.java:204)
at org.apache.oozie.command.wf.ActionStartXCommand.execute(ActionStartXCommand.java:211)
at org.apache.oozie.command.wf.ActionStartXCommand.execute(ActionStartXCommand.java:59)
at org.apache.oozie.command.XCommand.call(XCommand.java:277)
at org.apache.oozie.service.CallableQueueService$CompositeCallable.call(CallableQueueService.java:326)
at org.apache.oozie.service.CallableQueueService$CompositeCallable.call(CallableQueueService.java:255)
at org.apache.oozie.service.CallableQueueService$CallableWrapper.run(CallableQueueService.java:175)
at java.util.concurrent.ThreadPoolExecutor$Worker.runTask(ThreadPoolExecutor.java:886)
at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:908)
at java.lang.Thread.run(Thread.java:662)
Caused by: java.io.IOException: Not able to perform operation [ssh -o PasswordAuthentication=no -o KbdInteractiveDevices=no -o StrictHostKeyChecking=no -o ConnectTimeout=20 user@1.2.3.4 mkdir -p oozie-oozi/0000000-131008185935754-oozie-oozi-W/action1--ssh/ ] | ErrorStream: Warning: Permanently added '1.2.3.4,1.2.3.4' (RSA) to the list of known hosts. Permission denied (publickey,gssapi-keyex,gssapi-with-mic,password).
at org.apache.oozie.action.ssh.SshActionExecutor.executeCommand(SshActionExecutor.java:340)
at org.apache.oozie.action.ssh.SshActionExecutor.setupRemote(SshActionExecutor.java:373)
at org.apache.oozie.action.ssh.SshActionExecutor$1.call(SshActionExecutor.java:206)
at org.apache.oozie.action.ssh.SshActionExecutor$1.call(SshActionExecutor.java:204)
at org.apache.oozie.action.ssh.SshActionExecutor.execute(SshActionExecutor.java:547)
... 10 more
2013-10-09 12:48:25,982 WARN org.apache.oozie.command.wf.ActionStartXCommand: USER[user] GROUP[-] TOKEN[] APP[Test] JOB[0000000-131008185935754-oozie-oozi-W] ACTION[0000000-131008185935754-oozie-oozi-W@action1] Suspending Workflow Job id=0000000-131008185935754-oozie-oozi-W
2013-10-09 12:48:27,204 WARN org.apache.oozie.command.coord.CoordAction: USER[user] GROUP[-] TOKEN[] APP[Test] JOB[0000000-131008185935754-oozie-oozi-W] ACTION[0000000-131008185935754-oozie-oozi-W@action1] E1100: Command precondition does not hold before execution, [, coord action is null], Error Code: E1100
2013-10-09 12:59:57,477 INFO org.apache.oozie.command.wf.KillXCommand: USER[user] GROUP[-] TOKEN[] APP[Test] JOB[0000000-131008185935754-oozie-oozi-W] ACTION[-] STARTED WorkflowKillXCommand for jobId=0000000-131008185935754-oozie-oozi-W
2013-10-09 12:59:57,685 WARN org.apache.oozie.command.coord.CoordActionUpdateXCommand: USER[user] GROUP[-] TOKEN[] APP[Test] JOB[0000000-131008185935754-oozie-oozi-W] ACTION[-] E1100: Command precondition does not hold before execution, [, coord action is null], Error Code: E1100
2013-10-09 12:59:57,686 INFO org.apache.oozie.command.wf.KillXCommand: USER[user] GROUP[-] TOKEN[] APP[Test] JOB[0000000-131008185935754-oozie-oozi-W] ACTION[-] ENDED WorkflowKillXCommand for jobId=0000000-131008185935754-oozie-oozi-W
2013-10-09 13:41:32,654 WARN org.apache.oozie.command.wf.KillXCommand: USER[user] GROUP[-] TOKEN[] APP[Test] JOB[0000000-131008185935754-oozie-oozi-W] ACTION[-] E0725: Workflow instance can not be killed, 0000000-131008185935754-oozie-oozi-W, Error Code: E0725
2013-10-09 13:41:45,199 WARN org.apache.oozie.command.wf.KillXCommand: USER[user] GROUP[-] TOKEN[] APP[Test] JOB[0000000-131008185935754-oozie-oozi-W] ACTION[-] E0725: Workflow instance can not be killed, 0000000-131008185935754-oozie-oozi-W, Error Code: E0725
2013-10-09 13:42:04,869 WARN org.apache.oozie.command.wf.ResumeXCommand: USER[user] GROUP[-] TOKEN[] APP[Test] JOB[0000000-131008185935754-oozie-oozi-W] ACTION[-] E1100: Command precondition does not hold before execution, [workflow's status is KILLED is not SUSPENDED], Error Code: E1100
2013-10-09 13:45:56,357 WARN org.apache.oozie.command.wf.KillXCommand: USER[user] GROUP[-] TOKEN[] APP[Test] JOB[0000000-131008185935754-oozie-oozi-W] ACTION[-] E0725: Workflow instance can not be killed, 0000000-131008185935754-oozie-oozi-W, Error Code: E0725
Approaches tried:
- Passwordless SSH setup
- Proxy user set
- Granted permissions on the required folders
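
One way to double-check the passwordless SSH item is to rerun the ssh command from the log by hand. The SSH action is launched by the Oozie server process, so the key that gets offered belongs to the OS account that server runs as, which is not necessarily the account that submitted the workflow. The sketch below only builds such a command; the account name `oozie`, the helper name `build_ssh_check`, and the use of `sudo -u` are assumptions for illustration and are not taken from the setup described above.

```python
import shlex

# Options copied from the ssh command shown in the Oozie log.
OOZIE_SSH_OPTIONS = [
    "-o", "PasswordAuthentication=no",
    "-o", "KbdInteractiveDevices=no",
    "-o", "StrictHostKeyChecking=no",
    "-o", "ConnectTimeout=20",
]


def build_ssh_check(remote_user, host, server_user="oozie"):
    """Build an argv list that runs a harmless remote command as server_user.

    server_user is a hypothetical default: replace it with the OS account
    that actually runs the Oozie server on your installation.
    """
    # "true" does nothing remotely; only the authentication step matters here.
    ssh_cmd = ["ssh", *OOZIE_SSH_OPTIONS, f"{remote_user}@{host}", "true"]
    # sudo -u makes ssh use server_user's identity and key files.
    return ["sudo", "-u", server_user, *ssh_cmd]


if __name__ == "__main__":
    # Print a copy-pasteable command line for the host seen in the log.
    print(shlex.join(build_ssh_check("user", "1.2.3.4")))
```

If the printed command also ends in "Permission denied" when run on the Oozie server host, the public key of that server account is probably missing from `~/.ssh/authorized_keys` of the remote user.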
Thanks,
Kasa.