Questions tagged "oozie-coordinator"

0 votes

1 answer

766 views

shell - Scheduling/running a mahout command in oozie

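I am trying to run a mahout command, seq2sparse, using the oozie scheduler, but it gives an error. I tried running the mahout command with the oozie shell action, but nothing worked.

Below is the oozie workflow -

I also tried creating a shell script and running it in oozie

with job.properties as

and generateBrandSparseFile.sh is

But none of the options works. The error for the latter is -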

SLF4J: See http://www.slf4j.org/codes.html#multiple_bindings for an explanation.
SLF4J: Actual binding is of type [org.slf4j.impl.Log4jLoggerFactory]
sudo: no tty present and no askpass program specified
15/06/05 12:23:59 WARN driver.MahoutDriver: No seq2sparse.props found on classpath, will use command-line arguments only
15/06/05 12:24:01 INFO vectorizer.SparseVectorsFromSequenceFiles: Maximum n-gram size is: 1

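For the "sudo: no tty present" error, I have commented out this line in /etc/sudoers - Defaults !requiretty

Mahout is installed on the node where the oozie server is installed.

The following oozie workflow does not work either -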

The error is - Error: E0701 : E0701: XML schema error, cvc-complex-type.2.4.a: Invalid content was found starting with element 'ssh'. One of '{"uri:oozie:workflow:0.4":map-reduce, "uri:oozie:workflow:0.4":pig, "uri:oozie:workflow:0.4":sub-workflow, "uri:oozie:workflow:0.4":fs, "uri:oozie:workflow:0.4":java, WC[##other:"uri:oozie:workflow:0.4"]}' is expected.

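Will installing mahout on all the nodes help? (oozie can run the script on any node.) Is there a way to make mahout available on the hadoop cluster?

Any other solution is also welcome.

Thanks in advance.

Edit: I have changed the approach slightly and am now calling the seq2sparse class directly. The workflow is -

The job is still not running, and the error is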

2015-06-05T16:43:42.937

0 votes

2 answers

3469 views

shell - How to invoke an oozie workflow via shell script and block/wait till workflow completion

I have created a workflow using Oozie that is comprised of multiple action nodes and have been successfully able to run those via coordinator.

I want to invoke the Oozie workflow via a wrapper shell script.

The wrapper script should invoke the Oozie command, wait till the oozie job completes (success or error) and return back the Oozie success status code (0) or the error code of the failed oozie action node (if any node of the oozie workflow has failed).

From what I have seen so far, I know that as soon as I invoke the oozie command to run a workflow, the command exits with the job id getting printed on linux console, while the oozie job keeps running asynchronously in the backend.

I want my wrapper script to block till the oozie coordinator job completes and return back the success/error code.

Can you please let me know how/if I can achieve this using any of the oozie features?

I am using Oozie version 3.3.2 and bash shell in Linux.

Note: In case anyone is curious about why I need such a feature - the requirement is that my wrapper shell script should know how long an oozie job has been running and when it has completed, and return the corresponding exit code so that the parent process calling the wrapper script knows whether the job completed successfully. If the job errored out, the parent process raises an alert/ticket for the support team.
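
For illustration, here is a rough sketch of the kind of wrapper described above. It assumes that `oozie job -run` prints a `job: <id>` line and that `oozie job -info` prints a `Status : <STATE>` line. The Oozie URL, the 30-second poll interval, and all helper names (`extract_job_id`, `extract_status`, `status_to_code`, `run_and_wait`) are placeholders made up for this example. It only reports overall success or failure (0 or 1), not the error code of the individual failed action node.

```shell
#!/usr/bin/env bash
# Sketch only: submit an Oozie job, poll until it reaches a terminal state,
# and turn that state into an exit code.
# Assumed values (not from the post): OOZIE_URL, POLL_SECONDS, helper names.

OOZIE_BIN="${OOZIE_BIN:-oozie}"
OOZIE_URL="${OOZIE_URL:-http://localhost:11000/oozie}"
POLL_SECONDS="${POLL_SECONDS:-30}"

# Read "oozie job -run" output on stdin and print the id from "job: <id>".
extract_job_id() {
  sed -n 's/^job:[[:space:]]*//p' | head -n 1
}

# Read "oozie job -info" output on stdin and print the value of the
# first "Status : <STATE>" line.
extract_status() {
  awk -F':' '/^Status[[:space:]]*:/ { gsub(/[[:space:]]/, "", $2); print $2; exit }'
}

# Return 0 for success, 1 for a failed terminal state, 2 for anything else
# (still running, suspended, or status not readable).
status_to_code() {
  case "$1" in
    SUCCEEDED) return 0 ;;
    KILLED|FAILED|DONEWITHERROR) return 1 ;;
    *) return 2 ;;
  esac
}

# Submit the job described by the given properties file and block until done.
run_and_wait() {
  local props="$1" job_id status rc
  job_id="$("$OOZIE_BIN" job -oozie "$OOZIE_URL" -config "$props" -run | extract_job_id)"
  if [ -z "$job_id" ]; then
    echo "submission failed" >&2
    return 1
  fi
  echo "submitted $job_id" >&2
  while true; do
    status="$("$OOZIE_BIN" job -oozie "$OOZIE_URL" -info "$job_id" | extract_status)"
    rc=0
    status_to_code "$status" || rc=$?
    if [ "$rc" -eq 0 ]; then
      return 0
    elif [ "$rc" -eq 1 ]; then
      echo "job $job_id ended with status $status" >&2
      return 1
    fi
    sleep "$POLL_SECONDS"
  done
}
```

A parent process could source this file, call `run_and_wait job.properties`, and check `$?`. The loop keeps polling for as long as the status is non-terminal, so a real wrapper would probably also want a timeout.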

shell blocking oozie error-code oozie-coordinator

2015-06-20T10:58:16.277

0 votes

1 answer

1963 views