
How do I get job results from Disco in Python?

I have tried disco jobs:

jmunsch@disco-master-5147:~$ disco jobs
KeyCount@5ca:2d323:53093
KeyCount@5ca:2bcb5:4f479

and disco results:

jmunsch@disco-master-5147:~$ disco results "KeyCount@5ca:2bcb5:4f479"
dir://disco-node-9144/disco/disco-node-9144/1a/KeyCount@5ca:2bcb5:4f479/.disco/reduce-1001-1482183896238504.results

Here is the jobdict for that job, including its inputs:

jmunsch@disco-master-5147:~$ disco jobdict "KeyCount@5ca:2bcb5:4f479"
inputs  [ ... a bunch of inputs ... ]
pipeline    [[u'iter_pgs_item', u'split', False], [u'reduce', u'group_label', False]]
save_info   ddfs
worker  virtualenvworker
save_results    False
prefix  KeyCount
scheduler   {}
owner   jenkins@jenkins-4139


1 Answer

I opened a pull request, but basically here is one approach. I wanted to stream the reduce results as 2-tuples (len(2)), so I added the following to bin/discocli.py:

@Disco.job_command
def results_get(program, jobname):
    """Usage: jobname

    Print out the data of a completed job.
    `disco jobs | xargs -IJOB_ID disco results_get JOB_ID`
    """
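    # Assumes `sys` is already imported at the top of bin/discocli.py.
    status, results = program.disco.results(jobname)
    # Pick the byte-string type for the running interpreter.
    if sys.version_info[0] == 2:
        binary_type = str
    else:
        binary_type = bytes
    # Only a finished job has results to read.
    if status == 'ready':
        for line in program.disco.result_iterator(results):
            # Decode raw byte strings so they print as text.
            if isinstance(line, binary_type):
                line = line.decode('utf-8')
            print(line)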
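
The same idea can be sketched outside the CLI as a small generator. This is only an illustration, not part of the pull request: `decode_record` and `iter_job_results` are hypothetical names, and `client` is assumed to be any object exposing `results()` and `result_iterator()` the way `program.disco` does in the command above. Because the reduce results arrive as 2-tuples, the sketch also decodes byte strings found inside a tuple, not only bare byte strings.

```python
def decode_record(record, encoding="utf-8"):
    """Return the record with bytes (also bytes inside tuples) decoded to text."""
    if isinstance(record, bytes):
        return record.decode(encoding)
    if isinstance(record, tuple):
        # Reduce output is typically a (key, value) tuple; decode each part.
        return tuple(decode_record(item, encoding) for item in record)
    return record


def iter_job_results(client, jobname):
    """Yield decoded records of a finished job; yield nothing otherwise.

    `client` is an assumption: an object with results(jobname) returning
    (status, results) and result_iterator(results), as program.disco
    is used in the command above.
    """
    status, results = client.results(jobname)
    if status != "ready":
        return
    for record in client.result_iterator(results):
        yield decode_record(record)

# Hypothetical usage, given such a client object:
#     for record in iter_job_results(client, "KeyCount@5ca:2bcb5:4f479"):
#         print(record)
```

Returning a generator keeps the streaming behaviour of the CLI command: records are decoded one at a time rather than collected in memory first.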


Answered 2016-12-20T23:30:34.587