python - 了解多处理：Python 中的共享内存管理、锁和队列

Question

多处理是python中一个强大的工具，我想更深入地了解它。我想知道何时使用常规锁和队列以及何时使用多处理管理器在所有进程之间共享这些。

我提出了以下测试场景，其中有四种不同的多处理条件：

使用池和NO管理器
使用池和管理器
使用单独的流程和NO Manager
使用单独的流程和管理器

工作

所有条件都执行一个作业功能the_job。the_job由一些由锁固定的印刷组成。此外，函数的输入只是简单地放入一个队列中（看是否可以从队列中恢复）。此输入只是在名为（显示在底部）的主脚本中创建idx的索引。range(10)start_scenario

def the_job(args):
    """The job for multiprocessing.

    Prints some stuff secured by a lock and 
    finally puts the input into a queue.

    """
    idx = args[0]
    lock = args[1]
    queue=args[2]

    lock.acquire()
    print 'I'
    print 'was '
    print 'here '
    print '!!!!'
    print '1111'
    print 'einhundertelfzigelf\n'
    who= ' By run %d \n' % idx
    print who
    lock.release()

    queue.put(idx)

条件的成功被定义为从队列中完全调用输入，请参见read_queue底部的函数。

条件

条件 1 和 2 是不言自明的。条件 1 涉及创建锁和队列，并将它们传递给进程池：

def scenario_1_pool_no_manager(jobfunc, args, ncores):
    """Runs a pool of processes WITHOUT a Manager for the lock and queue.

    FAILS!

    """
    mypool = mp.Pool(ncores)
    lock = mp.Lock()
    queue = mp.Queue()

    iterator = make_iterator(args, lock, queue)

    mypool.imap(jobfunc, iterator)

    mypool.close()
    mypool.join()

    return read_queue(queue)

（帮助函数make_iterator在这篇文章的底部给出。）条件 1 失败，RuntimeError: Lock objects should only be shared between processes through inheritance.

条件 2 非常相似，但现在锁和队列在管理器的监督下：

def scenario_2_pool_manager(jobfunc, args, ncores):
    """Runs a pool of processes WITH a Manager for the lock and queue.

    SUCCESSFUL!

    """
    mypool = mp.Pool(ncores)
    lock = mp.Manager().Lock()
    queue = mp.Manager().Queue()

    iterator = make_iterator(args, lock, queue)
    mypool.imap(jobfunc, iterator)
    mypool.close()
    mypool.join()

    return read_queue(queue)

在条件 3 中，手动启动新进程，并且在没有管理器的情况下创建锁和队列：

def scenario_3_single_processes_no_manager(jobfunc, args, ncores):
    """Runs an individual process for every task WITHOUT a Manager,

    SUCCESSFUL!

    """
    lock = mp.Lock()
    queue = mp.Queue()

    iterator = make_iterator(args, lock, queue)

    do_job_single_processes(jobfunc, iterator, ncores)

    return read_queue(queue)

条件 4 类似，但现在再次使用管理器：

def scenario_4_single_processes_manager(jobfunc, args, ncores):
    """Runs an individual process for every task WITH a Manager,

    SUCCESSFUL!

    """
    lock = mp.Manager().Lock()
    queue = mp.Manager().Queue()

    iterator = make_iterator(args, lock, queue)

    do_job_single_processes(jobfunc, iterator, ncores)

    return read_queue(queue)

在这两种情况下 - 3 和 4 - 我为 10 个任务中的每一个启动一个新进程，the_job其中最多ncores进程同时运行。这是通过以下辅助函数实现的：

def do_job_single_processes(jobfunc, iterator, ncores):
    """Runs a job function by starting individual processes for every task.

    At most `ncores` processes operate at the same time

    :param jobfunc: Job to do

    :param iterator:

        Iterator over different parameter settings,
        contains a lock and a queue

    :param ncores:

        Number of processes operating at the same time

    """
    keep_running=True
    process_dict = {} # Dict containing all subprocees

    while len(process_dict)>0 or keep_running:

        terminated_procs_pids = []
        # First check if some processes did finish their job
        for pid, proc in process_dict.iteritems():

            # Remember the terminated processes
            if not proc.is_alive():
                terminated_procs_pids.append(pid)

        # And delete these from the process dict
        for terminated_proc in terminated_procs_pids:
            process_dict.pop(terminated_proc)

        # If we have less active processes than ncores and there is still
        # a job to do, add another process
        if len(process_dict) < ncores and keep_running:
            try:
                task = iterator.next()
                proc = mp.Process(target=jobfunc,
                                                   args=(task,))
                proc.start()
                process_dict[proc.pid]=proc
            except StopIteration:
                # All tasks have been started
                keep_running=False

        time.sleep(0.1)

结果

只有条件 1 失败 ( RuntimeError: Lock objects should only be shared between processes through inheritance) 而其他 3 个条件成功。我试着绕开这个结果。

为什么池需要在所有进程之间共享锁和队列，但条件 3 中的各个进程不需要？

我所知道的是，对于池条件（1 和 2），来自迭代器的所有数据都是通过酸洗传递的，而在单进程条件（3 和 4）中，来自迭代器的所有数据都是通过从主进程继承来传递的（我是使用Linux）。我想在从子进程中更改内存之前，会访问父进程使用的相同内存（写时复制）。但是一旦有人说lock.acquire()，这应该改变并且子进程确实使用放置在内存中其他地方的不同锁，不是吗？一个子进程如何知道一个兄弟激活了一个不是通过管理器共享的锁？

最后，有点相关的是我的问题 3 和 4 有多少不同。两者都有单独的流程，但它们在管理器的使用上有所不同。两者都被认为是有效代码吗？或者如果实际上不需要经理，是否应该避免使用经理？

完整脚本

对于那些只想复制和粘贴所有内容来执行代码的人，这里是完整的脚本：

__author__ = 'Me and myself'

import multiprocessing as mp
import time

def the_job(args):
    """The job for multiprocessing.

    Prints some stuff secured by a lock and 
    finally puts the input into a queue.

    """
    idx = args[0]
    lock = args[1]
    queue=args[2]

    lock.acquire()
    print 'I'
    print 'was '
    print 'here '
    print '!!!!'
    print '1111'
    print 'einhundertelfzigelf\n'
    who= ' By run %d \n' % idx
    print who
    lock.release()

    queue.put(idx)


def read_queue(queue):
    """Turns a qeue into a normal python list."""
    results = []
    while not queue.empty():
        result = queue.get()
        results.append(result)
    return results


def make_iterator(args, lock, queue):
    """Makes an iterator over args and passes the lock an queue to each element."""
    return ((arg, lock, queue) for arg in args)


def start_scenario(scenario_number = 1):
    """Starts one of four multiprocessing scenarios.

    :param scenario_number: Index of scenario, 1 to 4

    """
    args = range(10)
    ncores = 3
    if scenario_number==1:
        result =  scenario_1_pool_no_manager(the_job, args, ncores)

    elif scenario_number==2:
        result =  scenario_2_pool_manager(the_job, args, ncores)

    elif scenario_number==3:
        result =  scenario_3_single_processes_no_manager(the_job, args, ncores)

    elif scenario_number==4:
        result =  scenario_4_single_processes_manager(the_job, args, ncores)

    if result != args:
        print 'Scenario %d fails: %s != %s' % (scenario_number, args, result)
    else:
        print 'Scenario %d successful!' % scenario_number


def scenario_1_pool_no_manager(jobfunc, args, ncores):
    """Runs a pool of processes WITHOUT a Manager for the lock and queue.

    FAILS!

    """
    mypool = mp.Pool(ncores)
    lock = mp.Lock()
    queue = mp.Queue()

    iterator = make_iterator(args, lock, queue)

    mypool.map(jobfunc, iterator)

    mypool.close()
    mypool.join()

    return read_queue(queue)


def scenario_2_pool_manager(jobfunc, args, ncores):
    """Runs a pool of processes WITH a Manager for the lock and queue.

    SUCCESSFUL!

    """
    mypool = mp.Pool(ncores)
    lock = mp.Manager().Lock()
    queue = mp.Manager().Queue()

    iterator = make_iterator(args, lock, queue)
    mypool.map(jobfunc, iterator)
    mypool.close()
    mypool.join()

    return read_queue(queue)


def scenario_3_single_processes_no_manager(jobfunc, args, ncores):
    """Runs an individual process for every task WITHOUT a Manager,

    SUCCESSFUL!

    """
    lock = mp.Lock()
    queue = mp.Queue()

    iterator = make_iterator(args, lock, queue)

    do_job_single_processes(jobfunc, iterator, ncores)

    return read_queue(queue)


def scenario_4_single_processes_manager(jobfunc, args, ncores):
    """Runs an individual process for every task WITH a Manager,

    SUCCESSFUL!

    """
    lock = mp.Manager().Lock()
    queue = mp.Manager().Queue()

    iterator = make_iterator(args, lock, queue)

    do_job_single_processes(jobfunc, iterator, ncores)

    return read_queue(queue)


def do_job_single_processes(jobfunc, iterator, ncores):
    """Runs a job function by starting individual processes for every task.

    At most `ncores` processes operate at the same time

    :param jobfunc: Job to do

    :param iterator:

        Iterator over different parameter settings,
        contains a lock and a queue

    :param ncores:

        Number of processes operating at the same time

    """
    keep_running=True
    process_dict = {} # Dict containing all subprocees

    while len(process_dict)>0 or keep_running:

        terminated_procs_pids = []
        # First check if some processes did finish their job
        for pid, proc in process_dict.iteritems():

            # Remember the terminated processes
            if not proc.is_alive():
                terminated_procs_pids.append(pid)

        # And delete these from the process dict
        for terminated_proc in terminated_procs_pids:
            process_dict.pop(terminated_proc)

        # If we have less active processes than ncores and there is still
        # a job to do, add another process
        if len(process_dict) < ncores and keep_running:
            try:
                task = iterator.next()
                proc = mp.Process(target=jobfunc,
                                                   args=(task,))
                proc.start()
                process_dict[proc.pid]=proc
            except StopIteration:
                # All tasks have been started
                keep_running=False

        time.sleep(0.1)


def main():
    """Runs 1 out of 4 different multiprocessing scenarios"""
    start_scenario(1)


if __name__ == '__main__':
    main()

score 35 · Accepted Answer

multiprocessing.Lock使用 OS 提供的 Semaphore 对象实现。在 Linux 上，子进程只是通过os.fork. 这不是信号量的副本；它实际上继承了父级具有的相同句柄，可以继承文件描述符的相同方式。另一方面，Windows 不os.fork支持Lock. 它通过使用 Windows API创建multiprocessing.Lock对象内部使用的 Windows Semaphore 的重复句柄来做到这一点，其中指出：DuplicateHandle

复制句柄指的是与原始句柄相同的对象。因此，对对象的任何更改都会通过两个句柄反映出来

该DuplicateHandleAPI 允许您将重复句柄的所有权授予子进程，以便子进程在 unpickling 后可以实际使用它。通过创建孩子拥有的重复句柄，您可以有效地“共享”锁定对象。

这是信号量对象multiprocessing/synchronize.py

class SemLock(object):

    def __init__(self, kind, value, maxvalue):
        sl = self._semlock = _multiprocessing.SemLock(kind, value, maxvalue)
        debug('created semlock with handle %s' % sl.handle)
        self._make_methods()

        if sys.platform != 'win32':
            def _after_fork(obj):
                obj._semlock._after_fork()
            register_after_fork(self, _after_fork)

    def _make_methods(self):
        self.acquire = self._semlock.acquire
        self.release = self._semlock.release
        self.__enter__ = self._semlock.__enter__
        self.__exit__ = self._semlock.__exit__

    def __getstate__(self):  # This is called when you try to pickle the `Lock`.
        assert_spawning(self)
        sl = self._semlock
        return (Popen.duplicate_for_child(sl.handle), sl.kind, sl.maxvalue)

    def __setstate__(self, state): # This is called when unpickling a `Lock`
        self._semlock = _multiprocessing.SemLock._rebuild(*state)
        debug('recreated blocker with handle %r' % state[0])
        self._make_methods()

注意assert_spawning调用 in __getstate__，它在腌制对象时被调用。这是如何实现的：

#
# Check that the current thread is spawning a child process
#

def assert_spawning(self):
    if not Popen.thread_is_spawning():
        raise RuntimeError(
            '%s objects should only be shared between processes'
            ' through inheritance' % type(self).__name__
            )

该函数是确保您Lock通过调用“继承” 的函数thread_is_spawning。在 Linux 上，该方法只返回False：

@staticmethod
def thread_is_spawning():
    return False

这是因为Linux不需要pickle来继承Lock，所以如果__getstate__真的在Linux上被调用，我们一定不是继承。在 Windows 上，还有更多事情要做：

def dump(obj, file, protocol=None):
    ForkingPickler(file, protocol).dump(obj)

class Popen(object):
    '''
    Start a subprocess to run the code of a process object
    '''
    _tls = thread._local()

    def __init__(self, process_obj):
        ...
        # send information to child
        prep_data = get_preparation_data(process_obj._name)
        to_child = os.fdopen(wfd, 'wb')
        Popen._tls.process_handle = int(hp)
        try:
            dump(prep_data, to_child, HIGHEST_PROTOCOL)
            dump(process_obj, to_child, HIGHEST_PROTOCOL)
        finally:
            del Popen._tls.process_handle
            to_child.close()


    @staticmethod
    def thread_is_spawning():
        return getattr(Popen._tls, 'process_handle', None) is not None

在这里，如果对象具有属性，则thread_is_spawning返回。我们可以看到属性是在中创建的，然后我们想要继承的数据是从父级传递给子级的，然后属性被删除。所以只会在. 根据这个 python-ideas 邮件列表线程，这实际上是为了模拟与Linux 上相同的行为而添加的人为限制。Windows实际上可以支持随时通过，因为可以随时运行。TruePopen._tlsprocess_handleprocess_handle__init__dumpthread_is_spawningTrue__init__os.forkLockDuplicateHandle

以上所有内容都适用于Queue对象，因为它在Lock内部使用。

我会说继承Lock对象比使用 a 更可取Manager.Lock()，因为当您使用 a 时Manager.Lock，您对的每个调用都Lock必须通过 IPC 发送到进程，这比使用调用内部的Manager共享要慢得多Lock过程。不过，这两种方法都是完全有效的。

最后，可以使用/关键字参数将 a 传递给 aLock的所有成员Pool而不使用 a ：Managerinitializerinitargs

lock = None
def initialize_lock(l):
   global lock
   lock = l

def scenario_1_pool_no_manager(jobfunc, args, ncores):
    """Runs a pool of processes WITHOUT a Manager for the lock and queue.

    """
    lock = mp.Lock()
    mypool = mp.Pool(ncores, initializer=initialize_lock, initargs=(lock,))
    queue = mp.Queue()

    iterator = make_iterator(args, queue)

    mypool.imap(jobfunc, iterator) # Don't pass lock. It has to be used as a global in the child. (This means `jobfunc` would need to be re-written slightly.

    mypool.close()
    mypool.join()

return read_queue(queue)

这是有效的，因为传递的参数initargs被传递给在内部运行的对象的__init__方法，所以它们最终被继承，而不是被腌制。ProcessPool

python - 了解多处理：Python 中的共享内存管理、锁和队列

工作

条件

结果

完整脚本

1 回答 1

Related

Reference