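I have a list of about 15 years in year_queue, and I need to spawn one process per year. However, the number of processors varies depending on the server I run the code on. How can I set the variable num_processes dynamically, based on the number of processors in the server?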

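If I set num_processes greater than the number of processors, will processes be spawned accordingly? When I tested this, it created 15 processes and split the CPU power among them. I am looking for a way to first create n processes, where n is the number of processors in the server, and then spawn the next process as each one completes.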

for i in range(num_processes):
    worker = ForEachPerson(year_queue, result_queue, i, dict_of_files)
    print "worker spawned for " + str(i)
    worker.start()

results = []
while len(results) < len(years):
    result = result_queue.get()
    results.append(result)

Has anyone run into the same problem?


while year_queue.empty() != True:
    for i in range(num_processes):
      worker = ForEachPerson(year_queue, result_queue, i, dict_of_files)
      print "worker spawned for " + str(i)
      worker.start()

    # collect results off the queue
    print "results being collected"
    results = []
    while len(results) < num_processes:
      result = result_queue.get()
      results.append(result)

2 Answers

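Use a multiprocessing Pool. The class does all the tedious work of choosing the right number of processes and running them for you. It also does not spawn a new process for each task; instead it reuses a process once its current task has finished.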

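import multiprocessing

def process_year(year):
    ...
    return result

if __name__ == '__main__':
    # With no argument, Pool starts one worker per CPU reported by cpu_count().
    pool = multiprocessing.Pool()
    # year_queue must be an iterable (e.g. a list of years), not a multiprocessing.Queue.
    results = pool.map(process_year, year_queue)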
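
To make the worker count explicit, here is a minimal, self-contained sketch of the same Pool approach, written for Python 3. The body of process_year is a hypothetical stand-in for the real per-year work, and run_years is a helper name introduced only for this illustration.

```python
import multiprocessing


def process_year(year):
    # Hypothetical stand-in for the real per-year work.
    return year, year * 2


def run_years(years, num_processes=None):
    # Default to one worker per CPU on the current server.
    if num_processes is None:
        num_processes = multiprocessing.cpu_count()
    with multiprocessing.Pool(processes=num_processes) as pool:
        # map() blocks until every year is processed and preserves input order.
        return pool.map(process_year, years)


if __name__ == '__main__':
    years = list(range(1996, 1996 + 15))
    for year, value in run_years(years):
        print(year, value)
```

With 15 years and, say, 4 CPUs, only 4 worker processes exist at any time; each one picks up the next year as soon as it finishes its current one.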
answered 2011-10-08T11:16:01.720
from multiprocessing import Process, Queue, cpu_count
from Queue import Empty

class ForEachPerson(Process):
    def __init__(self, year_queue, result_queue, i, dict_of_files):
        self.year_queue=year_queue
        self.result_queue=result_queue
        self.i=i
        self.dict_of_files=dict_of_files
        super(ForEachPerson, self).__init__()

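    def run(self):
        while True:
            try:
                # Wait briefly for work. A plain get() blocks forever and
                # never raises Empty, so the except branch would be unreachable.
                year=self.year_queue.get(timeout=1)

                ''' Do something '''

                self.result_queue.put(year)
            except Empty:
                # No work left: stop this worker.
                self.result_queue.close()
                return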

if __name__ == '__main__':
    year_queue=Queue()
    result_queue=Queue()
    dict_of_files={}

    start_year=1996
    num_years=15

    for year in range(start_year, start_year + num_years):
        year_queue.put(year)

    workers=[]
    for i in range(cpu_count()):
        worker = ForEachPerson(year_queue, result_queue, i, dict_of_files)
        print 'worker spawned for', str(i)
        worker.start()
        workers.append(worker)

    results=[]
    while len(results) < num_years:
        # get() blocks until a worker delivers a result, so Empty is never raised here.
        year=result_queue.get()
        results.append(year)
        print 'Result:', year

    for worker in workers:
        worker.terminate()
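
The code above stops its workers with terminate(). As an alternative, here is a sketch (written for Python 3) that shuts workers down with sentinel values instead, so each process exits on its own and can be joined. The names STOP, worker and run_years_with_sentinels are hypothetical, and the "work" is a placeholder that simply echoes the year back.

```python
from multiprocessing import Process, Queue, cpu_count

STOP = None  # sentinel telling a worker that no work is left


def worker(year_queue, result_queue):
    # iter(callable, sentinel) keeps calling get() until it returns STOP.
    for year in iter(year_queue.get, STOP):
        result_queue.put(year)  # placeholder for the real per-year work


def run_years_with_sentinels(years, num_workers=None):
    num_workers = num_workers or cpu_count()
    year_queue, result_queue = Queue(), Queue()
    for year in years:
        year_queue.put(year)
    # One sentinel per worker so that every worker sees exactly one STOP.
    for _ in range(num_workers):
        year_queue.put(STOP)
    workers = [Process(target=worker, args=(year_queue, result_queue))
               for _ in range(num_workers)]
    for w in workers:
        w.start()
    # Drain the results before joining, so no worker blocks on a full pipe.
    results = [result_queue.get() for _ in range(len(years))]
    for w in workers:
        w.join()
    return sorted(results)


if __name__ == '__main__':
    print(run_years_with_sentinels(list(range(1996, 1996 + 15))))
```

Results arrive in completion order, so the sketch sorts them before returning.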
answered 2011-10-09T17:53:01.690