使用多重处理在powerset上迭代

def powerset(seq): '''Returns all the subsets of the list. This is a generator.''' if len(seq) == 0: yield seq if len(seq) == 1: yield seq yield [] elif len(seq) > 1: for item in powerset(seq[1:]): yield [seq[0]]+item yield item def job(l): # do some calculation with the list l return do_some_hard_work(l) def calculate(): pool_size = multiprocessing.cpu_count() * 2 pool = multiprocessing.Pool(processes=pool_size, maxtasksperchild=2) pool_outputs = pool.map(job, powerset(list(range(1,10))) pool.close() pool.join() return sum(pool_outputs)

1条回答

网友

1楼 · 发布于 2024-09-27 22:26:29

如果问题是您希望避免将整个powerset放在一个列表中，那么可以使用^{}，它将一次使用迭代器chunksize元素，并将这些元素发送给工作进程，而不是将整个内容转换为一个列表并将其分块。你知道吗

pool_size = multiprocessing.cpu_count() * 2
pool = multiprocessing.Pool(processes=pool_size, maxtasksperchild=2)
pool_outputs = pool.imap(job, powerset(list(range(1,10))), chunksize=<some chunksize>)
pool.close()
pool.join()

如果powerset非常大，则需要指定一个chunksize而不是默认值，即1:

The chunksize argument is the same as the one used by the map() method. For very long iterables using a large value for chunksize can make the job complete much faster than using the default value of 1.

map函数使用以下算法，让您了解一个好的大小：

chunksize, extra = divmod(len(iterable), len(pool_size) * 4)
if extra:
    chunksize += 1

相关问题更多 >

编程相关推荐

热门问题

热门文章