在Dask中，是否有一种方法可以在依赖项可用时进行处理，如在multiprocessing.imap_unordered中？

from multiprocessing import Pool from dask import delayed import numpy as np from time import sleep def wait(i): """Something embarrassingly parallel""" np.random.seed() t = np.random.uniform() sleep(t) print(i, t) return i, t def lineup(who_when): """Aggregate""" order = [] for who, when in who_when: print(f'who: {who}') order.append(who) return order

n = 5 pool = Pool(processes=n) lineup(pool.imap_unordered(wait, range(n))) # Produces something like the following 2 0.2837069069881948 4 0.44156753704276597 who: 2 who: 4 1 0.5563172244950703 0 0.6696008076879393 who: 1 who: 0 3 0.9911326214345308 who: 3 [2, 4, 1, 0, 3]

n = 5 order = delayed(lineup)([delayed(wait)(i) for i in range(n)]) order.compute() # produces something like: 0 0.2792789023871932 2 0.44570072028850705 4 0.6969597596416385 1 0.766705306208266 3 0.9889956337687371 who: 0 who: 1 who: 2 who: 3 who: 4 [0, 1, 2, 3, 4]

1条回答

网友

1楼 · 发布于 2024-10-17 00:19:42

对。您可能正在寻找Dask Futures interface的as_completed函数

这里有一个关于Handling Evolving Workflows的Dask示例

为方便起见，我将复制此处填写的as_的文档字符串

竣工

按期货完成的顺序返回期货

这将返回一个迭代器，该迭代器按照输入对象完成的顺序生成输入对象。无论顺序如何，在迭代器上调用next都将阻塞，直到下一个future完成

此外，您还可以在使用.add方法计算期间向该对象添加更多未来

参数

期货：期货集合按完成顺序迭代的未来对象的列表

结果为：bool（False）是否等待并包括期货结果；在本例中，当_完成时，将产生（未来、结果）的元组

raise_错误：布尔（真）当未来的结果引发例外时，我们是否应该提出；仅在结果为True时影响行为

例子

>>> x, y, z = client.map(inc, [1, 2, 3])  # doctest: +SKIP
>>> for future in as_completed([x, y, z]):  # doctest: +SKIP
...     print(future.result())  # doctest: +SKIP
3
2
4

在计算过程中添加更多期货

>>> x, y, z = client.map(inc, [1, 2, 3])  # doctest: +SKIP
>>> ac = as_completed([x, y, z])  # doctest: +SKIP
>>> for future in ac:  # doctest: +SKIP
...     print(future.result())  # doctest: +SKIP
...     if random.random() < 0.5:  # doctest: +SKIP
...         ac.add(c.submit(double, future))  # doctest: +SKIP
4
2
8
3
6
12
24

也可以选择等待，直到收集到结果

>>> ac = as_completed([x, y, z], with_results=True)  # doctest: +SKIP
>>> for future, result in ac:  # doctest: +SKIP
...     print(result)  # doctest: +SKIP
2
4
3

竣工

参数

例子

相关问题更多 >

编程相关推荐

热门问题

热门文章