循环的并行dask比常规循环慢?

2024-10-04 11:34:31 发布

您现在位置:Python中文网/ 问答频道 /正文

如果我试图用dask并行for循环,它的执行速度会比常规版本慢。基本上,我只是遵循dask教程中的介绍性示例,但由于某些原因,它在我这方面失败了。我做错什么了?在

In [1]: import numpy as np
   ...: from dask import delayed, compute
   ...: import dask.multiprocessing

In [2]: a10e4 = np.random.rand(10000, 11).astype(np.float16)
   ...: b10e4 = np.random.rand(10000, 11).astype(np.float16)

In [3]: def subtract(a, b):
   ...:     return a - b

In [4]: %%timeit
   ...: results = [subtract(a10e4, b10e4[index]) for index in range(len(b10e4))]
1 loop, best of 3: 10.6 s per loop

In [5]: %%timeit
   ...: values = [delayed(subtract)(a10e4, b10e4[index]) for index in range(len(b10e4)) ]
   ...: resultsDask = compute(*values, get=dask.multiprocessing.get)
1 loop, best of 3: 14.4 s per loop

Tags: inimportloopforindexnprandommultiprocessing