Fastest way to run a single function in parallel for multiple parameters in Python

def processing(image_location):
    image = rasterio.open(image_location)
    ...
    ...
    return(result)

# calling function serially one after the other with different parameters and saving the results to a variable.
results1 = processing(r'/home/test/image_1.tif')
results2 = processing(r'/home/test/image_2.tif')
results3 = processing(r'/home/test/image_3.tif')

3 Answers

User

#1 · Edited 2024-10-03 11:20:34

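You may want to look at IPython Parallel. It lets you easily run functions on a load-balanced (local) cluster.

For this small example, make sure IPython Parallel, NumPy, and Pillow are installed. To run the example, first start a cluster. To start a local cluster with four parallel engines, type the following in a terminal (one engine per processor core seems a reasonable choice):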

ipcluster start -n 4

You can then run the following script, which searches a given directory for jpg images and counts the number of pixels in each image:

import ipyparallel as ipp


rc = ipp.Client()
with rc[:].sync_imports():  # import on all engines
    import numpy
    from pathlib import Path
    from PIL import Image


lview = rc.load_balanced_view()  # default load-balanced view
lview.block = True  # block until map() is finished


@lview.parallel()
def count_pixels(fn: Path):
    """Silly function to count the number of pixels in an image file"""
    im = Image.open(fn)
    xx = numpy.asarray(im)
    num_pixels = xx.shape[0] * xx.shape[1]
    return fn.stem, num_pixels


pic_dir = Path('Pictures')
fn_lst = list(pic_dir.glob('*.jpg'))  # list all jpg-files in pic_dir

results = count_pixels.map(fn_lst)  # execute in parallel

for n_, cnt in results:
    print(f"'{n_}' has {cnt} pixels.")

User

#2 · Edited 2024-10-03 11:20:34

You can use multiprocessing to execute the function in parallel and save the results to the results variable:

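from multiprocessing.pool import ThreadPool

# ThreadPool runs the calls in threads; multiprocessing.Pool has the same map() API but uses processes
pool = ThreadPool()
images = [r'/home/test/image_1.tif', r'/home/test/image_2.tif', r'/home/test/image_3.tif']
results = pool.map(processing, images)  # processing is the function from the question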
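
A minimal, self-contained sketch of the same pattern. The body of the question's function is elided, so the processing function here is a hypothetical stand-in that only returns a string. ThreadPool.map returns its results in the same order as the inputs, so they can be unpacked into the three variables the question uses.

```python
from multiprocessing.pool import ThreadPool


def processing(image_location):
    """Hypothetical stand-in: the question's real function body is elided."""
    # Return something derived from the input so the ordering is visible.
    return f"processed {image_location}"


images = [
    r'/home/test/image_1.tif',
    r'/home/test/image_2.tif',
    r'/home/test/image_3.tif',
]

# The context manager tears the pool down once the block exits.
with ThreadPool(processes=len(images)) as pool:
    results = pool.map(processing, images)  # same order as `images`

# Unpack into the variable names used in the question.
results1, results2, results3 = results
print(results1)
```

Whether threads actually speed up the real function depends on how much of its work releases the GIL; if it does not help, multiprocessing.Pool can be swapped in with the same map() call.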

User

#3 · Edited 2024-10-03 11:20:34

Another way to write this with the multiprocessing library (see @Alderven's answer for the other function):

import multiprocessing as mp

import numpy as np  # needed for np.arange below


def calculate(input_args):
    result = input_args * 2
    return result


if __name__ == '__main__':  # guard needed where worker processes are spawned (e.g. Windows)
    N = mp.cpu_count()
    parallel_input = np.arange(0, 100)
    print('Number of CPUs ', N)
    print('Number of iterations ', len(parallel_input))

    with mp.Pool(processes=N) as p:
        results = p.map(calculate, list(parallel_input))

The results variable will contain a list of the processed data, which you can then write out.
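
One way to work with that list, sketched under assumptions: Pool.map returns results in the same order as its inputs, so each result can be paired back with the input that produced it. The helper name pair_results, the two-worker pool, and the small input range are illustrative and not part of the answer.

```python
import multiprocessing as mp


def calculate(input_args):
    # Same toy computation as in the answer above.
    return input_args * 2


def pair_results(inputs, results):
    """Hypothetical helper: map each input to its result, relying on map() order."""
    return dict(zip(inputs, results))


if __name__ == '__main__':
    inputs = list(range(5))  # small illustrative input
    with mp.Pool(processes=2) as p:
        results = p.map(calculate, inputs)
    # Print each input next to the value computed for it.
    for value, doubled in pair_results(inputs, results).items():
        print(value, doubled)
```

Keeping the pairing in a separate function means the same step works unchanged if the inputs are file paths rather than numbers.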
