带有mpi4py的Openmpi不能在多个节点上工作

2024-05-05 02:59:49 发布

您现在位置:Python中文网/ 问答频道 /正文

我有一个用mpi4py编写的并行python程序。我正在努力把它分发出去。我设置虚拟机,安装openmpi,openssh服务器,交换密钥等等。在本地计算机上我有主机文件:

127.0.0.1 slots=4
192.168.1.104 slots=2

我试着用:

^{pr2}$

但我得到了以下错误:

[Kreutz:13090] tcp_peer_recv_connect_ack: invalid header type: 0

ORTE was unable to reliably start one or more daemons. This usually is caused by:

  • not finding the required libraries and/or binaries on one or more nodes. Please check your PATH and LD_LIBRARY_PATH settings, or configure OMPI with --enable-orterun-prefix-by-default

  • lack of authority to execute on one or more specified nodes. Please verify your allocation and authorities.

  • the inability to write startup files into /tmp (--tmpdir/orte_tmpdir_base). Please check with your sys admin to determine the correct location to use.

  • compilation of the orted with dynamic libraries when static are required (e.g., on Cray). Please check your configure cmd line and consider using one of the contrib/platform definitions for your system type.

  • an inability to create a connection back to mpirun due to a lack of common network interfaces and/or no route found between them. Please check network connectivity (including firewalls and network routing requirements).

我不知道该怎么办。你有什么想法我可以试试吗?在


Tags: orandofthetoyouronmore