
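I am trying to create an MPI cluster by connecting two laptops and running an MPI program on them. I followed the steps described here (https://medium.com/mpi-cluster-setup/mpi-clusters-within-a-lan-77168e0191b1). I can ssh to the other node without a password. However, when I run mpiexec -n 2 -hosts manager,worker ./main, I get the following error.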

[proxy:0:1@gunavaran-HP-Pavilion-Notebook] HYDU_sock_connect (utils/sock/sock.c:113): unable to get host address for gunavaran-HP-ENVY-15-Notebook-PC
[proxy:0:1@gunavaran-HP-Pavilion-Notebook] main (pm/pmiserv/pmip.c:181): unable to connect to server gunavaran-HP-ENVY-15-Notebook-PC at port 43211 (check for firewalls!)
Host key verification failed.

Here is my hosts file:

127.0.0.1       localhost
#127.0.1.1      gunavaran-HP-ENVY-15-Notebook-PC

#MPI SETUP
192.168.8.102   manager
192.168.8.108   worker


# The following lines are desirable for IPv6 capable hosts
::1     ip6-localhost ip6-loopback
fe00::0 ip6-localnet
ff00::0 ip6-mcastprefix
ff02::1 ip6-allnodes
ff02::2 ip6-allrouters

1 Answer


I changed the hostnames of the two machines to manager and worker using sudo hostnamectl set-hostname. It works fine now.
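A likely reason the rename helps: in the error output, the proxy on the worker tries to reach the manager by its system hostname (gunavaran-HP-ENVY-15-Notebook-PC), and the hosts file in the question has no active entry for that name. The Python sketch below checks which names a hosts file leaves unmapped. It assumes the worker's hosts file carries the same entries as the one shown in the question, and parse_hosts and unmapped_names are hypothetical helper names for illustration, not part of MPI or Hydra.

```python
def parse_hosts(text):
    """Map each hostname in hosts-file text to its address, skipping comments."""
    mapping = {}
    for line in text.splitlines():
        line = line.split("#", 1)[0].strip()  # drop comments and surrounding whitespace
        if not line:
            continue
        address, *names = line.split()
        for name in names:
            # Hostnames are compared case-insensitively; the first entry wins.
            mapping.setdefault(name.lower(), address)
    return mapping


def unmapped_names(names, hosts_text):
    """Return the names that have no active entry in the hosts-file text."""
    known = parse_hosts(hosts_text)
    return [name for name in names if name.lower() not in known]


# Entries copied from the hosts file in the question (assumed to match the worker's).
HOSTS_TEXT = """\
127.0.0.1       localhost
#127.0.1.1      gunavaran-HP-ENVY-15-Notebook-PC
192.168.8.102   manager
192.168.8.108   worker
"""

if __name__ == "__main__":
    wanted = ["manager", "worker", "gunavaran-HP-ENVY-15-Notebook-PC"]
    print(unmapped_names(wanted, HOSTS_TEXT))
```

With these entries, only gunavaran-HP-ENVY-15-Notebook-PC is reported as unmapped. Renaming the machines to manager and worker makes the system hostnames match names the hosts file already maps. Adding a hosts entry for the original hostname on each node would presumably have the same effect, though that is untested here.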

Answered 2020-07-29T02:58:01.800