3

我在尝试运行 MPI 时遇到问题。我安装了 mpich2-1.4.1p1,当我使用 mpirun 运行 mpi 程序时,我得到:

[starsky:27711] [[INVALID],INVALID] ORTE_ERROR_LOG: A system-required executable either could not be found or was not executable by this user in file ../../../../../../orte/mca/ess/singleton/ess_singleton_module.c at line 357
[starsky:27711] [[INVALID],INVALID] ORTE_ERROR_LOG: A system-required executable either could not be found or was not executable by this user in file ../../../../../../orte/mca/ess/singleton/ess_singleton_module.c at line 230
[starsky:27711] [[INVALID],INVALID] ORTE_ERROR_LOG: A system-required executable either could not be found or was not executable by this user in file ../../../orte/runtime/orte_init.c at line 132
--------------------------------------------------------------------------
It looks like orte_init failed for some reason; your parallel process is
likely to abort.  There are many reasons that a parallel process can
fail during orte_init; some of which are due to configuration or
environment problems.  This failure appears to be an internal failure;
here's some additional information (which may only be relevant to an
Open MPI developer):

  orte_ess_set_name failed
  --> Returned value A system-required executable either could not be found or was not executable by this user (-127) instead of ORTE_SUCCESS
--------------------------------------------------------------------------
[starsky:27719] [[INVALID],INVALID] ORTE_ERROR_LOG: A system-required executable either could not be found or was not executable by this user in file ../../../../../../orte/mca/ess/singleton/ess_singleton_module.c at line 357
--------------------------------------------------------------------------
It looks like MPI_INIT failed for some reason; your parallel process is
likely to abort.  There are many reasons that a parallel process can
fail during MPI_INIT; some of which are due to configuration or environment
problems.  This failure appears to be an internal failure; here's some
additional information (which may only be relevant to an Open MPI
developer):

  ompi_mpi_init: orte_init failed
  --> Returned "A system-required executable either could not be found or was not executable by this user" (-127) instead of "Success" (0)

以及其他非常相似的错误消息。

这与 mpiexec 中的问题非常相似,因为 MPI init aborts 失败,因为我似乎同时拥有 Open MPI 和 mpich2 。但是,我使用 cmake 编译,当我执行建议的 mpicc-vt.mpich2(mpicc-vt 因为我使用 vampir trace)进行编译,然后运行 ​​mpirun.mpich2 时,出现以下错误:

To run 'mpirun.mpich2' please ask your administrator to install the package 'mpich2'

我已经安装了 mpich2。有没有我想安装 mpich2 的配置?当我在配置中查看本手册时,似乎没有一个符合我的需要。

谢谢

4

1 回答 1

2

很难判断这里发生了什么,因为您的系统上安装了两个处于不同完成状态的 MPI 库。我认为此时最好的办法是卸载 Open MPI 和 MPICH,然后转到您正在使用的任何包管理器并重新安装。这是让一切正常工作的最简单方法。

如果您想要最新版本(这通常是一件好事),您必须直接访问源代码,因此请查看与您下载的 tarball 捆绑在一起的 README。通常,对于这两个软件包,您都使用安装 Unix 软件的标准方法。

./configure --prefix=<path for installation>
make
make install

MPICH 和 Open MPI 都有很多自定义选项,如果您执行./configure --help并阅读自述文件,您可以阅读这些选项,但除非您有特殊系统,否则通常不需要这些选项。

于 2013-08-16T13:57:12.633 回答