c++ - Boost.MPI: What's received isn't what was sent!

Question

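I'm fairly new to using Boost MPI. I've got the libraries installed and the code compiles, but I'm getting a very odd error: some of the integer data received by the slave nodes is not what was sent by the master. What is going on?

I'm using boost version 1.42.0 and compiling the code with mpic++ (which wraps g++ on one cluster and icpc on the other). A reduced example follows, including the output.

Code: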

#include <cstdlib>   // EXIT_SUCCESS
#include <iostream>
#include <vector>
#include <boost/mpi.hpp>

using namespace std;
namespace mpi = boost::mpi;

class Solution
{
public:
  Solution() :
  solution_num(num_solutions++)
  {
    // Master node's constructor
  }

  Solution(int solutionNum) :
  solution_num(solutionNum)
  {
    // Slave nodes' constructor.
  }

  int solutionNum() const
  {
    return solution_num;
  }

private:
  static int num_solutions;
  int solution_num;
};

int Solution::num_solutions = 0;

int main(int argc, char* argv[])
{
  // Initialization of MPI
  mpi::environment env(argc, argv);
  mpi::communicator world;

  if (world.rank() == 0)
  {
    // Create solutions
    int numSolutions = world.size() - 1;  // One solution per slave
    vector<Solution*> solutions(numSolutions);
    for (int sol = 0; sol < numSolutions; ++sol)
    {
      solutions[sol] = new Solution;
    }

    // Send solutions
    for (int sol = 0; sol < numSolutions; ++sol)
    {
      world.isend(sol + 1, 0, false);  // Tells the slave to expect work
      cout << "Sending solution no. " << solutions[sol]->solutionNum() << " to node " << sol + 1 << endl;
      world.isend(sol + 1, 1, solutions[sol]->solutionNum());
    }

    // Retrieve values (solution numbers squared)
    vector<double> values(numSolutions, 0);
    for (int i = 0; i < numSolutions; ++i)
    {
      // Get values for each solution
      double value = 0;
      mpi::status status = world.recv(mpi::any_source, 2, value);
      int source = status.source();

      int sol = source - 1;
      values[sol] = value;
    }
    for (int i = 1; i <= numSolutions; ++i)
    {
      world.isend(i, 0, true);  // Tells the slave to finish
    }

    // Output the solutions numbers and their squares
    for (int i = 0; i < numSolutions; ++i)
    {
      cout << solutions[i]->solutionNum() << ", " << values[i] << endl;
      delete solutions[i];
    }
  }
  else
  {
    // Slave nodes merely square the solution number
    bool finished;
    mpi::status status = world.recv(0, 0, finished);
    while (!finished)
    {
      int solNum;
      world.recv(0, 1, solNum);
      cout << "Node " << world.rank() << " receiving solution no. " << solNum << endl;

      Solution solution(solNum);
      double value = static_cast<double>(solNum * solNum);
      world.send(0, 2, value);

      status = world.recv(0, 0, finished);
    }

    cout << "Node " << world.rank() << " finished." << endl;
  }

  return EXIT_SUCCESS;
}

Running this on 21 nodes (1 master, 20 slaves) produces:

Sending solution no. 0 to node 1
Sending solution no. 1 to node 2
Sending solution no. 2 to node 3
Sending solution no. 3 to node 4
Sending solution no. 4 to node 5
Sending solution no. 5 to node 6
Sending solution no. 6 to node 7
Sending solution no. 7 to node 8
Sending solution no. 8 to node 9
Sending solution no. 9 to node 10
Sending solution no. 10 to node 11
Sending solution no. 11 to node 12
Sending solution no. 12 to node 13
Sending solution no. 13 to node 14
Sending solution no. 14 to node 15
Sending solution no. 15 to node 16
Sending solution no. 16 to node 17
Sending solution no. 17 to node 18
Sending solution no. 18 to node 19
Sending solution no. 19 to node 20
Node 1 receiving solution no. 0
Node 2 receiving solution no. 1
Node 12 receiving solution no. 19
Node 3 receiving solution no. 19
Node 15 receiving solution no. 19
Node 13 receiving solution no. 19
Node 4 receiving solution no. 19
Node 9 receiving solution no. 19
Node 10 receiving solution no. 19
Node 14 receiving solution no. 19
Node 6 receiving solution no. 19
Node 5 receiving solution no. 19
Node 11 receiving solution no. 19
Node 8 receiving solution no. 19
Node 16 receiving solution no. 19
Node 19 receiving solution no. 19
Node 20 receiving solution no. 19
Node 1 finished.
Node 2 finished.
Node 7 receiving solution no. 19
0, 0
1, 1
2, 361
3, 361
4, 361
5, 361
6, 361
7, 361
8, 361
9, 361
10, 361
11, 361
12, 361
13, 361
14, 361
15, 361
16, 361
17, 361
18, 361
19, 361
Node 6 finished.
Node 3 finished.
Node 17 receiving solution no. 19
Node 17 finished.
Node 10 finished.
Node 12 finished.
Node 8 finished.
Node 4 finished.
Node 15 finished.
Node 18 receiving solution no. 19
Node 18 finished.
Node 11 finished.
Node 13 finished.
Node 20 finished.
Node 16 finished.
Node 9 finished.
Node 19 finished.
Node 7 finished.
Node 5 finished.
Node 14 finished.

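So while the master sends 0 to node 1, 1 to node 2, 2 to node 3 and so on, most of the slave nodes (for some reason) receive the number 19. Rather than producing the squares of the numbers from 0 to 19, we get 0 squared, 1 squared and 19 squared 18 times!

Thanks in advance to anyone who can explain this.

Alan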

score 10 · Accepted Answer

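OK, I think I have the answer, which requires some knowledge of the underlying C-style MPI calls. Boost's 'isend' function is essentially a wrapper around 'MPI_Isend', and it does not shield the user from needing to know some of the details of how 'MPI_Isend' works.

One parameter of 'MPI_Isend' is a pointer to a buffer containing the information you wish to send. Importantly, however, this buffer cannot be reused until you know that the message has been received (more precisely, until the send request has completed). So consider the following code: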

// Get solution numbers from the solutions and store in a vector
vector<int> solutionNums(numSolutions);
for (int sol = 0; sol < numSolutions; ++sol)
{
  solutionNums[sol] = solutions[sol]->solutionNum();
}

// Send solution numbers
for (int sol = 0; sol < numSolutions; ++sol)
{
  world.isend(sol + 1, 0, false);  // Indicates that we have not finished, and to expect a solution representation
  cout << "Sending solution no. " << solutionNums[sol] << " to node " << sol + 1 << endl;
  world.isend(sol + 1, 1, solutionNums[sol]);
}

This works perfectly well, because each solution number sits in its own location in memory. Now consider the following minor adjustment:

// Create solutionNum array
vector<int> solutionNums(numSolutions);
for (int sol = 0; sol < numSolutions; ++sol)
{
  solutionNums[sol] = solutions[sol]->solutionNum();
}

// Send solutions
for (int sol = 0; sol < numSolutions; ++sol)
{
  int solNum = solutionNums[sol];
  world.isend(sol + 1, 0, false);  // Indicates that we have not finished, and to expect a solution representation
  cout << "Sending solution no. " << solNum << " to node " << sol + 1 << endl;
  world.isend(sol + 1, 1, solNum);
}

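Now the underlying 'MPI_Isend' call is given a pointer to solNum. Unfortunately, this bit of memory is overwritten on each pass through the loop, so while it may look as though the number 4 is sent to node 5, by the time the send actually happens the new contents of that memory location (19, for example) are passed instead.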
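To make the mechanism concrete, here is a minimal sketch in plain C++ with no MPI involved. DeferredSender, send_from_reused_variable and send_from_stable_storage are hypothetical names invented for this illustration. The toy "isend" only remembers the address of its argument and reads it later, which is the assumption being modelled. It is a deterministic stand-in, whereas with real MPI the outcome depends on timing (nodes 1 and 2 happened to receive the right values in the run above).

```cpp
#include <vector>

// Toy stand-in for a non-blocking send (NOT MPI): it records where the
// payload lives and only reads it later, when flush() is called.
struct DeferredSender
{
  std::vector<const int*> pending;  // addresses captured at "send" time

  void isend(const int& value) { pending.push_back(&value); }

  // The "transfer" happens here, after every isend() call has returned.
  std::vector<int> flush() const
  {
    std::vector<int> delivered;
    for (const int* p : pending) delivered.push_back(*p);
    return delivered;
  }
};

// Reuses a single variable as the send buffer, so every pending send ends
// up seeing the last value written to it.
std::vector<int> send_from_reused_variable(int count)
{
  DeferredSender sender;
  int solNum = 0;  // declared outside the loop so the stored pointers stay valid in this demo
  for (int sol = 0; sol < count; ++sol)
  {
    solNum = sol;
    sender.isend(solNum);
  }
  return sender.flush();
}

// Gives each payload its own vector element, so all values are still intact
// when the deferred read finally happens.
std::vector<int> send_from_stable_storage(int count)
{
  DeferredSender sender;
  std::vector<int> solutionNums(count);
  for (int sol = 0; sol < count; ++sol) solutionNums[sol] = sol;
  for (int sol = 0; sol < count; ++sol) sender.isend(solutionNums[sol]);
  return sender.flush();
}
```

With count = 20, the first function delivers 19 twenty times, while the second delivers 0 through 19 in order.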

Now consider the original code:

// Send solutions
for (int sol = 0; sol < numSolutions; ++sol)
{
  world.isend(sol + 1, 0, false);  // Tells the slave to expect work
  cout << "Sending solution no. " << solutions[sol]->solutionNum() << " to node " << sol + 1 << endl;
  world.isend(sol + 1, 1, solutions[sol]->solutionNum());
}

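Here we are passing a temporary. Again, the location of this temporary in memory gets overwritten on each pass through the loop. Again, the wrong data gets sent to the slave nodes.

As it happens, I've been able to restructure my "real" code to use 'send' rather than 'isend'. However, if I need to use 'isend' in future, I'll be a little more careful!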

score 4 · Accepted Answer

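I think I stumbled over a similar problem today. When serializing a custom data type, I noticed that it was (sometimes) corrupted on the other side. The fix was to store the mpi::request returned by isend. If you look at communicator::isend_impl(int dest, int tag, const T& value, mpl::false_) in communicator.hpp of boost, you'll see that the serialized data is put into the request as a shared pointer. If the request is destroyed, the data is invalidated and anything might happen.

So: always save the isend return values!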

score 2 · Accepted Answer

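Your compiler has optimized the crap out of your "solutions[sol] = new Solution;" loop and concluded that it can jump to the end of all the num_solutions++ increments. It is of course wrong to do so, but that's what has happened.

It's possible, though unlikely, that an automatically threading or automatically parallelizing compiler has caused the 20 instances of num_solutions++ to occur in semi-random order with respect to the 20 instances of solution_num = num_solutions in the ctor list of Solution(). It's far more likely that an optimization went horribly wrong.

Replace

for (int sol = 0; sol < numSolutions; ++sol)
    {
      solutions[sol] = new Solution;
    }

with

for (int sol = 0; sol < numSolutions; ++sol)
    {
      solutions[sol] = new Solution(sol);
    }

and your problem will go away. In particular, each solution will get its own number instead of whatever number the shared static happens to hold for a while during the compiler's incorrect reordering of the 20 increments.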

score 1 · Accepted Answer

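Building on milianw's answer: my impression is that the correct way to use isend is to keep the request object it returns and check that it has completed, using its test() or wait() methods, before calling isend again. I think it would also work to keep calling isend() and push the request objects onto a vector. You could then test or wait on those requests with {test,wait}_{any,some,all}.

At some point you also need to worry about whether you are posting sends faster than the recipients can receive them, because sooner or later you'll run out of MPI buffers. In my experience, this just manifests itself as a crash.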
