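Why don't I get different thread IDs when I use "#pragma omp parallel num_threads(4)"? In this case all thread IDs are 0. But when I comment out that line and use the default number of threads, I get different thread IDs. Note: I use the variable tid to hold the thread ID.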

#include <omp.h>
#include <stdio.h>
#include <stdlib.h>

int main (int argc, char *argv[]) 
{
int nthreads, tid;
int x = 0;

#pragma omp parallel num_threads(4)
#pragma omp parallel private(nthreads, tid)
  {
  /* Obtain thread number */
 tid = omp_get_thread_num();
  printf("Hello World from thread = %d\n", tid);

  // /* Only master thread does this */
   if (tid == 0) 
     {
     nthreads = omp_get_num_threads();
     printf("Number of threads = %d\n", nthreads);
     }

  }


}

Output of the above code:

Hello World from thread = 0
Hello World from thread = 0
Number of threads = 1
Hello World from thread = 0
Number of threads = 1
Hello World from thread = 0
Number of threads = 1
Number of threads = 1

Output when I comment out the num_threads(4) line:

Hello World from thread = 3
Hello World from thread = 0
Number of threads = 4
Hello World from thread = 1
Hello World from thread = 2

2 Answers

You are creating two nested parallel regions. It is the same as writing this:

#pragma omp parallel num_threads(4)
{
  #pragma omp parallel private(nthreads, tid)
  {
    /* Obtain thread number */
    tid = omp_get_thread_num();
    printf("Hello World from thread = %d\n", tid);

    // /* Only master thread does this */
    if (tid == 0) 
    {
      nthreads = omp_get_num_threads();
      printf("Number of threads = %d\n", nthreads);
    }
  }
}

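omp_get_num_threads() returns the number of threads in the innermost region, and omp_get_thread_num() returns the thread's ID within that innermost team. So you are running four threads, each of which executes an inner parallel region with a team of one thread. That is why every ID is 0 and every reported count is 1.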

The inner parallel region runs with only one thread because you have not enabled nested parallelism. You can enable it by calling omp_set_nested(1).

http://docs.oracle.com/cd/E19205-01/819-5270/aewbi/index.html

If, instead of creating two nested parallel regions, you want a single parallel region that carries both clauses, you can write this:

#pragma omp parallel num_threads(4) private(nthreads,tid)
{
  .
  .
  .
}
answered 2012-11-03T14:18:52.813

Nesting can also be enabled by setting the environment variable OMP_NESTED to true.

answered 2019-07-19T22:40:31.977