c++ - pthread_once() 中的竞争条件？

Question

我有std::future一个线程正在等待std::promise在另一个线程中设置。

编辑： 用一个将永远阻塞的示例应用程序更新了问题：

更新：如果我改用 a pthread_barrier，下面的代码不会阻塞。

我创建了一个测试应用程序来说明这一点：

基本上类foo创建一个在它的运行函数thread中设置一个promise，并在构造函数中等待它promise被设置。一旦设置，它会增加一个atomic计数

然后我创建了一堆这样的foo对象，把它们拆掉，然后检查我的count.

#include <iostream>
#include <thread>
#include <atomic>
#include <future>
#include <list>
#include <unistd.h>

struct foo
{
    foo(std::atomic<int>& count)
        : _stop(false)
    {
        std::promise<void> p;
        std::future <void> f = p.get_future();

        _thread = std::move(std::thread(std::bind(&foo::run, this, std::ref(p))));

        // block caller until my thread has started 
        f.wait();

        ++count; // my thread has started, increment the count
    }
    void run(std::promise<void>& p)
    {
        p.set_value(); // thread has started, wake up the future

        while (!_stop)
            sleep(1);
    }
    std::thread _thread;
    bool _stop;
};

int main(int argc, char* argv[])
{
    if (argc != 2)
    {
        std::cerr << "usage: " << argv[0] << " num_threads" << std::endl;
        return 1;
    }
    int num_threads = atoi(argv[1]);
    std::list<foo*> threads;
    std::atomic<int> count(0); // count will be inc'd once per thread

    std::cout << "creating threads" << std::endl;
    for (int i = 0; i < num_threads; ++i)
        threads.push_back(new foo(count));

    std::cout << "stopping threads" << std::endl;
    for (auto f : threads)
        f->_stop = true;

    std::cout << "joining threads" << std::endl;
    for (auto f : threads)
    {
        if (f->_thread.joinable())
            f->_thread.join();
    }

    std::cout << "count=" << count << (num_threads == count ? " pass" : " fail!") << std::endl;
    return (num_threads == count);
}

如果我在一个有 1000 个线程的循环中运行它，它只需要执行几次，直到发生竞争并且其中一个futures永远不会被唤醒，因此应用程序会永远卡住。

# this loop never completes
$ for i in {1..1000}; do ./a.out 1000; done

如果我现在SIGABRT使用该应用程序，则生成的堆栈跟踪显示它卡在future::wait 堆栈跟踪如下：

// main thread
    pthread_cond_wait@@GLIBC_2.3.2 () from /lib64/libpthread.so.0
    __gthread_cond_wait (__mutex=<optimized out>, __cond=<optimized out>) at libstdc++-v3/include/x86_64-unknown-linux-gnu/bits/gthr-default.h:846
    std::condition_variable::wait (this=<optimized out>, __lock=...) at ../../../../libstdc++-v3/src/condition_variable.cc:56
    std::condition_variable::wait<std::__future_base::_State_base::wait()::{lambda()#1}>(std::unique_lock<std::mutex>&, std::__future_base::_State_base::wait()::{lambda()#1}) (this=0x93a050, __lock=..., __p=...) at include/c++/4.7.0/condition_variable:93
    std::__future_base::_State_base::wait (this=0x93a018) at include/c++/4.7.0/future:331
    std::__basic_future<void>::wait (this=0x7fff32587870) at include/c++/4.7.0/future:576
    foo::foo (this=0x938320, count=...) at main.cpp:18
    main (argc=2, argv=0x7fff32587aa8) at main.cpp:52


// foo thread
    pthread_once () from /lib64/libpthread.so.0
    __gthread_once (__once=0x93a084, __func=0x4378a0 <__once_proxy@plt>) at gthr-default.h:718
    std::call_once<void (std::__future_base::_State_base::*)(std::function<std::unique_ptr<std::__future_base::_Result_base, std::__future_base::_Result_base::_Deleter> ()>&, bool&), std::__future_base::_State_base* const, std::reference_wrapper<std::function<std::unique_ptr<std::__future_base::_Result_base, std::__future_base::_Result_base::_Deleter> ()> >, std::reference_wrapper<bool> >(std::once_flag&, void (std::__future_base::_State_base::*&&)(std::function<std::unique_ptr<std::__future_base::_Result_base, ...) at include/c++/4.7.0/mutex:819
    std::promise<void>::set_value (this=0x7fff32587880) at include/c++/4.7.0/future:1206
    foo::run (this=0x938320, p=...) at main.cpp:26

我很确定我的代码没有做错任何事情，对吧？

这是 pthread 实现还是 std::future/std::promise 实现的问题？

我的图书馆版本是：

libstdc++.so.6
libc.so.6 (GNU C Library stable release version 2.11.1 (20100118))
libpthread.so.0 (Native POSIX Threads Library by Ulrich Drepper et al Copyright (C) 2006)

score 8 · Accepted Answer

promise确实，本地对象的析构函数（在构造函数的末尾和从线程的调用之间）存在竞争条件set_value()。也就是说，set_value()唤醒主线程，接下来会销毁 promise 对象，但set_value()函数没有尚未完成，并且死锁。

阅读 C++11 标准，我不确定是否允许您使用：

void promise<void>::set_value();

效果：以原子方式将值 r 存储在共享状态中并使该状态准备就绪。

但在其他地方：

set_value、set_exception、set_value_at_thread_exit 和 set_exception_at_thread_exit 成员函数的行为就像它们在更新 Promise 对象时获取与 Promise 对象关联的单个互斥锁。

set_value()对于其他函数，例如析构函数，调用是否应该是原子的？

恕我直言，我会说不。效果相当于在其他线程仍在锁定互斥锁时销毁互斥锁。结果未定义。

解决方案是使p线程寿命更长。我能想到的两个解决方案：

p正如迈克尔伯尔在另一个答案中建议的那样，成为班级成员。
将承诺移动到线程中。

在构造函数中：

std::promise<void> p;
std::future <void> f = p.get_future();
_thread = std::thread(&foo::run, this, std::move(p));

顺便说一句，您不需要调用bind, （线程构造函数已经重载）或调用来std::move移动线程（正确的值已经是 r 值）。但是，对承诺的调用std::move是强制性的。

并且线程函数没有收到引用，而是收到了移动的承诺：

void run(std::promise<void> p)
{
    p.set_value();
}

我认为这正是 C++11 定义两个不同类的原因：promise和future: 你将 promise 移动到线程中，但你保留 future 来恢复结果。

score 5 · Accepted Answer

尝试移动，std::promise<void> p;以便它不是构造函数的本地成员，而是struct foo：

struct foo
{
    foo(std::atomic<int>& count)
        : _stop(false)
    {
        // std::promise<void> p;    // <-- moved to be a member
        std::future <void> f = p.get_future();

        // ...same as before...
    }
    void run(std::promise<void>& p)
    {
        // ... same ...
    }

    std::promise<void> p;   // <---
    std::thread _thread;
    bool _stop;
};

我相信可能发生的事情是你进入了一场比赛，p在构造函数中被破坏，同时p.set_value()对那个的引用起作用promise。set_value()在完成/清理时内部发生了一些事情；对已销毁的引用进行操作正在破坏库std::promise中的某些状态pthread。

这只是猜测 - 我目前无法访问重现问题的系统。但是创建p一个成员将确保它的生命周期远远超过set_value()调用完成。

c++ - pthread_once() 中的竞争条件？

2 回答 2

Related

Reference