1

我试图将 CUDA 项目移植到广泛使用 C++ 模板的 ROCm 平台。在此过程中,我收到以下编译错误

/root/warp-ctc/include/detail/gpu_ctc.h:381:5: error: no matching function for call to 'hipLaunchKernelGGL'
    hipLaunchKernelGGL((prepare_stable_SM_kernel<ProbT, VT>), dim3(grid_size), dim3(NT), 0, stream_, ctc_helper::identity<ProbT>(), probs_,
    ^~~~~~~~~~~~~~~~~~
.....
.....
/opt/rocm/hip/include/hip/hcc_detail/functional_grid_launch.hpp:138:13: note: candidate function [with Args = <ctc_helper::identity<float, float>, float *,
      float *, int, int>, F = void (*)(ctc_helper::identity<float, float>, float *, float *, int, int)] not viable: no overload of 'prepare_stable_SM_kernel'
      matching 'void (*)(ctc_helper::identity<float, float>, float *, float *, int, int)' for 1st argument
inline void hipLaunchKernelGGL(F kernel, const dim3& numBlocks, const dim3& dimBlocks,

用于以下 C++ 代码

hipLaunchKernelGGL((prepare_stable_SM_kernel<ProbT, VT>),
                   dim3(grid_size),
                   dim3(NT),
                   0,
                   stream_,
                   ctc_helper::identity<ProbT>(),
                   probs_,
                   denoms_,
                   out_dim_,
                   num_elements);

其中hipLaunchKernelGGL定义

template <typename... Args, typename F = void (*)(Args...)>
inline void hipLaunchKernelGGL(F kernel,
                               const dim3& numBlocks,
                               const dim3& dimBlocks,
                               std::uint32_t sharedMemBytes,
                               hipStream_t stream,
                               Args... args) {

// ...
// ...
}

prepare_stable_SM_kernel定义

template <typename ProbT, int VT = 1, typename Op>
__global__ void prepare_stable_SM_kernel(Op f, ProbT* probs,
                                         const ProbT* const col_max,
                                         int alphabet_size,
                                         int count) {
// ...
}

任何人请通过提供一些提示来解决此问题来帮助我。

4

1 回答 1

1

ctc_helper::identity<ProbT>通过向prepare_stable_SM_kernel函数添加第三个参数已修复此错误

参考https://github.com/harish2704/warp-ctc/commit/c7ec45febc9c0077ffa35932b3d11a05daf8bf7c#diff-1e372cfc8edab301b6c284e9db8dbf68

于 2019-03-31T20:01:18.167 回答