image - 如何在opencv中获取真实位置的图像像素？

Question

我想检索图像中像素的 rgb。但该位置不是整数位置，而是实数值 (x,y)。我想要一个双线性插值。我怎么能做到opencv？

非常感谢

score 49 · Accepted Answer

亚像素访问没有简单的功能，但我可以建议您几个选项：

使用getRectSubPix并提取 1 个像素区域：

cv::Vec3b getColorSubpix(const cv::Mat& img, cv::Point2f pt)
{
    cv::Mat patch;
    cv::getRectSubPix(img, cv::Size(1,1), pt, patch);
    return patch.at<cv::Vec3b>(0,0);
}

对单像素地图使用更灵活但不太精确的重映射：

cv::Vec3b getColorSubpix(const cv::Mat& img, cv::Point2f pt)
{
    cv::Mat patch;
    cv::remap(img, patch, cv::Mat(1, 1, CV_32FC2, &pt), cv::noArray(),
        cv::INTER_LINEAR, cv::BORDER_REFLECT_101);
    return patch.at<cv::Vec3b>(0,0);
}

自己实现双线性插值，因为它不是火箭科学：

cv::Vec3b getColorSubpix(const cv::Mat& img, cv::Point2f pt)
{
    assert(!img.empty());
    assert(img.channels() == 3);

    int x = (int)pt.x;
    int y = (int)pt.y;

    int x0 = cv::borderInterpolate(x,   img.cols, cv::BORDER_REFLECT_101);
    int x1 = cv::borderInterpolate(x+1, img.cols, cv::BORDER_REFLECT_101);
    int y0 = cv::borderInterpolate(y,   img.rows, cv::BORDER_REFLECT_101);
    int y1 = cv::borderInterpolate(y+1, img.rows, cv::BORDER_REFLECT_101);

    float a = pt.x - (float)x;
    float c = pt.y - (float)y;

    uchar b = (uchar)cvRound((img.at<cv::Vec3b>(y0, x0)[0] * (1.f - a) + img.at<cv::Vec3b>(y0, x1)[0] * a) * (1.f - c)
                           + (img.at<cv::Vec3b>(y1, x0)[0] * (1.f - a) + img.at<cv::Vec3b>(y1, x1)[0] * a) * c);
    uchar g = (uchar)cvRound((img.at<cv::Vec3b>(y0, x0)[1] * (1.f - a) + img.at<cv::Vec3b>(y0, x1)[1] * a) * (1.f - c)
                           + (img.at<cv::Vec3b>(y1, x0)[1] * (1.f - a) + img.at<cv::Vec3b>(y1, x1)[1] * a) * c);
    uchar r = (uchar)cvRound((img.at<cv::Vec3b>(y0, x0)[2] * (1.f - a) + img.at<cv::Vec3b>(y0, x1)[2] * a) * (1.f - c)
                           + (img.at<cv::Vec3b>(y1, x0)[2] * (1.f - a) + img.at<cv::Vec3b>(y1, x1)[2] * a) * c);

    return cv::Vec3b(b, g, r);
}

score 5 · Accepted Answer

不幸的是，我没有足够的分数将其发布为对已接受答案的评论...我调整了代码以适应我自己的问题，该问题需要在单通道浮点矩阵上进行插值。

我想我想知道哪种方法最快。

我实现了 Andrey Kamaev 的回答中的 3 种方法以及一个简单的最近邻（基本上只是四舍五入坐标）。

我用刚刚填充垃圾的矩阵 A(100x100) 进行了实验。然后，我制作了一个矩阵 B(400x400)，其中填充了从 a 中插入的值：B(i,j) = A(i/4, j/4)。

每次运行 1000 次，以下是平均次数：

最近邻居：2.173 毫秒
获取矩形子像素：26.506 毫秒
重映射：114.265 毫秒
手动：5.086 毫秒
无边框手动插值：3.842 毫秒

如果您不太关心实际插值并且只需要一个值 - 特别是如果您的数据变化非常平稳，那么最近的邻居可以获得超高速。对于其他任何事情，我都会使用手动双线性插值，因为它似乎始终比其他方法快。（OpenCV 2.4.9 - Ubuntu 15.10 回购 - 2016 年 2 月）。

如果您知道所有 4 个贡献像素都在矩阵的范围内，那么您可以使其在时间上基本等同于最近邻 - 尽管无论如何差异都可以忽略不计。

score 4 · Accepted Answer

双线性插值仅意味着根据与您正在检查的像素最近的 4 个像素对值进行加权。权重可以如下计算。

cv::Point2f current_pos; //assuming current_pos is where you are in the image

//bilinear interpolation
float dx = current_pos.x-(int)current_pos.x;
float dy = current_pos.y-(int)current_pos.y;

float weight_tl = (1.0 - dx) * (1.0 - dy);
float weight_tr = (dx)       * (1.0 - dy);
float weight_bl = (1.0 - dx) * (dy);
float weight_br = (dx)       * (dy);

您的最终值计算为每个像素与其各自权重的乘积之和

score 2 · Accepted Answer

如果您想重复或一致地执行此操作，使用映射会更有效。另一个优点是选择插值方法以及如何处理边界条件。最后，一些插值函数也在 GPU 上实现。重新映射

image - 如何在opencv中获取真实位置的图像像素？

4 回答 4

Related

Reference