python - 如何使用 numpy 数组找到具有最小 MSE 的值？

Question

我可能的价值观是：

0: [0 0 0 0]
1: [1 0 0 0]
2: [1 1 0 0]
3: [1 1 1 0]
4: [1 1 1 1]

我有一些价值观：

[[0.9539342  0.84090066 0.46451256 0.09715253],
 [0.9923432  0.01231235 0.19491441 0.09715253]
 ....

我想弄清楚我的哪些可能值最接近我的新值。理想情况下，我想避免for循环并想知道是否有某种矢量化方法来搜索最小均方误差？

我希望它返回一个如下所示的数组：[2, 1 ....

score 1 · Accepted Answer

您可以使用np.argmin来获取 rmse 值的最低索引，该索引可以使用来计算np.linalg.norm

import numpy as np
a = np.array([[0, 0, 0, 0], [1, 0, 0, 0], [1, 1, 0, 0],[1, 1, 1, 0], [1, 1, 1, 1]])
b = np.array([0.9539342, 0.84090066, 0.46451256, 0.09715253])
np.argmin(np.linalg.norm(a-b, axis=1))
#outputs 2 which corresponds to the value [1, 1, 0, 0]

如编辑中所述， b 可以有多行。操作员想避免 for 循环，但我似乎找不到避免 for 循环的方法。这是一个列表组合方式，但可能有更好的方法

[np.argmin(np.linalg.norm(a-i, axis=1)) for i in b] 
#Outputs [2, 1]

score 1 · Accepted Answer

假设您的输入数据是字典。然后，您可以将 NumPy 用于矢量化解决方案。您首先将输入列表转换为 NumPy 数组，然后使用axis=1参数来获取 RMSE。

# Input data
dicts = {0: [0, 0, 0, 0], 1: [1, 0, 0, 0], 2: [1, 1, 0, 0], 3: [1, 1, 1, 0],4: [1, 1, 1, 1]}
new_value = np.array([0.9539342, 0.84090066, 0.46451256, 0.09715253])

# Convert values to array
values = np.array(list(dicts.values()))

# Compute the RMSE and get the index for the least RMSE 
rmse = np.mean((values-new_value)**2, axis=1)**0.5
index = np.argmin(rmse)    

print ("The closest value is %s" %(values[index]))
# The closest value is [1 1 0 0]

score 0 · Accepted Answer

纯麻木：

val1 = np.array ([
   [0, 0, 0, 0],
   [1, 0, 0, 0],
   [1, 1, 0, 0],
   [1, 1, 1, 0],
   [1, 1, 1, 1]
  ])

print val1
val2 = np.array ([0.9539342, 0.84090066, 0.46451256, 0.09715253], float)
val3 = np.round(val2, 0)
print val3

print np.where((val1 == val3).all(axis=1)) # show a match on row 2 (array([2]),)

python - 如何使用 numpy 数组找到具有最小 MSE 的值？

3 回答 3

Related

Reference