正如 lejlot 所指出的,使用“排名的精确度和召回率”来衡量排名性能是很奇怪的。在您指出的参考论文中,“精度”和“召回”的定义非常“定制” 。
It is a measure of the tradeoff between the precision and
recall of the particular ranking algorithm. Precision is the
accuracy of the ranks i.e. how well the algorithm has
ranked the services according to the user preferences.
Recall is the deviation between the top ranked service and
the next relevant service in the list. Both these metrics are
used together to arrive at the f-measure which then tests the
algorithm efficiency.
可能原作者有一些特定的动机来使用这样的定义。评估排名算法的一些常用指标包括:
- 归一化折扣信息增益或nDCG(用于许多 kaggle 比赛)
- 精度@K,召回@K
本文还列出了一些常见的排名措施。