neo4j - 意外的 neo4j 密码查询结果

Question

我想确定与认识特定人的邻居的对话持续时间的相对百分比。

例如，当首先观察节点 A 时，我们必须知道他花了多少时间与他的所有邻居交谈，这是通过以下查询执行的：

neo4j-sh (0)$ start a = node(351061) match (a)-[r:TALKED_TO]->(b) return sum(r.duration)
==> +-----------------+
==> | sum(r.duration) |
==> +-----------------+
==> | 12418           |
==> +-----------------+
==> 1 row, 0 ms

接下来，我们必须检查他的哪些邻居认识特定的人（例如 c），并仅将 b 认识 c 的 a 和 b 之间的对话持续时间相加：

neo4j-sh (0)$ start a = node(351061) match (a)-[r:TALKED_TO]->(b)-[p:KNOWS]->(c) return sum(r.duration)
==> +-----------------+
==> | sum(r.duration) |
==> +-----------------+
==> | 21013           |
==> +-----------------+
==> 1 row, 0 ms

这里似乎不合逻辑的是，第二个总和大于第一个，而第二个应该只是第一个的一部分。有谁知道获得这样的结果可能是什么问题？该错误出现在 15000 个用户中的 7 个用户上。

score 2 · Accepted Answer

您没有在该查询中查看特定的人 C。您正在匹配任何 :KNOWS 关系的所有路径，因此如果您有 a->b->c 和 a->b->d 您在 a->b 之间的持续时间将被计算两次。

您可能需要做的是：

start a = node(351061), c=node(xxxxx) // set c explicitly
match (a)-[r:TALKED_TO]->(b)
where b-[:KNOWS]->c // putting this in the where clause forces you to set C
return sum(r.duration)

这是控制台中的一个示例：http: //console.neo4j.org/r/irm0zy

请记住，这会match扩大和where收紧结果。您也可以使用来执行此操作match，但您需要在中指定 c start。

测试聚合函数在做什么的一个好方法是返回所有命名变量（或设置可以返回的路径）——这样你就可以看到聚合被分成小计。像这样：

start a=node(1) 
match a-[r:TALKED_TO]->b-[:KNOWS]->c 
return sum(r.duration), a,b,c;
+-----------------------------------------------------------------------------------------------+
| sum(r.duration) | a                       | b                       | c                       |
+-----------------------------------------------------------------------------------------------+
| 20              | Node[1]{name:"person1"} | Node[2]{name:"person2"} | Node[4]{name:"person4"} |
| 20              | Node[1]{name:"person1"} | Node[2]{name:"person2"} | Node[3]{name:"person3"} |
| 20              | Node[1]{name:"person1"} | Node[5]{name:"person5"} | Node[6]{name:"person6"} |
+-----------------------------------------------------------------------------------------------+

neo4j - 意外的 neo4j 密码查询结果

1 回答 1

Related

Reference