3

我想PageRank从格式如下的边缘的 CSV 文件中计算:

12,13,1.0
12,14,1.0
12,15,1.0
12,16,1.0
12,17,1.0
...

我的代码:

var filename = "<filename>.csv"

val graph = Graph.fromCsvReader[Long,Double,Double]( 
                   env = env, 
                   pathEdges = filename, 
                   readVertices = false, 
                   hasEdgeValues = true, 
                   vertexValueInitializer = new MapFunction[Long, Double] { 
                           def map(id: Long): Double = 0.0 } )

val ranks = new PageRank[Long](0.85, 20).run(graph)

我从 Flink Scala Shell 收到以下错误:

error: type mismatch;
 found   : org.apache.flink.graph.scala.Graph[Long,_23,_24] where type _24 >: Double with _22, type _23 >: Double with _21
 required: org.apache.flink.graph.Graph[Long,Double,Double]
            val ranks = new PageRank[Long](0.85, 20).run(graph)
                                                         ^

我究竟做错了什么?

(并且每个顶点的初始值 0.0 和每个边的初始值 1.0 是否正确?)

4

1 回答 1

2

问题是您将 Scala 提供org.apache.flink.graph.scala.GraphPageRank.run期望 Java org.apache.flink.graph.Graph

为了运行GraphAlgorithmScala对象,Graph您必须run使用.GraphGraphAlgorithm

graph.run(new PageRank[Long](0.85, 20))

更新

PageRank算法的情况下,重要的是要注意该算法需要一个 type 的实例Graph[K, java.lang.Double, java.lang.Double]。由于 Java 的Double类型不同于 Scala 的Double类型(在类型检查方面),因此必须考虑到这一点。

对于示例代码,这意味着

val graph = Graph.fromCsvReader[Long,java.lang.Double,java.lang.Double]( 
  env = env, 
  pathEdges = filename, 
  readVertices = false, 
  hasEdgeValues = true, 
  vertexValueInitializer = new MapFunction[Long, java.lang.Double] { 
         def map(id: Long): java.lang.Double = 0.0 } )
  .asInstanceOf[Graph[Long, java.lang.Double, java.lang.Double]]
于 2015-11-16T11:30:44.290 回答