I am trying out a simple movie recommendation machine learning program in Spark. Spark version: 2.1.1. Java version: Java 8. Scala version: Scala code runner version 2.11.7. Environment: Windows 7.

I run these commands to start the master and a worker:

//start master
spark-class org.apache.spark.deploy.master.Master

//start worker
spark-class org.apache.spark.deploy.worker.Worker spark://valid ip:7077

I am trying a very simple movie recommendation example: http://blogs.quovantis.com/recommendation-engine-using-apache-spark/

I have updated the code to:

SparkConf conf = new SparkConf().setAppName("Collaborative Filtering Example").setMaster("spark://valid ip:7077");
conf.setJars(new String[] {"C:\\Spark2.1.1\\spark-2.1.1-bin-hadoop2.7\\jars\\spark-mllib_2.11-2.1.1.jar"});

I cannot run it from IntelliJ. Running mvn clean install and copying the jar to the folder does not work either. The command I use to run it:

bin\spark-submit --verbose –-jars jars\spark-mllib_2.11-2.1.1.jar –-class “com.abc.enterprise.RecommendationEngine” –-master spark://valid ip:7077 C:\Spark2.1.1\spark-2.1.1-bin-hadoop2.7\spark-mllib-example\spark-poc-1.0-SNAPSHOT.jar C:\Spark2.1.1\spark-2.1.1-bin-hadoop2.7\spark-mllib-example\ratings.csv C:\Spark2.1.1\spark-2.1.1-bin-hadoop2.7\spark-mllib-example\movies.csv 10

The error I see is:

C:\Spark2.1.1\spark-2.1.1-bin-hadoop2.7>bin\spark-submit --verbose --class "com.sandc.enterprise.RecommendationEngine" --master spark://10.64.98.101:7077 C:\Spark2.1.1\spark-2.1.1-bin-hadoop2.7\spark-mllib-example\spark-poc-1.0-SNAPSHOT.jar C:\Spark2.1.1\spark-2.1.1-bin-hadoop2.7\spark-mllib-example\ratings.csv C:\Spark2.1.1\spark-2.1.1-bin-hadoop2.7\spark-mllib-example\movies.csv 10
Using properties file: C:\Spark2.1.1\spark-2.1.1-bin-hadoop2.7\bin\..\conf\spark-defaults.conf
Adding default property: spark.serializer=org.apache.spark.serializer.KryoSerializer
Adding default property: spark.executor.extraJavaOptions=-XX:+PrintGCDetails -Dkey=value -Dnumbers="one two three"
Adding default property: spark.eventLog.enabled=true
Adding default property: spark.driver.memory=5g
Adding default property: spark.master=spark://valid ip:7077
Error: Cannot load main class from JAR file:/C:/Spark2.1.1/spark-2.1.1-bin-hadoop2.7/û-class
Run with --help for usage help or --verbose for debug output

If I pass the --jars option, it gives this error:

Error: Cannot load main class from JAR file:/C:/Spark2.1.1/spark-2.1.1-bin-hadoop2.7/û-jars

Any ideas on how to submit this job to Spark?

1 Answer

Is your jar built correctly? Also, you do not need double quotes around the value of the --class option.
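One more thing that may be worth checking, going only by the text pasted in the question: the command there is written with `–-jars`, `–-class` and `–-master`, where the first character looks like an en dash rather than an ASCII hyphen, and the class name is wrapped in curly quotes. That would be consistent with the `û-class` and `û-jars` in the error output, since a mis-encoded dash makes spark-submit treat the option as the application jar path. Retyping the options by hand as `--jars`, `--class` and `--master` is probably the quickest test. As a rough sketch, the hypothetical helper below (the name `ArgSanityCheck` is made up for illustration and is not part of Spark) flags any argument that contains a non-ASCII character, which is how pasted typographic dashes and quotes would show up:

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical helper, not part of Spark: reports arguments containing
// non-ASCII characters such as an en dash (U+2013) or curly quotes.
final class ArgSanityCheck {

    /** Returns one message per argument that contains a non-ASCII character. */
    static List<String> findSuspicious(String[] args) {
        List<String> problems = new ArrayList<>();
        for (int i = 0; i < args.length; i++) {
            String arg = args[i];
            for (int j = 0; j < arg.length(); j++) {
                char c = arg.charAt(j);
                if (c > 127) {
                    problems.add(String.format(
                            "arg %d has U+%04X at index %d", i, (int) c, j));
                    break; // one report per argument is enough
                }
            }
        }
        return problems;
    }

    public static void main(String[] args) {
        List<String> problems = findSuspicious(args);
        if (problems.isEmpty()) {
            System.out.println("All arguments are plain ASCII.");
        } else {
            problems.forEach(System.out::println);
        }
    }
}
```

Running it with the same arguments you give to spark-submit shows which ones were altered by copy and paste. Paths that legitimately contain non-ASCII characters would also be flagged, so treat the output as a hint rather than a verdict.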

Answered 2017-06-22T20:11:59.513